Update README.md
Browse files
README.md
CHANGED
|
@@ -8,9 +8,9 @@ base_model:
|
|
| 8 |
---
|
| 9 |
## SmallThinker-4BA0.6B-Instruct-GGUF
|
| 10 |
|
| 11 |
-
- GGUF models with `.gguf` suffix can used with [*llama.cpp*](https://github.com/ggml-org/llama.cpp)
|
| 12 |
|
| 13 |
-
- GGUF models with `.powerinfer.gguf` suffix are integrated with fused sparse FFN operators and sparse LM head operators. These models are only compatible to [*powerinfer*](https://github.com/SJTU-IPADS/PowerInfer/tree/main/smallthinker)
|
| 14 |
|
| 15 |
|
| 16 |
## Introduction
|
|
|
|
| 8 |
---
|
| 9 |
## SmallThinker-4BA0.6B-Instruct-GGUF
|
| 10 |
|
| 11 |
+
- GGUF models with `.gguf` suffix can used with [*llama.cpp*](https://github.com/ggml-org/llama.cpp) framework.
|
| 12 |
|
| 13 |
+
- GGUF models with `.powerinfer.gguf` suffix are integrated with fused sparse FFN operators and sparse LM head operators. These models are only compatible to [*powerinfer*](https://github.com/SJTU-IPADS/PowerInfer/tree/main/smallthinker) framework.
|
| 14 |
|
| 15 |
|
| 16 |
## Introduction
|