PowerInfer
/

SmallThinker-4BA0.6B-Instruct-GGUF

Text Generation

Model card Files Files and versions

yixinsong commited on Aug 5

Commit

b6ca58d

·

verified ·

1 Parent(s): 411737b

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -8,9 +8,9 @@ base_model:
 ---
 ## SmallThinker-4BA0.6B-Instruct-GGUF
-- GGUF models with `.gguf` suffix can used with [*llama.cpp*](https://github.com/ggml-org/llama.cpp) framwork.
-- GGUF models with `.powerinfer.gguf` suffix are integrated with fused sparse FFN operators and sparse LM head operators. These models are only compatible to [*powerinfer*](https://github.com/SJTU-IPADS/PowerInfer/tree/main/smallthinker) framwork.
 ## Introduction

 ---
 ## SmallThinker-4BA0.6B-Instruct-GGUF
+- GGUF models with `.gguf` suffix can used with [*llama.cpp*](https://github.com/ggml-org/llama.cpp) framework.
+- GGUF models with `.powerinfer.gguf` suffix are integrated with fused sparse FFN operators and sparse LM head operators. These models are only compatible to [*powerinfer*](https://github.com/SJTU-IPADS/PowerInfer/tree/main/smallthinker) framework.
 ## Introduction