Update README.md
README.md CHANGED
@@ -193,13 +193,4 @@ print("content:", content)
 messages.append(
     {"role": "assistant", "content": output_text}
 )
-```
-
-# Running the model on a CPU
-
-This repo contains gguf versions of `sarvam-m` in both bf16 and q8 precisions. You can use the model on your local machine (without gpu) as explained [here](https://github.com/ggml-org/llama.cpp/tree/master/tools/main).
-
-Example Command:
-```
-./build/bin/llama-cli -i -m /your/folder/path/sarvam-m-q8_0.gguf -c 8192 -t 16
 ```
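
For reference, the removed example command used llama.cpp's standard `llama-cli` flags: `-i` starts interactive chat mode, `-m` points at the gguf model file, `-c 8192` sets the context window size, and `-t 16` sets the number of CPU threads.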
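
The retained lines (new 193-196) close out a multi-turn chat example: the assistant's reply is appended to `messages` so the conversation history carries into the next turn. Below is a minimal sketch of that pattern, assuming a standard transformers chat loop; the model id, prompt, and generation settings are illustrative guesses, and only the names `messages`, `output_text`, and `content` come from the diff itself.

```python
# Minimal sketch of the multi-turn loop the retained lines belong to.
# Model id, prompt, and generation settings are assumptions, not from the diff.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "sarvamai/sarvam-m"  # assumed Hugging Face repo id
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

messages = [{"role": "user", "content": "Who are you?"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

generated = model.generate(input_ids, max_new_tokens=256)
# Decode only the newly generated tokens, not the prompt.
output_text = tokenizer.decode(
    generated[0][input_ids.shape[-1]:], skip_special_tokens=True
)
content = output_text  # the full README may post-process this further
print("content:", content)

# Keep the assistant turn in the history so the next call
# sees the whole conversation.
messages.append(
    {"role": "assistant", "content": output_text}
)
```

From here, appending the next user message to `messages` and repeating the generate/decode steps continues the conversation.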