Tags: Safetensors, mistral, vllm, 8-bit precision, gptq
Commit 7fc7947 by googlefan (verified) · 1 parent: a864a53

Update README.md

Files changed (1): README.md (+2 -0)
@@ -25,6 +25,8 @@ tags:
 ---
 Quantized using gptq with random articles sampled from finewiki
 
+Somehow broken, use [neody/riva-translate-4b-instruct-gptq-int8-w64](https://huggingface.co/neody/riva-translate-4b-instruct-gptq-int8-w64)
+
 # Below is the original README
 
 # Riva-Translate-4B-Instruct