neody/riva-translate-4b-instruct-gptq-int8

8-bit precision

googlefan committed 6 days ago

Commit 7fc7947 · verified · 1 Parent(s): a864a53

Update README.md

Files changed (1)

README.md +2 -0

@@ -25,6 +25,8 @@ tags:
 ---
 Quantized using gptq with random articles sampled from finewiki
+Somehow broken, use [neody/riva-translate-4b-instruct-gptq-int8-w64](https://huggingface.co/neody/riva-translate-4b-instruct-gptq-int8-w64)
 # Below is the original README
 # Riva-Translate-4B-Instruct