cgus committed
Commit cb76c84 · verified · 1 Parent(s): 9415930

Update README.md

Files changed (1):
  1. README.md +18 -1
README.md CHANGED
@@ -1,8 +1,25 @@
  ---
  base_model:
- - SillyTilly/ServiceNow-AI-Apriel-Nemotron-15b-Thinker-Chatml
+ - TheDrummer/Snowpiercer-15B-v2
  license: mit
  ---
+ # Snowpiercer-15B-v2-exl2
+ Original model: [Snowpiercer-15B-v2](https://huggingface.co/TheDrummer/Snowpiercer-15B-v2) by [TheDrummer](https://huggingface.co/TheDrummer)
+ Based on: [Apriel-Nemotron-15b-Thinker](https://huggingface.co/ServiceNow-AI/Apriel-Nemotron-15b-Thinker) by [ServiceNow-AI](https://huggingface.co/ServiceNow-AI)
+
+ ## Quants
+ [4bpw h6 (main)](https://huggingface.co/cgus/Snowpiercer-15B-v2-exl2/tree/main)
+ [4.5bpw h6](https://huggingface.co/cgus/Snowpiercer-15B-v2-exl2/tree/4.5bpw-h6)
+ [5bpw h6](https://huggingface.co/cgus/Snowpiercer-15B-v2-exl2/tree/5bpw-h6)
+ [6bpw h6](https://huggingface.co/cgus/Snowpiercer-15B-v2-exl2/tree/6bpw-h6)
+ [8bpw h8](https://huggingface.co/cgus/Snowpiercer-15B-v2-exl2/tree/8bpw-h8)
+
+ ## Quantization notes
+ Made with Exllamav2 0.3.1 using the default calibration dataset.
+ These quants require an RTX GPU on Windows, or an RTX/ROCm GPU on Linux, and can be used with TabbyAPI or Text-Generation-WebUI.
+ They must be fully loaded into your GPU's VRAM; if you have to rely on system RAM, GGUF quants are a better choice.
+
+ # Original model card
  # Join our Discord! https://discord.gg/BeaverAI
  ## More than 6000 members of helpful, LLM enthusiasts! A hub for players and makers alike!
  ---
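
As a practical complement to the Quantization notes in the diff above, here is a minimal sketch of how one of the quant branches could be downloaded and run with ExLlamaV2's Python API. It is not part of the repository: the branch choice, prompt, and output length are arbitrary, and the ExLlamaV2 calls follow the library's dynamic-generator example, which may shift between versions. TabbyAPI and Text-Generation-WebUI handle this loading for you.

```python
# Minimal sketch: fetch one exl2 branch from the Quants list and generate with ExLlamaV2.
# Assumes the huggingface_hub and exllamav2 packages are installed and a supported GPU is present.
from huggingface_hub import snapshot_download
from exllamav2 import ExLlamaV2, ExLlamaV2Cache, ExLlamaV2Config, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2DynamicGenerator

# Download a single branch; "6bpw-h6" is an arbitrary pick from the Quants section.
model_dir = snapshot_download(
    repo_id="cgus/Snowpiercer-15B-v2-exl2",
    revision="6bpw-h6",
)

# Load the model fully into VRAM, as the Quantization notes require.
config = ExLlamaV2Config(model_dir)
model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)  # cache is allocated while the model autosplits across GPUs
model.load_autosplit(cache)
tokenizer = ExLlamaV2Tokenizer(config)

# Run a short generation to confirm the quant works.
generator = ExLlamaV2DynamicGenerator(model=model, cache=cache, tokenizer=tokenizer)
print(generator.generate(prompt="Write a two-sentence greeting.", max_new_tokens=64))
```

Whichever frontend you use, pick the largest bpw branch that still fits entirely in VRAM, since these quants cannot be partially offloaded to system RAM.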