fearlessdots/Llama-3-Alpha-Centauri-v0.1-LoRA

Text Generation

text-generation-inference

fearlessdots committed on May 25, 2024

Commit 80592df · verified · 1 Parent(s): 3edc5a8

Update README.md

Files changed (1)

README.md +8 -0

@@ -30,6 +30,13 @@ This model and its related LoRA was fine-tuned on [https://huggingface.co/failsp
 ## Fine Tuning
+### - Quantization Configuration
+- load_in_4bit=True
+- bnb_4bit_quant_type="fp4"
+- bnb_4bit_compute_dtype=compute_dtype
+- bnb_4bit_use_double_quant=False
 ### - PEFT Parameters
 - lora_alpha=64
@@ -58,6 +65,7 @@ This model and its related LoRA was fine-tuned on [https://huggingface.co/failsp
 ## Credits
 - Meta ([https://huggingface.co/meta-llama](https://huggingface.co/meta-llama)): for the original Llama-3;
+- HuggingFace: for hosting this model and for creating the fine-tuning tools;
 - failspy ([https://huggingface.co/failspy](https://huggingface.co/failspy)): for the base model and the orthogonalization implementation;
 - NobodyExistsOnTheInternet ([https://huggingface.co/NobodyExistsOnTheInternet](https://huggingface.co/NobodyExistsOnTheInternet)): for the incredible dataset;
 - Undi95 ([https://huggingface.co/Undi95](https://huggingface.co/Undi95)) and Sao10k ([https://huggingface.co/Sao10K](https://huggingface.co/Sao10K)): my main inspirations for doing these models =]
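
The quantization settings added under "Quantization Configuration" in this commit look like keyword arguments to `BitsAndBytesConfig` from the `transformers` library. The following is a minimal sketch of that mapping, not the author's training script. The commit does not say what `compute_dtype` resolves to, so the sketch leaves it as a caller-supplied argument. `QUANT_KWARGS` and `build_quant_config` are hypothetical names introduced here for illustration.

```python
# Values copied from the "Quantization Configuration" list in the README diff.
# bnb_4bit_compute_dtype is deliberately left out: the README only shows the
# placeholder name `compute_dtype`, not its value.
QUANT_KWARGS = {
    "load_in_4bit": True,
    "bnb_4bit_quant_type": "fp4",
    "bnb_4bit_use_double_quant": False,
}


def build_quant_config(compute_dtype):
    """Build a BitsAndBytesConfig from the README values.

    `compute_dtype` is an assumption supplied by the caller, for example
    torch.float16 or torch.bfloat16. Requires `transformers` and `torch`
    to be installed.
    """
    # Imported lazily so the settings dict can be inspected without
    # the heavy dependencies being present.
    from transformers import BitsAndBytesConfig

    return BitsAndBytesConfig(
        bnb_4bit_compute_dtype=compute_dtype,
        **QUANT_KWARGS,
    )
```

The resulting object would typically be passed as `quantization_config=` to `AutoModelForCausalLM.from_pretrained` when loading the base model in 4-bit precision before attaching the LoRA adapter.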