Triangle104
/

OLMo-2-0425-1B-Instruct-Q4_K_S-GGUF

Triangle104 committed on May 10

Commit

c0d86af

verified

1 Parent(s): 8d47aa6

Update README.md

Files changed (1)

README.md +11 -0


@@ -16,6 +16,17 @@ tags:
 This model was converted to GGUF format from [`allenai/OLMo-2-0425-1B-Instruct`](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) for more details on the model.
+---
+OLMo 2 1B Instruct April 2025 is a post-trained variant of the allenai/OLMo-2-0425-1B-RLVR1 model, which has undergone supervised finetuning on an OLMo-specific variant of the Tülu 3 dataset, further DPO training on this dataset, and final RLVR training on this dataset.
+Tülu 3 is designed for state-of-the-art performance on a diverse range of
+tasks beyond chat, such as MATH, GSM8K, and IFEval.
+Check out the OLMo 2 paper or Tülu 3 paper for more details!
+OLMo is a series of Open Language Models designed to enable the science of language models.
+These models are trained on the Dolma dataset. We are releasing all code, checkpoints, logs, and associated training details.
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)