Update README.md
README.md
CHANGED
|
@@ -16,6 +16,17 @@ tags:
|
|
| 16 |
This model was converted to GGUF format from [`allenai/OLMo-2-0425-1B-Instruct`](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
| 17 |
Refer to the [original model card](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) for more details on the model.
|
| 18 |
|
| 19 |
## Use with llama.cpp
|
| 20 |
Install llama.cpp through brew (works on Mac and Linux)
|
| 21 |
|
|
|
|
| 16 |
This model was converted to GGUF format from [`allenai/OLMo-2-0425-1B-Instruct`](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
| 17 |
Refer to the [original model card](https://huggingface.co/allenai/OLMo-2-0425-1B-Instruct) for more details on the model.
|
| 18 |
|
| 19 |
+
---
|
| 20 |
+
OLMo 2 1B Instruct April 2025 is a post-trained variant of the allenai/OLMo-2-0425-1B-RLVR1 model, which has undergone supervised finetuning on an OLMo-specific variant of the Tülu 3 dataset, further DPO training on this dataset, and final RLVR training on this dataset.
|
| 21 |
+
Tülu 3 is designed for state-of-the-art performance on a diversity of
|
| 22 |
+
tasks in addition to chat, such as MATH, GSM8K, and IFEval.
|
| 23 |
+
Check out the OLMo 2 paper or Tülu 3 paper for more details!
|
| 24 |
+
|
| 25 |
+
|
| 26 |
+
OLMo is a series of Open Language Models designed to enable the science of language models.
|
| 27 |
+
These models are trained on the Dolma dataset. We are releasing all code, checkpoints, logs, and associated training details.
|
| 28 |
+
|
| 29 |
+
---
|
| 30 |
## Use with llama.cpp
|
| 31 |
Install llama.cpp through brew (works on Mac and Linux)
|
| 32 |
|