Update README.md
README.md
CHANGED
@@ -62,7 +62,7 @@ print(out[0]["generated_text"])
 
 ---
 
-## 📊 Results
+## 📊 Results
 
 *Hala models are placed at the end of each size category; best **Average** per category is in bold.*
 
@@ -99,7 +99,7 @@ print(out[0]["generated_text"])
 | 7B–9B | QCRI/Fanar-1-9B-Instruct | 9B | 76.4 | 65.8 | 52.7 | 73.3 | 88.3 | 58.6 | 69.2 |
 | 7B–9B | **Hala‑9B** | 9B | 78.3 | 65.6 | 53.8 | 70.4 | 89.6 | 61.4 | **69.9** |
 
-> **Evaluation protocol**: `lighteval` on **ArabicMMLU (OALL‑2)
+> **Evaluation protocol**: `lighteval` on **ArabicMMLU (OALL‑2)** excluding AlRage.
 
 ---
 
@@ -108,11 +108,10 @@ print(out[0]["generated_text"])
 If you find **Hala** useful, please cite:
 
 ```bibtex
-@
-
-
-
-
-note = {KAUST},
+@misc{hammoud2025halatechnicalreportbuilding,
+title={Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale},
+author={Hasan Abed Al Kader Hammoud and Mohammad Zbeeb and Bernard Ghanem},
+year={2025},
+url={https://arxiv.org/abs/2509.14008},
 }
 ```