Update README.md
Browse files
README.md
CHANGED
|
@@ -100,12 +100,13 @@ while True:
|
|
| 100 |
## Evaluations
|
| 101 |
The following data has been re-evaluated and calculated as the average for each test.
|
| 102 |
|
| 103 |
-
|
| 104 |
-
|
| 105 |
-
|
| 106 |
-
|
|
| 107 |
-
|
|
| 108 |
-
|
|
| 109 |
-
|
|
|
|
|
| 110 |
|
| 111 |
The script used for evaluation can be found inside this repository under /eval.sh, or click [here](https://huggingface.co/huihui-ai/Qwen2.5-7B-Instruct-abliterated-v2/blob/main/eval.sh)
|
|
|
|
| 100 |
## Evaluations
|
| 101 |
The following data has been re-evaluated and calculated as the average for each test.
|
| 102 |
|
| 103 |
+
|
| 104 |
+
| Model | IF_Eval | BBH | GPQA | MMLU Pro | TruthfulQA |
|
| 105 |
+
|--------------------------------------|-----------------------|-----------|-----------|------------|
|
| 106 |
+
| Qwen2.5-0.5B-Instruct | **33.07** | **33.26** | 26.11 | **17.18** | 45.07 |
|
| 107 |
+
| Qwen2.5-0.5B-Instruct-CensorTune | 16.20 | 32.51 | 25.25 | 17.09 | **45.48** |
|
| 108 |
+
| Qwen2.5-0.5B-Instruct-abliterated-v3 | 33.02 | 32.58 | **26.45** | 16.42 | 39.24 |
|
| 109 |
+
| Qwen2.5-0.5B-Instruct-abliterated-v2 | 32.15 | 32.51 | 26.43 | 16.29 | 39.56 |
|
| 110 |
+
| Qwen2.5-0.5B-Instruct-abliterated-v1 | 32.96 | 32.83 | 26.23 | 16.42 | 45.40 |
|
| 111 |
|
| 112 |
The script used for evaluation can be found inside this repository under /eval.sh, or click [here](https://huggingface.co/huihui-ai/Qwen2.5-7B-Instruct-abliterated-v2/blob/main/eval.sh)
|