huihui-ai
/

Qwen2.5-7B-Instruct-abliterated-v2

Text Generation

text-generation-inference

Model card Files Files and versions

huihui-ai commited on Apr 28

Commit

8a2de0e

·

verified ·

1 Parent(s): 11fdbd4

Update README.md

Files changed (1) hide show

README.md +8 -7

README.md CHANGED Viewed

@@ -100,12 +100,13 @@ while True:
 ## Evaluations
 The following data has been re-evaluated and calculated as the average for each test.
-| Benchmark   | Qwen2.5-7B-Instruct | Qwen2.5-7B-Instruct-abliterated-v2 | Qwen2.5-7B-Instruct-abliterated |
-|-------------|---------------------|------------------------------------|---------------------------------|
-| IF_Eval     | 76.44               | **77.82**                          | 76.49                           |
-| MMLU Pro    | **43.12**           | 42.03                              | 41.71                           |
-| TruthfulQA  | 62.46               | 57.81                              | **64.92**                       |
-| BBH         | **53.92**           | 53.01                              | 52.77                           |
-| GPQA        | 31.91               | **32.17**                          | 31.97                           |
 The script used for evaluation can be found inside this repository under /eval.sh, or click [here](https://huggingface.co/huihui-ai/Qwen2.5-7B-Instruct-abliterated-v2/blob/main/eval.sh)

 ## Evaluations
 The following data has been re-evaluated and calculated as the average for each test.
+| Model                              |  IF_Eval  | BBH       | GPQA      | MMLU Pro  | TruthfulQA |
+|--------------------------------------|-----------------------|-----------|-----------|------------|
+| Qwen2.5-0.5B-Instruct                | **33.07** | **33.26** | 26.11     | **17.18** | 45.07      |
+| Qwen2.5-0.5B-Instruct-CensorTune     | 16.20     | 32.51     | 25.25     | 17.09     | **45.48**  |
+| Qwen2.5-0.5B-Instruct-abliterated-v3 | 33.02     | 32.58     | **26.45** | 16.42     | 39.24      |
+| Qwen2.5-0.5B-Instruct-abliterated-v2 | 32.15     | 32.51     | 26.43     | 16.29     | 39.56      |
+| Qwen2.5-0.5B-Instruct-abliterated-v1 | 32.96     | 32.83     | 26.23     | 16.42     | 45.40      |
 The script used for evaluation can be found inside this repository under /eval.sh, or click [here](https://huggingface.co/huihui-ai/Qwen2.5-7B-Instruct-abliterated-v2/blob/main/eval.sh)