huihui-ai commited on
Commit
8a2de0e
·
verified ·
1 Parent(s): 11fdbd4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -7
README.md CHANGED
@@ -100,12 +100,13 @@ while True:
100
  ## Evaluations
101
  The following data has been re-evaluated and calculated as the average for each test.
102
 
103
- | Benchmark | Qwen2.5-7B-Instruct | Qwen2.5-7B-Instruct-abliterated-v2 | Qwen2.5-7B-Instruct-abliterated |
104
- |-------------|---------------------|------------------------------------|---------------------------------|
105
- | IF_Eval | 76.44 | **77.82** | 76.49 |
106
- | MMLU Pro | **43.12** | 42.03 | 41.71 |
107
- | TruthfulQA | 62.46 | 57.81 | **64.92** |
108
- | BBH | **53.92** | 53.01 | 52.77 |
109
- | GPQA | 31.91 | **32.17** | 31.97 |
 
110
 
111
  The script used for evaluation can be found inside this repository under /eval.sh, or click [here](https://huggingface.co/huihui-ai/Qwen2.5-7B-Instruct-abliterated-v2/blob/main/eval.sh)
 
100
  ## Evaluations
101
  The following data has been re-evaluated and calculated as the average for each test.
102
 
103
+
104
+ | Model | IF_Eval | BBH | GPQA | MMLU Pro | TruthfulQA |
105
+ |--------------------------------------|-----------------------|-----------|-----------|------------|
106
+ | Qwen2.5-0.5B-Instruct | **33.07** | **33.26** | 26.11 | **17.18** | 45.07 |
107
+ | Qwen2.5-0.5B-Instruct-CensorTune | 16.20 | 32.51 | 25.25 | 17.09 | **45.48** |
108
+ | Qwen2.5-0.5B-Instruct-abliterated-v3 | 33.02 | 32.58 | **26.45** | 16.42 | 39.24 |
109
+ | Qwen2.5-0.5B-Instruct-abliterated-v2 | 32.15 | 32.51 | 26.43 | 16.29 | 39.56 |
110
+ | Qwen2.5-0.5B-Instruct-abliterated-v1 | 32.96 | 32.83 | 26.23 | 16.42 | 45.40 |
111
 
112
  The script used for evaluation can be found inside this repository under /eval.sh, or click [here](https://huggingface.co/huihui-ai/Qwen2.5-7B-Instruct-abliterated-v2/blob/main/eval.sh)