Update README.md
Browse files
README.md
CHANGED
|
@@ -196,18 +196,32 @@ We rely on [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-h
|
|
| 196 |
<details>
|
| 197 |
<summary> Reproduce Model Quality Results </summary>
|
| 198 |
|
|
|
|
| 199 |
Need to install lm-eval from source:
|
| 200 |
https://github.com/EleutherAI/lm-evaluation-harness#install
|
| 201 |
|
| 202 |
-
## baseline
|
| 203 |
```Shell
|
|
|
|
| 204 |
lm_eval --model hf --model_args pretrained=google/gemma-3-12b-it --tasks mmlu --device cuda:0 --batch_size 8
|
| 205 |
```
|
| 206 |
|
| 207 |
-
##
|
|
|
|
|
|
|
|
|
|
| 208 |
```Shell
|
| 209 |
-
|
| 210 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 211 |
```
|
| 212 |
</details>
|
| 213 |
|
|
|
|
| 196 |
<details>
|
| 197 |
<summary> Reproduce Model Quality Results </summary>
|
| 198 |
|
| 199 |
+
## language eval
|
| 200 |
Need to install lm-eval from source:
|
| 201 |
https://github.com/EleutherAI/lm-evaluation-harness#install
|
| 202 |
|
|
|
|
| 203 |
```Shell
|
| 204 |
+
export MODEL=google/gemma-3-12b-it # or pytorch/gemma-3-12b-it-INT4
|
| 205 |
lm_eval --model hf --model_args pretrained=google/gemma-3-12b-it --tasks mmlu --device cuda:0 --batch_size 8
|
| 206 |
```
|
| 207 |
|
| 208 |
+
## multi-modal eval
|
| 209 |
+
Need to install lmms-eval from source:
|
| 210 |
+
`pip install git+https://github.com/EvolvingLMMs-Lab/lmms-eval.git`
|
| 211 |
+
|
| 212 |
```Shell
|
| 213 |
+
NUM_PROCESSES=8
|
| 214 |
+
MAIN_PORT=12345
|
| 215 |
+
MODEL_ID=google/gemma-3-12b-it # or pytorch/gemma-3-12b-it-INT4
|
| 216 |
+
TASKS=chartqa # or tasks from https://github.com/EvolvingLMMs-Lab/lmms-eval/tree/main/lmms_eval/models/simple
|
| 217 |
+
BATCH_SIZE=32
|
| 218 |
+
OUTPUT_PATH=./logs/
|
| 219 |
+
|
| 220 |
+
accelerate launch --num_processes "${NUM_PROCESSES}" --main_process_port "${MAIN_PORT}" -m lmms_eval \
|
| 221 |
+
--model gemma3 \
|
| 222 |
+
--model_args "pretrained=${MODEL_ID}" \
|
| 223 |
+
--tasks "${TASKS}" \
|
| 224 |
+
--batch_size "${BATCH_SIZE}" --output_path "${OUTPUT_PATH}"
|
| 225 |
```
|
| 226 |
</details>
|
| 227 |
|