Update README.md
README.md
CHANGED
@@ -94,25 +94,28 @@ This model can also conduct in-depth analysis of AAAI's official website and ide
 
 ## Evaluation
 
+**MultiModal Benchmark**
 
+\* Training set observed.
 
-
+| MathVista<br>(testmini) | MMB<br>(dev/test) | MMB−CN<br>(dev/test) | MMMU<br>(val/test) | CMMMU<br>(val/test) | MMVP | MME | POPE | Tiny LVLM | SEEDv1<br>(image) | LLaVA Wild | MM−Vet |
+| ----------------------- | ----------------- | -------------------- | ---------------------------------------------------------------------------------- | ------------------- | ---- | -------------- | ---- | --------- | ----------------- | ---------- | ------ |
+| 34.5 | 76.7 / 75.4 | 71.9 / 70.3 | 39.1 / 35.3 | 34.8 / 34.0 | 44.7 | 1675.1 / 348.6 | 87.1 | 343.2 | 73.2 | 73.2 | 46.7 |
+
+**Image Captioning & Visual Question Answering**
 
-
-| -------------- | ---------------------- | ------------------------- | ---- | ----------------------- | ------------------------ | ------------------- | --------------------- |
-| 1672.3 / 341.1 | 76.6 / 75.4 | 71.5 / 70.1 | 87.2 | 39.1 / 35.3 | 34.8 / 34.0 | 344.5 | 76.3 |
+\* Training set observed.
 
-
+| COCO<br>(test) | Flickr30K<br>(test) | NoCaps<br>(val) | VQAv2<br>(testdev) | OKVQA<br>(val) | TextVQA<br>(val) | VizWiz<br>(val/test) | AI2D<br>(test) | GQA<br>(test) | ScienceQA<br>(image) |
+| -------------- | ------------------- | --------------- | ------------------ | -------------- | ---------------- | -------------------- | -------------- | ------------- | -------------------- |
+| 142.2\* | 85.3 | 120.8 | 80.9\* | 64.1\* | 65.9 | 59.0 / 57.3 | 70.3\* | 62.5\* | 90.1\* |
 
-
-| -------------------- | ------------------- | --------------------- | ------------------------- | ------------------- | ------------------ | ------------------ |
-| 80.9 | 64.2 | 65.8 | 58.3 / 57.3 | 70.23 | 62.4 | 91.2 |
+**Visual Grounding**
 
-
+| RefCOCO<br>(val) | RefCOCO<br>(testA) | RefCOCO<br>(testB) | RefCOCO+<br>(val) | RefCOCO+<br>(testA) | RefCOCO+<br>(testB) | RefCOCO−g<br>(val) | RefCOCO−g<br>(test) |
+| ---------------- | ------------------ | ------------------ | ----------------- | ------------------- | ------------------- | ------------------ | ------------------- |
+| | | | | | | | |
 
-| COCO<sub>test</sub> | Flickr30K<sub>test</sub> | NoCaps<sub>val</sub> |
-| ------------------- | ------------------------ | -------------------- |
-| 141.8 | 84.3 | 120.4 |
 
 
 ## Citation