| Board | Result | Attempted Score | Coverage | Rank |
|---|---|---|---|---|
| Multi | 0.037 | 0.122 | 0.303 | 65 |
| Images | 0.111 | 0.122 | 0.909 | 43 |
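The board figures above appear consistent with Result being roughly Attempted Score multiplied by Coverage. The source does not state this formula, so the sketch below treats it as an assumption and only checks it against the two rows shown; `implied_result` is a hypothetical helper name.

```python
# Values copied from the board table above.
boards = {
    "Multi": {"result": 0.037, "attempted": 0.122, "coverage": 0.303},
    "Images": {"result": 0.111, "attempted": 0.122, "coverage": 0.909},
}


def implied_result(attempted: float, coverage: float) -> float:
    """Assumed relationship (not stated by the source): attempted * coverage."""
    return round(attempted * coverage, 3)


for name, row in boards.items():
    implied = implied_result(row["attempted"], row["coverage"])
    # Print the reported value next to the value implied by the assumption.
    print(f"{name}: reported={row['result']}, implied={implied}")
```

If the relationship holds, it suggests the gap between the two boards comes from coverage rather than from per-task quality, since both rows share the same Attempted Score.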
Results by task:

| Task | Modality | Result | Metric |
|---|---|---|---|
| WEIRD | | 0.269 | EM, JudgeScore |
| RealVQA | | 0.132 | EM, JudgeScore |
| ruCLEVR | | 0.156 | EM, JudgeScore |
| LabTabVQA | | 0.05 | EM, JudgeScore |
| ruMathVQA | | 0.003 | EM, JudgeScore |
| ruCommonVQA | | 0.223 | EM, JudgeScore |
| ruHHH-Image | | 0.076 | EM, JudgeScore |
| UniScienceVQA | | 0.044 | EM, JudgeScore |
| UniScienceVQA: culture | | 0 / 0.073 | |
| UniScienceVQA: business | | 0.002 / 0.11 | |
| UniScienceVQA: medicine | | 0.003 / 0.086 | |
| UniScienceVQA: social_sciences | | 0.007 / 0.136 | |
| UniScienceVQA: fundamental_sciences | | 0 / 0.066 | |
| UniScienceVQA: applied_sciences | | 0 / 0.102 | |
| SchoolScienceVQA | | 0.098 | EM, JudgeScore |
| SchoolScienceVQA: biology | | 0.003 / 0.239 | |
| SchoolScienceVQA: chemistry | | 0.002 / 0.175 | |
| SchoolScienceVQA: physics | | 0.003 / 0.251 | |
| SchoolScienceVQA: economics | | 0.01 / 0.195 | |
| SchoolScienceVQA: ru | | 0.008 / 0.144 | |
| SchoolScienceVQA: all | | 0 / 0.156 | |
| ruNaturalScienceVQA | | 0.168 | EM, JudgeScore |
| ruNaturalScienceVQA: biology | | 0 / 0.228 | |
| ruNaturalScienceVQA: chemistry | | 0 / 0.299 | |
| ruNaturalScienceVQA: physics | | 0.005 / 0.359 | |
| ruNaturalScienceVQA: science | | 0 / 0.415 | |
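As a sanity check on the task-level numbers, the unweighted mean of the ten top-level task results rounds to 0.122, which matches the Attempted Score reported for the boards. The source does not say how the score is aggregated, so the averaging rule in this sketch is an assumption, and `task_results` and `mean_result` are illustrative names.

```python
# Top-level task results copied from the per-task table (sub-domain rows excluded).
task_results = {
    "WEIRD": 0.269,
    "RealVQA": 0.132,
    "ruCLEVR": 0.156,
    "LabTabVQA": 0.05,
    "ruMathVQA": 0.003,
    "ruCommonVQA": 0.223,
    "ruHHH-Image": 0.076,
    "UniScienceVQA": 0.044,
    "SchoolScienceVQA": 0.098,
    "ruNaturalScienceVQA": 0.168,
}

# Assumption: the aggregate is a plain unweighted mean over tasks.
mean_result = round(sum(task_results.values()) / len(task_results), 3)
print(f"tasks={len(task_results)}, mean={mean_result}")
```

The match does not prove the aggregation rule, but it is a quick way to confirm that the per-task rows were transcribed consistently with the board summary.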