The table will scroll to the left
| Board | Result | Attempted Score | Coverage | Place in the rating |
|---|---|---|---|---|
| Multi | 0.18 | 0.269 | 0.667 | 19 |
| Images | 0.186 | 0.186 | 1 | 27 |
| Video | 0.575 | 0.575 | 1 | 11 |
The table will scroll to the left
| Task | Modality | Result | Metric |
|---|---|---|---|
| WEIRD | 0.123 |
EM
JudgeScore
|
|
| RealVQA | 0.283 |
EM
JudgeScore
|
|
| ruCLEVR | 0.234 |
EM
JudgeScore
|
|
| LabTabVQA | 0.037 |
EM
JudgeScore
|
|
| ruMathVQA | 0.07 |
EM
JudgeScore
|
|
| RealVideoQA | 0.647 |
EM
JudgeScore
|
|
| ruCommonVQA | 0.41 |
EM
JudgeScore
|
|
| ruHHH-Image | 0.101 |
EM
JudgeScore
|
|
| ruHHH-Video | 0.512 |
EM
JudgeScore
|
|
| ruTiE-Image | 0.401 |
EM
JudgeScore
|
|
| CommonVideoQA | 0.567 |
EM
JudgeScore
|
|
| UniScienceVQA | 0.16 |
EM
JudgeScore
|
|
| culture | 0.077 / 0.162 | ||
| business | 0.128 / 0.281 | ||
| medicine | 0.096 / 0.212 | ||
| social_sciences | 0.134 / 0.302 | ||
| fundamental_sciences | 0.101 / 0.179 | ||
| applied_sciences | 0.136 / 0.273 | ||
| SchoolScienceVQA | 0.172 |
EM
JudgeScore
|
|
| biology | 0.156 / 0.216 | ||
| chemistry | 0.15 / 0.192 | ||
| physics | 0.227 / 0.312 | ||
| economics | 0.162 / 0.2 | ||
| ru | 0.099 / 0.134 | ||
| all | 0.101 / 0.175 | ||
| ruNaturalScienceVQA | 0.055 |
EM
JudgeScore
|
|
| biology | 0 / 0.053 | ||
| chemistry | 0.104 / 0.179 | ||
| physics | 0.02 / 0.071 | ||
| science | 0 / 0 | ||