Qwen2-VL-2B-Instruct

MERA Created at 22.01.2026 05:08

Ratings for leaderboard tasks

The table will scroll to the left

Board	Result	Attempted Score	Coverage	Place in the rating
Multi	0.116	0.174	0.667	28
Images	0.165	0.165	1	33
Video	0.205	0.205	1	31

Tasks

The table will scroll to the left

Task	Result	Metric
WEIRD	0.2	EM JudgeScore
RealVQA	0.168	EM JudgeScore
ruCLEVR	0.22	EM JudgeScore
LabTabVQA	0.047	EM JudgeScore
ruMathVQA	0.045	EM JudgeScore
RealVideoQA	0.271	EM JudgeScore
ruCommonVQA	0.278	EM JudgeScore
ruHHH-Image	0.058	EM JudgeScore
ruHHH-Video	0.125	EM JudgeScore
ruTiE-Image	0.298	EM JudgeScore
CommonVideoQA	0.219	EM JudgeScore
UniScienceVQA	0.112	EM JudgeScore
culture	0.081 / 0.13
business	0.101 / 0.17
medicine	0.063 / 0.131
social_sciences	0.136 / 0.217
fundamental_sciences	0.061 / 0.12
applied_sciences	0.107 / 0.177
SchoolScienceVQA	0.207	EM JudgeScore
biology	0.043 / 0.425
chemistry	0.03 / 0.335
physics	0.029 / 0.414
economics	0.038 / 0.321
ru	0.039 / 0.323
all	0.02 / 0.428
ruNaturalScienceVQA	0.183	EM JudgeScore
biology	0 / 0.158
chemistry	0.075 / 0.209
physics	0.045 / 0.389
science	0.049 / 0.415

Information about the submission

Mera version

v1.0.0

Torch Version

2.8.0

The version of the codebase

7e640aa

CUDA version

12.8

Precision of the model weights

bfloat16

Seed

1234

Batch

Transformers version

4.57.1

The number of GPUs and their type

1 x NVIDIA A100-SXM4-80GB

Architecture

openai-chat-completions

Team:

MERA

Name of the ML model:

Qwen2-VL-2B-Instruct

Link to the ML model:

https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct

Model size

2.0B

Model type:

Opened

SFT

Inference parameters

Generation Parameters:
labtabvqa - until=["\n\n"];do_sample=false;temperature=0; \nrunaturalsciencevqa_biology - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=64; \nrunaturalsciencevqa_chemistry - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=64; \nrunaturalsciencevqa_earth_science - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=64; \nrunaturalsciencevqa_physics - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=64; \nruclevr - until=["\n\n"];do_sample=false;temperature=0; \nruhhh_image - until=["\n\n"];do_sample=false;temperature=0; \nrucommonvqa - until=["\n\n"];do_sample=false;temperature=0; \nrealvqa - until=["\n\n"];do_sample=false;temperature=0; \nrumathvqa - until=["\n\n"];do_sample=false;temperature=0; \nweird - until=["\n\n"];do_sample=false;temperature=0; \nunisciencevqa_applied_sciences - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nunisciencevqa_business - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nunisciencevqa_cultural_studies - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nunisciencevqa_fundamental_sciences - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nunisciencevqa_health_and_medicine - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nunisciencevqa_social_sciences - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nschoolsciencevqa_biology - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nschoolsciencevqa_chemistry - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nschoolsciencevqa_earth_science - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nschoolsciencevqa_economics - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nschoolsciencevqa_history_all - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nschoolsciencevqa_history_ru - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nschoolsciencevqa_physics - until=["<|endoftext|>"];temperature=0;do_sample=false;max_gen_toks=256; \nrealvideoqa - until=["\n\n"];do_sample=false;temperature=0; \nruhhh_video - until=["\n\n"];do_sample=false;temperature=0; \ncommonvideoqa - until=["\n\n"];do_sample=false;temperature=0;

The size of the context:
32768

Qwen2-VL-2B-Instruct

Ratings for leaderboard tasks

Tasks

Information about the submission

Team:

Name of the ML model:

Link to the ML model:

Model size

Model type:

Inference parameters

Confirm the deletion of the sub