Claude Opus 4.6

MERA Created at 26.02.2026 08:21
0.472
The overall result

Ratings for leaderboard tasks

The table will scroll to the left

Task name Result Metric
YABLoCo 0.13 / 0.149
EM pass@k
stRuCom 0.318
chrF
RealCode 0.501 / 0.943
pass@k execution_success
UnitTests 0.367
CodeBLEU
ruCodeEval 0.245 / 0.29 / 0.299
pass@k
JavaTestGen 0.405 / 0.639
pass@k compile@1
ruHumanEval 0.462 / 0.507 / 0.518
pass@k
RealCodeJava 0.433 / 0.966
pass@k execution_success
CodeLinterEval 0.973 / 0.973 / 0.973
pass@k
ruCodeReviewer 0.046 / 0.183 / 0.176 / 0.38 / 0.467
chrF BLEU judge@1 judge@5 judge@10
CodeCorrectness 0.912
EM

Information about the submission

Mera version
v1.0.0
Torch Version
2.9.0
The version of the codebase
7030e16
CUDA version
12.8
Precision of the model weights
auto
Seed
1234
Batch
1
Transformers version
4.57.1
The number of GPUs and their type
1 x NVIDIA A100-SXM4-80GB
Architecture
openai-chat-completions

Team:

MERA

Name of the ML model:

Claude Opus 4.6

Model type:

Closed

API

SFT

Additional links:

Model was assessed with 1k tokens for reasoning.

Inference parameters

Generation Parameters:
codecorrectness - until=[" \n \n"];do_sample=false;temperature=0; \ncodelintereval - do_sample=true;temperature=0.6;max_gen_toks=1024;until=[" \n \n"]; \njavatestgen - do_sample=true;max_gen_toks=4096;temperature=0.2;top_p=0.9;until=["<|endoftext|>","<|im_end|>"]; \nrealcode - do_sample=true;max_gen_toks=4096;temperature=0.7;repetition_penalty=1.05;top_p=0.8;until=["<|endoftext|>","<|im_end|>"]; \nrealcodejava - do_sample=true;max_gen_toks=4096;temperature=0.7;repetition_penalty=1.05;top_p=0.8;until=["<|endoftext|>","<|im_end|>"]; \nrucodeeval_code - do_sample=true;temperature=0.6;max_gen_toks=1024;until=[" \nclass"," \ndef"," \n#"," \nif"," \nprint"]; \nrucodereviewer - temperature=0;do_sample=false;max_gen_toks=1000;until=[" \n \n"]; \nruhumaneval_code - do_sample=true;temperature=0.6;max_gen_toks=1024;until=[" \nclass"," \ndef"," \n#"," \nif"," \nprint"]; \nstrucom - do_sample=false;max_gen_toks=512;until=[" \n \n"]; \nunittests - do_sample=false;max_gen_toks=1024;until=[" \n \n"]; \nyabloco_oracle - max_gen_toks=2048;do_sample=false;until=["<|endoftext|>","<|im_end|>"," \n \n \n","\sclass\s","\sdef\s","^def\s","^class\s","^if\s","@","^#"];