Claude Opus 4.6

MERA Created at 26.02.2026 08:21

0.472

The overall result

Ratings for leaderboard tasks

The table will scroll to the left

Task name	Result	Metric
YABLoCo	0.13 / 0.149	EM pass@k
stRuCom	0.318	chrF
RealCode	0.501 / 0.943	pass@k execution_success
UnitTests	0.367	CodeBLEU
ruCodeEval	0.245 / 0.29 / 0.299	pass@k
JavaTestGen	0.405 / 0.639	pass@k compile@1
ruHumanEval	0.462 / 0.507 / 0.518	pass@k
RealCodeJava	0.433 / 0.966	pass@k execution_success
CodeLinterEval	0.973 / 0.973 / 0.973	pass@k
ruCodeReviewer	0.046 / 0.183 / 0.176 / 0.38 / 0.467	chrF BLEU judge@1 judge@5 judge@10
CodeCorrectness	0.912	EM

Information about the submission

Mera version

v1.0.0

Torch Version

2.9.0

The version of the codebase

7030e16

CUDA version

12.8

Precision of the model weights

auto

Seed

1234

Batch

Transformers version

4.57.1

The number of GPUs and their type

1 x NVIDIA A100-SXM4-80GB

Architecture

openai-chat-completions

Team:

MERA

Name of the ML model:

Claude Opus 4.6

Link to the ML model:

https://www.anthropic.com/news/claude-opus-4-6

Model type:

Closed

API

SFT

Additional links:

Model was assessed with 1k tokens for reasoning.

Inference parameters

Generation Parameters:
codecorrectness - until=[" \n \n"];do_sample=false;temperature=0; \ncodelintereval - do_sample=true;temperature=0.6;max_gen_toks=1024;until=[" \n \n"]; \njavatestgen - do_sample=true;max_gen_toks=4096;temperature=0.2;top_p=0.9;until=["<|endoftext|>","<|im_end|>"]; \nrealcode - do_sample=true;max_gen_toks=4096;temperature=0.7;repetition_penalty=1.05;top_p=0.8;until=["<|endoftext|>","<|im_end|>"]; \nrealcodejava - do_sample=true;max_gen_toks=4096;temperature=0.7;repetition_penalty=1.05;top_p=0.8;until=["<|endoftext|>","<|im_end|>"]; \nrucodeeval_code - do_sample=true;temperature=0.6;max_gen_toks=1024;until=[" \nclass"," \ndef"," \n#"," \nif"," \nprint"]; \nrucodereviewer - temperature=0;do_sample=false;max_gen_toks=1000;until=[" \n \n"]; \nruhumaneval_code - do_sample=true;temperature=0.6;max_gen_toks=1024;until=[" \nclass"," \ndef"," \n#"," \nif"," \nprint"]; \nstrucom - do_sample=false;max_gen_toks=512;until=[" \n \n"]; \nunittests - do_sample=false;max_gen_toks=1024;until=[" \n \n"]; \nyabloco_oracle - max_gen_toks=2048;do_sample=false;until=["<|endoftext|>","<|im_end|>"," \n \n \n","\sclass\s","\sdef\s","^def\s","^class\s","^if\s","@","^#"];

Claude Opus 4.6

Ratings for leaderboard tasks

Information about the submission

Team:

Name of the ML model:

Link to the ML model:

Model type:

Additional links:

Inference parameters

Confirm the deletion of the sub