The table will scroll to the left
| Task name | Result | Metric |
|---|---|---|
| YABLoCo | 0.043 / 0.01 |
EM
pass@k
|
| stRuCom | 0.16 |
chrF
|
| RealCode | 0.004 / 0.955 |
pass@k
execution_success
|
| UnitTests | 0.088 |
CodeBLEU
|
| ruCodeEval | 0.006 / 0.026 / 0.043 |
pass@k
|
| JavaTestGen | 0.044 / 0.273 |
pass@k
compile@1
|
| ruHumanEval | 0.007 / 0.024 / 0.037 |
pass@k
|
| RealCodeJava | 0.087 / 0.973 |
pass@k
execution_success
|
| CodeLinterEval | 0.403 / 0.566 / 0.6 |
pass@k
|
| ruCodeReviewer | 0.014 / 0.123 / 0 / 0 / 0 |
chrF
BLEU
judge@1
judge@5
judge@10
|
| CodeCorrectness | 0.837 |
EM
|