The table will scroll to the left
| Task name | Result | Metric |
|---|---|---|
| YABLoCo | 0.014 / 0.005 |
EM
pass@k
|
| stRuCom | 0.115 |
chrF
|
| RealCode | 0.091 / 0.956 |
pass@k
execution_success
|
| UnitTests | 0.175 |
CodeBLEU
|
| ruCodeEval | 0.291 / 0.397 / 0.439 |
pass@k
|
| JavaTestGen | 0.057 / 0.33 |
pass@k
compile@1
|
| ruHumanEval | 0.272 / 0.395 / 0.433 |
pass@k
|
| RealCodeJava | 0.188 / 0.977 |
pass@k
execution_success
|
| CodeLinterEval | 0.482 / 0.491 / 0.5 |
pass@k
|
| ruCodeReviewer | 0.013 / 0.112 / 0.023 / 0.023 / 0.023 |
chrF
BLEU
judge@1
judge@5
judge@10
|
| CodeCorrectness | 0.794 |
EM
|