The table will scroll to the left
| Task name | Result | Metric |
|---|---|---|
| YABLoCo | 0.077 / 0.029 |
EM
pass@k
|
| stRuCom | 0.232 |
chrF
|
| RealCode | 0.188 / 0.986 |
pass@k
execution_success
|
| UnitTests | 0.153 |
CodeBLEU
|
| ruCodeEval | 0.448 / 0.552 / 0.579 |
pass@k
|
| JavaTestGen | 0.233 / 0.507 |
pass@k
compile@1
|
| ruHumanEval | 0.448 / 0.554 / 0.579 |
pass@k
|
| RealCodeJava | 0.342 / 0.987 |
pass@k
execution_success
|
| CodeLinterEval | 0.423 / 0.548 / 0.573 |
pass@k
|
| ruCodeReviewer | 0.022 / 0.132 / 0.074 / 0.129 / 0.144 |
chrF
BLEU
judge@1
judge@5
judge@10
|
| CodeCorrectness | 0.571 |
EM
|