The table will scroll to the left
| Task name | Result | Metric |
|---|---|---|
| YABLoCo | 0.038 / 0.005 |
EM
pass@k
|
| stRuCom | 0.21 |
chrF
|
| RealCode | 0.004 / 0.951 |
pass@k
execution_success
|
| UnitTests | 0.249 |
CodeBLEU
|
| ruCodeEval | 0.502 / 0.609 / 0.652 |
pass@k
|
| JavaTestGen | 0.066 / 0.198 |
pass@k
compile@1
|
| ruHumanEval | 0.666 / 0.764 / 0.78 |
pass@k
|
| RealCodeJava | 0.178 / 0.966 |
pass@k
execution_success
|
| CodeLinterEval | 0.422 / 0.575 / 0.636 |
pass@k
|
| ruCodeReviewer | 0.037 / 0.188 / 0.01 / 0.052 / 0.065 |
chrF
BLEU
judge@1
judge@5
judge@10
|
| CodeCorrectness | 0.741 |
EM
|