RU EN

Tasks

The benchmark comprises 23 tasks across various domains, assessing different skills of the model.

Type of task

All tasks

Select domains

All domains

Expand the list of domains Collapse the list of domains

Filters

The name of the dataset

The ability of the model

Top Score | Human Baseline

Metric