Qwen2.5-72B-Instruct

MERA Создан 16.07.2025 17:21
0.288
Общий результат

Оценки по задачам лидерборда

Таблица скроллится влево

Задача Результат Метрика
YABLoCo 0.144 / 0.024
EM pass@k
stRuCom 0.252
chrF
RealCode 0.362 / 0.988
pass@k execution_success
UnitTests 0.16
CodeBLEU
ruCodeEval 0.174 / 0.177 / 0.177
pass@k
JavaTestGen 0.189 / 0.476
pass@k compile@1
ruHumanEval 0.157 / 0.163 / 0.165
pass@k
RealCodeJava 0.349 / 1
pass@k execution_success
CodeLinterEval 0.481 / 0.497 / 0.5
pass@k
ruCodeReviewer 0.023 / 0.158 / 0.048 / 0.104 / 0.136
chrF BLEU judge@1 judge@5 judge@10
CodeCorrectness 0.702
EM

Информация о сабмите

Версия MERA
v.1.0.0
Версия Torch
2.7.0
Версия кодовой базы
e77f33f
Версия CUDA
12.6
Precision весов модели
bfloat16
Сид
1234
Батч
1
Версия transformers
4.53.0
Количество GPU и их тип
8 x NVIDIA A100-SXM4-80GB
Архитектура
local-chat-completions

Команда:

MERA

Название ML-модели:

Qwen2.5-72B-Instruct

Ссылка на ML-модель:

https://huggingface.co/Qwen/Qwen2.5-72B-Instruct

Размер модели

72.7B

Тип модели:

Открытая

SFT

Описание архитектуры:

Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qwen2.5 brings the following improvements upon Qwen2: Significantly more knowledge and has greatly improved capabilities in coding and mathematics, thanks to our specialized expert models in these domains. Significant improvements in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g, tables), and generating structured outputs especially JSON. More resilient to the diversity of system prompts, enhancing role-play implementation and condition-setting for chatbots. Long-context Support up to 128K tokens and can generate up to 8K tokens. Multilingual support for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.

Лицензия:

QWEN https://huggingface.co/Qwen/Qwen2.5-72B-Instruct/blob/main/LICENSE