Tasks

Evaluating specialized large language models in the sectors: agriculture and medicine

The name of the dataset
Domain
Metric
Agricultural industry
F1 Exact Match
Agricultural industry
F1 Exact Match
Medicine and healthcare
F1 Exact Match