--- license: cc-by-4.0 datasets: - prem-research/birdbench language: - en metrics: - exact_match - accuracy base_model: - Qwen/Qwen3-4B-Instruct-2507 pipeline_tag: text-generation --- Our Baseline Model