NAHYUN LEE's picture

NAHYUN LEE

2nhyn

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 9 hours ago

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

upvoted a paper 29 days ago

KMMMU: Evaluation of Massive Multi-discipline Multimodal Understanding in Korean Language and Context

updated a dataset about 1 month ago

HAERAE-HUB/KMMMU

View all activity

Organizations

upvoted a paper about 9 hours ago

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published 19 days ago • 79

upvoted a paper 29 days ago

KMMMU: Evaluation of Massive Multi-discipline Multimodal Understanding in Korean Language and Context

Paper • 2604.13058 • Published Mar 18 • 2

upvoted a paper 4 months ago

Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math

Paper • 2602.06291 • Published Feb 6 • 24