EvalEval Coalition

community

https://evalevalai.com/

evaluatingevals

AI & ML interests

We’re building a research coalition on evaluating evaluations (EvalEval), hosted by Hugging Face, the University of Edinburgh, and EleutherAI.

Recent Activity

j-chim updated a dataset about 5 hours ago

evaleval/entity-registry-data

evijit updated a dataset about 9 hours ago

evaleval/card_backend

evijit updated a bucket about 14 hours ago

evaleval/general-eval-card-storage

Papers

Every Eval Ever: A Unifying Schema and Community Repository for AI Evaluation Results

Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting

Articles

Introducing Evaluation Cards: A Live Interpretive Layer for Understanding the AI Evaluations Ecosystem

AI evals are becoming the new compute bottleneck

evaleval's datasets (9)

evaleval/entity-registry-data

Viewer • Updated about 5 hours ago • 229k • 851 • 1

evaleval/card_backend

Preview • Updated about 9 hours ago • 9.28k • 1

evaleval/auto-benchmarkcards

Viewer • Updated 13 days ago • 516 • 666 • 4

evaleval/EEE_datastore

Viewer • Updated 27 days ago • 4.89k • 9.48k • 37

evaleval/alphaxiv

Viewer • Updated about 1 month ago • 15 • 3.8k

evaleval/HELM_datastore

Updated Jun 18 • 71

evaleval/EEE_datastore-flat-temp

Updated Jun 10 • 22

evaleval/alphaxiv_datastore

Updated Feb 20 • 27 • 1

evaleval/social_impact_eval_annotations

Viewer • Updated Nov 28, 2025 • 4.24k • 34 • 4