EvalEval Coalition

Team
community
https://evalevalai.com/
evaluatingevals
evaleval

AI & ML interests

We’re building a research coalition on evaluating evaluations (EvalEval)! It is hosted by Hugging Face, the University of Edinburgh, and EleutherAI.

Recent Activity

evijit updated a dataset about 3 hours ago: evaleval/card_backend
evijit authored a paper about 4 hours ago: Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting
j-chim published an article about 18 hours ago: Introducing Evaluation Cards: A Live Interpretive Layer for Understanding the AI Evaluations Ecosystem

Papers

Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting


Articles

Introducing Evaluation Cards: A Live Interpretive Layer for Understanding the AI Evaluations Ecosystem

about 18 hours ago • 1

AI evals are becoming the new compute bottleneck

Apr 29 • 29

Yacine Jernite, Irene Solaiman, Canyu Chen, Felix Friedrich, Alina Leidinger, Margaret Mitchell, Jennifer Mickel, Usman Gohar, Levent Sagun, Shubham Singh, Avijit Ghosh, Leshem Choshen, Aurélien-Morgan CLAUDON, Amita Shukla, Prajna Soni, Anshuman Suri, Joseph [open/acc] Pollack, Mowafak Allaham, wave, Ali El Filali, Andrew Tran, Monojit, Kevin Wei, Jan Batzner, Jenny Chim, Mubashara Akhtar, Sree Harsha Nelaturu, Hossein A. (Saeed) Rahmani, Abdul Muhsin Hameed, Srishti, Joshua Noble, EvalEval Bot, Damian Stachura, Šimon Podhajský, Anastassia Kornilova, Inge V, Aris, Sriram Mohan, Tommaso Cerruti, ImamaShehzad, Marek Suppa, Yifan Mai, Georgia Channing, Asaf Yehudai, Harsh, Anka Reuel, Steven Dillmann, Yiyang Nan

evaleval's datasets (8)

evaleval/card_backend

Preview • Updated 30 minutes ago • 18.9k • 1

evaleval/EEE_datastore

Updated 1 day ago • 46.9k • 28

evaleval/entity-registry-data

Updated 1 day ago • 363

evaleval/EEE_datastore-flat-temp

Updated 2 days ago • 7.59k

evaleval/auto-benchmarkcards

Preview • Updated May 8 • 715 • 3

evaleval/alphaxiv_datastore

Updated Feb 20 • 18 • 1

evaleval/social_impact_eval_annotations

Viewer • Updated Nov 28, 2025 • 4.24k • 30 • 4

evaleval/old_eee

Preview • Updated Sep 13, 2025 • 31