Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
HCAI-Lab 's Collections
Other projects under HCAI-Lab
Archive (pre-6T and legacy)
OLMES Evaluations
TrackStar — Scores + Analysis
TrackStar — Indices + Training Shards
Dolma3 — Query Data
Dolma3 — Working Samples + Preconditioner
Dolma3 — Source Corpus + Manifest

OLMES Evaluations

updated about 9 hours ago

OLMES benchmark evaluation results across OLMo-3-7B and SmolLM-3-3B model variants.

Upvote
-

  • HCAI-Lab/olmes-eval-olmo3-7b-base

    Updated about 9 hours ago • 27

    Note OLMo-3-7B base.


  • HCAI-Lab/olmes-eval-olmo3-7b-instruct-base

    Viewer • Updated about 9 hours ago • 30.8k • 196

    Note OLMo-3-7B instruct-base.


  • HCAI-Lab/olmes-eval-olmo3-7b-instruct-cot

    Viewer • Updated about 9 hours ago • 21.6k • 256

    Note OLMo-3-7B instruct + chain-of-thought.


  • HCAI-Lab/olmes-eval-olmo3-7b-thinking

    Viewer • Updated about 9 hours ago • 17.4k • 29

    Note OLMo-3-7B thinking.


  • HCAI-Lab/olmes-eval-smollm3-3b-base

    Viewer • Updated about 9 hours ago • 17.4k • 8

    Note SmolLM3-3B base.

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs