Data Darwinism Part I: Unlocking the Value of Scientific Data for Pre-training Paper • 2602.07824 • Published 15 days ago • 15
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts Paper • 2602.13367 • Published 10 days ago • 27
view article Article Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs 27 days ago • 22
Running 37 Falcon-H1-Tiny: A series of extremely small, yet powerful language models redefining capabilities at small scale 📝 37 Generate text using extremely small yet powerful language models
tiiuae/Falcon-H1-Tiny-90M-Instruct-Curriculum-pre-DPO Text Generation • 91.1M • Updated Jan 15 • 14 • 1