Medmarks: A Comprehensive Open-Source LLM Benchmark Suite for Medical Tasks Paper • 2605.01417 • Published May 2 • 1
Compress-Distill: Reasoning Trace Compression for Efficient Knowledge Distillation Paper • 2606.05988 • Published 6 days ago • 2
Compress-Distill: Reasoning Trace Compression for Efficient Knowledge Distillation Paper • 2606.05988 • Published 6 days ago • 2
Compress-Distill: Reasoning Trace Compression for Efficient Knowledge Distillation Paper • 2606.05988 • Published 6 days ago • 2
view article Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models nvidia • Jul 18, 2025 • 51
deepseek-ai/DeepSeek-R1-0528 Text Generation • 685B • Updated May 29, 2025 • 6.41M • • 2.45k
nvidia/Llama-Nemotron-Post-Training-Dataset Viewer • Updated May 8, 2025 • 3.91M • 4.96k • 675
Running 3.88k The Ultra-Scale Playbook 🌌 3.88k The ultimate guide to training LLM on large GPU Clusters
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models +1 loubnabnl, anton-l, davanstrien • Mar 20, 2024 • 114
OpenLLM-France/Lucie-7B-Instruct-human-data Text Generation • 7B • Updated Mar 19, 2025 • 636 • 7