Qwen/Qwen3.5-397B-A17B-FP8 Image-Text-to-Text • 403B • Updated about 19 hours ago • 4.31k • 29
Article • OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models • Jul 18, 2025 • 50
nvidia/Llama-Nemotron-Post-Training-Dataset Viewer • Updated May 8, 2025 • 3.91M • 2.19k • 643
The Ultra-Scale Playbook 🌌 Space • Running • 3.7k • The ultimate guide to training LLM on large GPU Clusters
Article • Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models • Mar 20, 2024 • 109
OpenLLM-France/Lucie-7B-Instruct-human-data Text Generation • 7B • Updated Mar 19, 2025 • 240 • 7
DolphinLabeled Datasets Collection • Eric Hartford has added labels to help you filter datasets, for your pleasure. • 5 items • Updated Jan 6, 2025 • 15