nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-FP8 Any-to-Any • 33B • Updated 1 day ago • 101k • 44
deepseek-ai/DeepSeek-V4-Flash Text Generation • 158B • Updated about 14 hours ago • 669k • • 964
Running 3.83k The Ultra-Scale Playbook 🌌 3.83k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-V4-Pro Text Generation • 862B • Updated about 14 hours ago • 787k • • 3.64k