trl-internal-testing/tiny-DeepseekV3ForCausalLM • Text Generation • 3.81M • Updated 2 days ago • 272k • 3
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF • Text Generation • 480B • Updated Jul 31, 2025 • 2.6k • 178