unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 Text Generation • 67B • Updated 2 days ago • 20.7k • 12
huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated Image-Text-to-Text • 36B • Updated 11 days ago • 30.4k • 217
Amazing work, you guys. I can run this locally and hit your API for excess load and get consistent output in both places.
How to Alleviate Catastrophic Forgetting in LLMs Finetuning? Hierarchical Layer-Wise and Element-Wise Regularization Paper • 2501.13669 • Published Jan 23, 2025 • 1