MJPansa/MiniMax-M2.7-REAP-172B-A10B-AutoRound-W4A16 Text Generation β’ 24B β’ Updated Apr 15 β’ 3.62k β’ 9
MJPansa/MiniMax-M2.7-REAP-172B-A10B-AutoRound-W4A16 Text Generation β’ 24B β’ Updated Apr 15 β’ 3.62k β’ 9
MJPansa/MiniMax-M2.7-REAP-172B-A10B-AutoRound-W4A16 Text Generation β’ 24B β’ Updated Apr 15 β’ 3.62k β’ 9
Running 3.85k The Ultra-Scale Playbook π 3.85k The ultimate guide to training LLM on large GPU Clusters
openGPT-X/Teuken-7B-instruct-commercial-v0.4 Text Generation β’ 7B β’ Updated Dec 11, 2024 β’ 1.61k β’ 74
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper β’ 2412.13663 β’ Published Dec 18, 2024 β’ 164