AtomicChat/gemma-4-26B-A4B-it-assistant-GGUF Text Generation • 0.4B • Updated 4 days ago • 7.17k • 12
view article Article You could have designed state of the art positional encoding FL33TW00D-HF • Nov 25, 2024 • 477
Running on CPU Upgrade 233 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 233 Explore synthetic data experiments on a virtual bookshelf
Running on CPU Upgrade Featured 3.16k The Smol Training Playbook 📚 3.16k The secrets to building world-class LLMs