Running on CPU Upgrade Featured 2.93k The Smol Training Playbook π 2.93k The secrets to building world-class LLMs
google/embeddinggemma-300m Sentence Similarity β’ 0.3B β’ Updated Sep 25, 2025 β’ 741k β’ β’ 1.43k
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code +2 May 23, 2025 β’ 171
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths β’ 3 items β’ Updated 27 days ago β’ 127
Running on Zero Featured 2.02k Chat With Janus-Pro-7B π 2.02k A unified multimodal understanding and generation model.
Attention Heads of Large Language Models: A Survey Paper β’ 2409.03752 β’ Published Sep 5, 2024 β’ 92
Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models Paper β’ 2407.12327 β’ Published Jul 17, 2024 β’ 79