EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning Paper • 2606.03108 • Published 10 days ago • 7
Running on Zero Agents Featured 1.94k Qwen3-TTS Demo 🎙 1.94k Generate speech from text using voice design, cloning or presets
view article Article SmolLM - blazingly fast and remarkably powerful +1 loubnabnl, anton-l, eliebak • Jul 16, 2024 • 460
high-quality Chinese training datasets Collection a suite of high-quality Chinese datasets, used for pretraining, fine-tuning or preference alignment. And the models trained on these datasets. • 13 items • Updated May 22, 2025 • 24
Running on CPU Upgrade Featured 3.2k The Smol Training Playbook 📚 3.2k The secrets to building world-class LLMs
view article Article mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL driaforall • Sep 11, 2025 • 26
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 309
Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings Paper • 2509.10534 • Published Sep 5, 2025 • 4 • 1
Decoupling the "What" and "Where" With Polar Coordinate Positional Embeddings Paper • 2509.10534 • Published Sep 5, 2025 • 4