Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published 14 days ago • 87
Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes Paper • 2603.25562 • Published Mar 26 • 14
Article Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️ Jan 4, 2025 • 9
UltraData Collection Ultra Scale, Ultra Quality, Ultra Coverage • 10 items • Updated 10 days ago • 81
Data Science and Technology Towards AGI Part I: Tiered Data Management Paper • 2602.09003 • Published Feb 9 • 7