From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 5 days ago • 68