Training Data Efficiency in Multimodal Process Reward Models Paper • 2602.04145 • Published 1 day ago • 69
VisPlay: Self-Evolving Vision-Language Models from Images Paper • 2511.15661 • Published Nov 19, 2025 • 43
Self-Rewarding Vision-Language Model via Reasoning Decomposition Paper • 2508.19652 • Published Aug 27, 2025 • 84