VLM-SubtleBench: How Far Are VLMs from Human-Level Subtle Comparative Reasoning? Paper • 2603.07888 • Published 2 days ago • 7