Do Thought Streams Matter? Evaluating Reasoning in Gemini Vision-Language Models for Video Scene Understanding Paper • 2604.11177 • Published 5 days ago • 7
Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video Environments Paper • 2502.06445 • Published Feb 10, 2025 • 1