WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models Paper • 2602.02537 • Published 13 days ago • 5
Generative Frame Sampler for Long Video Understanding Paper • 2503.09146 • Published Mar 12, 2025 • 1