Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published 12 days ago • 76
HiMu: Hierarchical Multimodal Frame Selection for Long Video Question Answering Paper • 2603.18558 • Published 7 days ago • 10
Step-Wise Refusal Dynamics in Autoregressive and Diffusion Language Models Paper • 2602.02600 • Published Feb 1 • 13
Step-Wise Refusal Dynamics in Autoregressive and Diffusion Language Models Paper • 2602.02600 • Published Feb 1 • 13
Step-Wise Refusal Dynamics in Autoregressive and Diffusion Language Models Paper • 2602.02600 • Published Feb 1 • 13 • 1
HERBench: A Benchmark for Multi-Evidence Integration in Video Question Answering Paper • 2512.14870 • Published Dec 16, 2025 • 15
Video Generation Models Are Good Latent Reward Models Paper • 2511.21541 • Published Nov 26, 2025 • 47