Benchmarking Visual State Tracking in Multimodal Video Understanding Paper • 2606.03920 • Published 3 days ago • 21
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 16 days ago • 204
Training Large Language Models to Predict Clinical Events Paper • 2605.12817 • Published 24 days ago • 17
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 23 days ago • 270
Mean Mode Screaming: Mean–Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 29 days ago • 233
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation Paper • 2604.28196 • Published Apr 30 • 72