Building a Precise Video Language with Human-AI Oversight Paper • 2604.21718 • Published 13 days ago • 15
Sparse Attention Vectors: Generative Multimodal Model Features Are Discriminative Vision-Language Classifiers Paper • 2412.00142 • Published Nov 28, 2024 • 5
Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration Paper • 2503.00737 • Published Mar 2, 2025
CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement Paper • 2501.06441 • Published Jan 11, 2025