Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection Paper • 2207.14192 • Published Oct 4, 2022
RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space Paper • 2606.14700 • Published 8 days ago • 14
CityRAG: Stepping Into a City via Spatially-Grounded Video Generation Paper • 2604.19741 • Published Apr 21 • 17
Tinted Frames: Question Framing Blinds Vision-Language Models Paper • 2603.19203 • Published Mar 19 • 17
LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory Paper • 2603.03269 • Published Mar 3 • 63 • 7