Do You See What I Am Pointing At? Gesture-Based Egocentric Video Question Answering • Paper • 2603.12533 • Published 4 days ago
Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback • Paper • 2402.03746 • Published Feb 6, 2024