qjdcool
qjdcool
AI & ML interests
None yet
Recent Activity
upvoted a paper about 13 hours ago
See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding upvoted a paper 11 months ago
Depth Anything at Any Condition upvoted a paper 11 months ago
LLaVA-Scissor: Token Compression with Semantic Connected Components for
Video LLMsOrganizations
None yet