qjdcool's picture

3

qjdcool

qjdcool

qjdcool

AI & ML interests

None yet

Recent Activity

upvoted a paper about 18 hours ago

See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding

upvoted a paper 11 months ago

Depth Anything at Any Condition

upvoted a paper 11 months ago

LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs

View all activity

Organizations

None yet

upvoted a paper about 18 hours ago

See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding

Paper • 2605.18018 • Published 2 days ago • 17

upvoted 2 papers 11 months ago

Depth Anything at Any Condition

Paper • 2507.01634 • Published Jul 2, 2025 • 49

LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs

Paper • 2506.21862 • Published Jun 27, 2025 • 36