Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Harris Zhang's picture
1 3 2

Harris Zhang

HanSolo9682
jizhongpeng's profile picture mucai's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs
authored a paper about 1 month ago
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding
authored a paper about 1 month ago
Reasoning-Augmented Representations for Multimodal Retrieval
View all activity

Organizations

University of Wisconsin - Madison's profile picture vgbench's profile picture CounterCurate's profile picture LLaVA-R1's profile picture ThinkSpace's profile picture

upvoted a paper 3 days ago

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs

Paper • 2603.18004 • Published 4 days ago • 12
upvoted a paper 4 months ago

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

Paper • 2511.03774 • Published Nov 5, 2025 • 13
upvoted a paper over 1 year ago

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Paper • 2410.02763 • Published Oct 3, 2024 • 7
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs