Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Xin Zhou's picture
2 10 9

Xin Zhou

LMD0311
·

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago
H-EmbodVis/HERMESV2
published a model 5 days ago
H-EmbodVis/HERMESV2
updated a model 6 days ago
H-EmbodVis/HERMESV2
View all activity

Organizations

H-EmbodVis's profile picture

authored a paper 25 days ago

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published 27 days ago • 115
authored 4 papers about 1 month ago

Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception

Paper • 2503.13587 • Published Mar 17, 2025

More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models

Paper • 2510.23574 • Published Oct 27, 2025

Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching

Paper • 2507.02860 • Published Jul 3, 2025

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 156
authored 2 papers about 1 year ago

MINIMA: Modality Invariant Image Matching

Paper • 2412.19412 • Published Dec 27, 2024 • 4

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation

Paper • 2501.14729 • Published Jan 24, 2025 • 3
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs