
zhumuzhi

Z-MU-Z

AI & ML interests

None yet

Recent Activity

authored a paper 2 days ago
GUI-Shepherd: Reliable Process Reward and Verification for Long-Sequence GUI Tasks
authored a paper 2 days ago
HieraTok: Multi-Scale Visual Tokenizer Improves Image Reconstruction and Generation
authored a paper 2 days ago
Bridge Thinking and Acting: Unleashing Physical Potential of VLM with Generalizable Action Expert

Organizations

Zhejiang University

commented 2 papers 11 months ago

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Paper • 2505.21457 • Published May 27, 2025 • 16 •
2

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Paper • 2505.20256 • Published May 26, 2025 • 19 •
1
commented a paper about 1 year ago

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Paper • 2503.08625 • Published Mar 11, 2025 • 27 •
2