egotextvqa-group
university
https://zhousheng97.github.io/
AI & ML interests
None defined yet.
Recent Activity
DogNeverSleep submitted a paper 3 days ago: VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining
DogNeverSleep authored a paper about 1 month ago: BrowseComp-$V^3$: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents
DogNeverSleep authored a paper about 1 month ago: OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models
Team members: 2
egotextvqa-group's models: None public yet