Zhongyu Yang's picture

Zhongyu Yang

yzzyu

AI & ML interests

None yet

Recent Activity

authored a paper 2 days ago

MultiHaystack: Benchmarking Multimodal Retrieval and Reasoning over 40K Images, Videos, and Documents

authored a paper 2 days ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

upvoted a paper 5 days ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

View all activity

Organizations

None yet

authored 2 papers 2 days ago

MultiHaystack: Benchmarking Multimodal Retrieval and Reasoning over 40K Images, Videos, and Documents

Paper • 2603.05697 • Published Mar 5

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published 7 days ago • 85

authored a paper 3 months ago

XR: Cross-Modal Agents for Composed Image Retrieval

Paper • 2601.14245 • Published Jan 20 • 8

submitted a paper to Daily Papers 3 months ago

XR: Cross-Modal Agents for Composed Image Retrieval

Paper • 2601.14245 • Published Jan 20 • 8

authored a paper 4 months ago

InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration

Paper • 2512.02981 • Published Dec 2, 2025 • 1

authored a paper 5 months ago

Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models

Paper • 2512.01949 • Published Dec 1, 2025 • 9

authored a paper about 1 year ago

WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation

Paper • 2503.19065 • Published Mar 24, 2025 • 11