yeyang

sysuyy

2 25 3

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

GEAR: Guided End-to-End AutoRegression for Image Synthesis

upvoted a paper 2 months ago

Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models

upvoted a paper 3 months ago

Image Generators are Generalist Vision Learners

View all activity

Organizations

None yet

upvoted a paper 11 days ago

GEAR: Guided End-to-End AutoRegression for Image Synthesis

Paper • 2606.32039 • Published 13 days ago • 34

upvoted a paper 2 months ago

Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models

Paper • 2604.26951 • Published Apr 29 • 50

upvoted 5 papers 3 months ago

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

Paper • 2603.25823 • Published Mar 26 • 44

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 353

upvoted 3 papers 4 months ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 99

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published Mar 24 • 62

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published Mar 4 • 190

upvoted a paper 6 months ago

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published Jan 9 • 60

upvoted 2 papers 9 months ago

Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback

Paper • 2510.16888 • Published Oct 19, 2025 • 22

DocReward: A Document Reward Model for Structuring and Stylizing

Paper • 2510.11391 • Published Oct 13, 2025 • 27

upvoted a paper 10 months ago

Do You Need Proprioceptive States in Visuomotor Policies?

Paper • 2509.18644 • Published Sep 23, 2025 • 50

upvoted a paper about 1 year ago

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Paper • 2507.07982 • Published Jul 10, 2025 • 34

updated a dataset about 1 year ago

sysuyy/ImgEdit

Updated Jun 16, 2025 • 10.7k • 45

upvoted a paper about 1 year ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published Jun 3, 2025 • 59

New activity in sysuyy/ImgEdit about 1 year ago

Update README.md

#2 opened about 1 year ago by

BestWishYsh

New activity in sysuyy/ImgEdit_recap_mask about 1 year ago

Update README.md

#2 opened about 1 year ago by

BestWishYsh

updated a collection about 1 year ago

imgedit

Collection

2 items • Updated Jun 1, 2025 • 2

yeyang

AI & ML interests

Recent Activity

Organizations

sysuyy's activity

Update README.md

Update README.md