1 5 11

CHEN Liyi PRO

mutou0308

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

liked a model 11 days ago

Efficient-Large-Model/SANA-WM_bidirectional

upvoted a paper 22 days ago

Audio-Visual Intelligence in Large Foundation Models

View all activity

Organizations

upvoted a paper 3 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 5 days ago • 124

liked a model 11 days ago

Efficient-Large-Model/SANA-WM_bidirectional

Image-to-Video • Updated 12 days ago • 115

upvoted a paper 22 days ago

Audio-Visual Intelligence in Large Foundation Models

Paper • 2605.04045 • Published 26 days ago • 35

published a model 2 months ago

mutou0308/Omni3DEdit

Updated Mar 29

updated a model 2 months ago

mutou0308/Omni3DEdit

Updated Mar 29

published a dataset 2 months ago

mutou0308/temp_log

Updated Mar 25 • 4

updated a dataset 2 months ago

mutou0308/temp_log

Updated Mar 25 • 4

liked a Space 2 months ago

Qwen Image Multiple Angles 3D Camera

🎥

2.51k

Transform image viewpoint with adjustable camera angles

updated a dataset 3 months ago

mutou0308/One2Scene

Updated Feb 26 • 64 • 1

published a dataset 3 months ago

mutou0308/One2Scene

Updated Feb 26 • 64 • 1

liked a model 3 months ago

Glanty/Capybara

Any-to-Any • Updated Feb 27 • 232

upvoted a paper 6 months ago

JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization

Paper • 2511.23002 • Published Nov 28, 2025 • 26

New activity in mutou0308/RE10K 6 months ago

Hello, may I ask if you uploaded the complete dataset of realestate10k? May I ask if train50 to 54 are missing?

#2 opened 6 months ago by

qwewreter

updated a dataset 6 months ago

mutou0308/co3dv2

Updated Dec 5, 2025 • 1.26k • 2

published a dataset 6 months ago

mutou0308/co3dv2

Updated Dec 5, 2025 • 1.26k • 2

liked a Space 10 months ago

BAGEL

🚀

220

Demo for BAGEL

upvoted a paper 10 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 256

liked a Space 10 months ago

Pi3

📈

Permutation-Equivariant Visual Geometry Learning

liked a Space 11 months ago

JarvisArt Preview

🏃

105

Generate Lightroom presets from images and prompts

published a dataset 11 months ago

mutou0308/RE10K

Preview • Updated Jul 11, 2025 • 202 • 13