3 1

Ying

Heting

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper about 20 hours ago

Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models

Paper • 2605.28132 • Published 7 days ago • 17

upvoted a collection 4 months ago

Qwen3-TTS

7 items • Updated Jan 22 • 364

liked a Space 11 months ago

3d-Model-Playground (stereoDrift/3d-model-playground)

Control 3D models using gestures and voice

upvoted a paper about 1 year ago

VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

Paper • 2504.07615 • Published Apr 10, 2025 • 36

authored a paper over 1 year ago

OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer

Paper • 2406.16620 • Published Jun 24, 2024 • 3