Zhenxiong Tan PRO

Yuanshi

21 45 22

AI & ML interests

Reinforcement Learning; Large Language Model; Multimodality; AI Infrastructure;

Recent Activity

upvoted a paper about 1 month ago

World Action Models: A Survey

upvoted a paper about 2 months ago

Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation

upvoted a paper 2 months ago

Q-ARVD: Quantizing Autoregressive Video Diffusion Models

View all activity

Organizations

upvoted a paper about 1 month ago

World Action Models: A Survey

Paper • 2606.20781 • Published Jun 18 • 56

upvoted a paper about 2 months ago

Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation

Paper • 2606.17030 • Published Jun 15 • 47

upvoted 3 papers 2 months ago

upvoted 2 papers 3 months ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published Apr 30 • 92

Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms

Paper • 2604.23775 • Published Apr 26 • 46

upvoted a paper 4 months ago

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published Apr 9 • 54

authored 2 papers 4 months ago

FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation

Paper • 2511.14712 • Published Nov 18, 2025 • 2

Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers

Paper • 2603.27666 • Published Mar 29 • 18

upvoted a paper 4 months ago

Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers

Paper • 2603.27666 • Published Mar 29 • 18

submitted a paper to Daily Papers 4 months ago

Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers

Paper • 2603.27666 • Published Mar 29 • 18

upvoted a paper 4 months ago

Make Geometry Matter for Spatial Reasoning

Paper • 2603.26639 • Published Mar 27 • 33

upvoted 2 papers 5 months ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published Mar 16 • 187

Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models

Paper • 2603.15557 • Published Mar 16 • 30

authored a paper 5 months ago

ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer

Paper • 2603.15478 • Published Mar 16 • 24

upvoted a paper 5 months ago

ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer

Paper • 2603.15478 • Published Mar 16 • 24

upvoted a paper 6 months ago

dVoting: Fast Voting for dLLMs

Paper • 2602.12153 • Published Feb 12 • 23

liked a Space 7 months ago

AI Deadlines

⚡

773

Browse upcoming AI conference and workshop deadlines

liked a dataset 7 months ago

LucasFang/FLUX-Reason-6M

Viewer • Updated Feb 2 • 5.89M • 16.9k • 97

Zhenxiong Tan PRO

AI & ML interests

Recent Activity

Organizations

Yuanshi's activity

AI Deadlines