Lingdong Kong

ldkong

https://ldkong.com

AI & ML interests

3D Perception, Generation, and World Modeling

Recent Activity

authored a paper 2 days ago

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

upvoted a paper 3 days ago

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

authored a paper 7 days ago

U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences

authored a paper 2 days ago

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published 6 days ago • 211

authored 6 papers 7 days ago

U4D: Uncertainty-Aware 4D World Modeling from LiDAR Sequences

Paper • 2512.02982 • Published Dec 2, 2025 • 2

EditMGT: Unleashing Potentials of Masked Generative Transformers in Image Editing

Paper • 2512.11715 • Published Dec 12, 2025

WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

Paper • 2512.10958 • Published Dec 11, 2025 • 1

Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future

Paper • 2512.16760 • Published Dec 18, 2025 • 15

Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

Paper • 2512.24385 • Published Dec 30, 2025 • 8

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Paper • 2604.18486 • Published 10 days ago • 89

submitted a paper to Daily Papers 9 days ago

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Paper • 2604.18486 • Published 10 days ago • 89

submitted a paper to Daily Papers 4 months ago

Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future

Paper • 2512.16760 • Published Dec 18, 2025 • 15

authored 8 papers 5 months ago

Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving

Paper • 2405.05258 • Published May 8, 2024

Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations

Paper • 2507.05260 • Published Jul 7, 2025

SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining

Paper • 2503.19912 • Published Mar 25, 2025

Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation

Paper • 2407.15282 • Published Jul 21, 2024

SEE4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting

Paper • 2510.26796 • Published Oct 30, 2025 • 1

authored 3 papers 6 months ago

3EED: Ground Everything Everywhere in 3D

Paper • 2511.01755 • Published Nov 3, 2025 • 11

RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning

Paper • 2510.02240 • Published Oct 2, 2025 • 18

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23, 2025 • 56
