ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions Paper • 2501.12173 • Published Jan 21, 2025 • 1
BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation Paper • 2505.06985 • Published May 11, 2025
LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation Paper • 2508.07603 • Published Aug 11, 2025 • 1
Zero-shot 3D-Aware Trajectory-Guided image-to-video generation via Test-Time Training Paper • 2509.06723 • Published Sep 8, 2025
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation Paper • 2512.07831 • Published Dec 8, 2025 • 17
ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation Paper • 2512.03621 • Published Dec 3, 2025 • 9
From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing Paper • 2512.25066 • Published Dec 31, 2025 • 5
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 10 days ago • 219
StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors Paper • 2512.16915 • Published Dec 18, 2025 • 38
view article Article Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation Dec 16, 2025 • 57