ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions Paper • 2501.12173 • Published Jan 21, 2025 • 1
BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation Paper • 2505.06985 • Published May 11, 2025
LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation Paper • 2508.07603 • Published Aug 11, 2025 • 1
Zero-shot 3D-Aware Trajectory-Guided image-to-video generation via Test-Time Training Paper • 2509.06723 • Published Sep 8, 2025
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation Paper • 2512.07831 • Published Dec 8, 2025 • 17
ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation Paper • 2512.03621 • Published Dec 3, 2025 • 9
From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing Paper • 2512.25066 • Published Dec 31, 2025 • 5
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving Paper • 2404.16771 • Published Dec 28, 2024 • 19