DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing Paper β’ 2602.12205 β’ Published Feb 12 β’ 83
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper β’ 2601.03233 β’ Published Jan 6 β’ 179
Unified Multimodal Model Collection A curated list for Multimodal Model Generation papers. β’ 22 items β’ Updated Feb 18 β’ 4
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper β’ 2512.08765 β’ Published Dec 9, 2025 β’ 134
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper β’ 2511.22699 β’ Published Nov 27, 2025 β’ 246
MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues Paper β’ 2512.03046 β’ Published Dec 2, 2025 β’ 12
Style Customization of Text-to-Vector Generation with Image Diffusion Priors Paper β’ 2505.10558 β’ Published May 15, 2025 β’ 16
AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation Paper β’ 2102.12593 β’ Published Feb 24, 2021 β’ 1
A Large-scale Dataset for Robust Complex Anime Scene Text Detection Paper β’ 2510.07951 β’ Published Oct 9, 2025 β’ 9
Emu3.5 Collection Native Multimodal Models are World Learners π β’ 4 items β’ Updated Feb 4 β’ 77
Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping Paper β’ 2509.04582 β’ Published Sep 4, 2025 β’ 8
Speed Up Model Collection A curated list of speed up model in multimodal generation. β’ 18 items β’ Updated Nov 12, 2025 β’ 1
USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning Paper β’ 2508.18966 β’ Published Aug 26, 2025 β’ 56
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing Paper β’ 2508.10881 β’ Published Aug 14, 2025 β’ 54
Skywork-Unipic2 Collection A Unified DiT Multimodal Model for Image Generation, Editing, and Understanding β’ 8 items β’ Updated Mar 2 β’ 11
NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining Paper β’ 2507.14119 β’ Published Jul 18, 2025 β’ 60