DyaDiT: A Multi-Modal Diffusion Transformer for Socially Favorable Dyadic Gesture Generation Paper • 2602.23165 • Published Feb 26 • 2
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published Mar 2 • 150
When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains Paper • 2603.01301 • Published Mar 1 • 8
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation Paper • 2602.11451 • Published Feb 11 • 16
EasyV2V: A High-quality Instruction-based Video Editing Framework Paper • 2512.16920 • Published Dec 18, 2025 • 18
CRISP: Contact-Guided Real2Sim from Monocular Video with Planar Scene Primitives Paper • 2512.14696 • Published Dec 16, 2025 • 8
FastHMR: Accelerating Human Mesh Recovery via Token and Layer Merging with Diffusion Decoding Paper • 2510.10868 • Published Oct 13, 2025 • 13