🤝 Open to Collab

12 20 15

Jiehui Huang

JackAILab

https://jackailab.github.io/

JackAILab

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating

upvoted a paper 3 days ago

UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating

submitted a paper 3 days ago

UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating

View all activity

Organizations

authored a paper 3 days ago

UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating

Paper • 2606.21661 • Published 9 days ago • 24

upvoted a paper 3 days ago

UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating

Paper • 2606.21661 • Published 9 days ago • 24

submitted a paper to Daily Papers 3 days ago

UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating

Paper • 2606.21661 • Published 9 days ago • 24

liked a dataset 3 days ago

KlingTeam/UnityShotsBench

Viewer • Updated 3 days ago • 1.43k • 1.17k • 6

updated a dataset 3 days ago

KlingTeam/UnityShotsBench

Viewer • Updated 3 days ago • 1.43k • 1.17k • 6

published a dataset 4 days ago

KlingTeam/UnityShotsBench

Viewer • Updated 3 days ago • 1.43k • 1.17k • 6

liked a model 8 days ago

nvidia/Cosmos3-Super

65B • Updated 5 days ago • 83.5k • 187

upvoted a paper 11 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 18 days ago • 204

published a Space 16 days ago

StarWAM

💻

Community

upvoted a collection 23 days ago

Cosmos3

Collection

Omnimodal World Models for Physical AI • 16 items • Updated 2 days ago • 132

upvoted a paper about 1 month ago

KVPO: ODE-Native GRPO for Autoregressive Video Alignment via KV Semantic Exploration

Paper • 2605.14278 • Published May 14 • 37

liked a model about 1 month ago

robbyant/lingbot-va-posttrain-libero-long

Robotics • Updated Apr 24 • 2

liked a model about 2 months ago

nvidia/DreamDojo

Updated Feb 23 • 53 • 36

authored 7 papers 2 months ago

ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal Conditions

Paper • 2501.12173 • Published Jan 21, 2025 • 1

BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation

Paper • 2505.06985 • Published May 11, 2025

LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation

Paper • 2508.07603 • Published Aug 11, 2025 • 1

Zero-shot 3D-Aware Trajectory-Guided image-to-video generation via Test-Time Training

Paper • 2509.06723 • Published Sep 8, 2025

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

Paper • 2512.07831 • Published Dec 8, 2025 • 17

ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation

Paper • 2512.03621 • Published Dec 3, 2025 • 9

From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing

Paper • 2512.25066 • Published Dec 31, 2025 • 5

Jiehui Huang

AI & ML interests

Recent Activity

Organizations

JackAILab's activity

StarWAM