Visual-Interactive Text-Image Universal Embedder (ICLR-26)
Papers
Woosh: A Sound Effects Foundation Model
Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models