Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt Paper • 2403.11780 • Published Mar 18, 2024
Accompanied Singing Voice Synthesis with Fully Text-controlled Melody Paper • 2407.02049 • Published Jul 2, 2024
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling Paper • 2408.16532 • Published Aug 29, 2024 • 50
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks Paper • 2409.13832 • Published Sep 20, 2024 • 1
MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization Paper • 2410.12957 • Published Oct 16, 2024 • 8
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control Paper • 2409.15977 • Published Sep 24, 2024 • 2
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis Paper • 2312.10741 • Published Dec 17, 2023 • 1
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching Paper • 2502.12572 • Published Feb 18, 2025 • 2
MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis Paper • 2502.18924 • Published Feb 26, 2025 • 16
Versatile Framework for Song Generation with Prompt-based Control Paper • 2504.19062 • Published Apr 27, 2025 • 6
Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching Paper • 2406.00320 • Published Jun 1, 2024
STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation Paper • 2507.06670 • Published Jul 9, 2025
ALIVE: Animate Your World with Lifelike Audio-Video Generation Paper • 2602.08682 • Published Feb 10 • 2
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue Paper • 2605.30993 • Published 4 days ago • 38
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue Paper • 2605.30993 • Published 4 days ago • 38
MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization Paper • 2410.12957 • Published Oct 16, 2024 • 8