MIT/ast-finetuned-audioset-10-10-0.4593
Audio Classification • 86.6M • Updated • 575k • 353
Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Generate music from lyrics and genre tags
Generate expressive speech audio from text with custom voice
Generate audio from text, video, or audio prompts
Generate expressive speech from text and voice prompts
Music Generation Foundation Model v1.5
Answer questions about uploaded audio or YouTube videos