audio models - a bann Collection

audio models

updated Jun 6

MIT/ast-finetuned-audioset-10-10-0.4593

Audio Classification • 86.6M params • Updated Sep 6, 2023 • 546k downloads • 360 likes

Running on Zero

Agents

315

Llasa 3b Tts

🔥

Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Paused

Agents

Featured

202

YuE

👩

Generate music from lyrics and genre tags
Running on Zero

Agents

Featured

414

Zonos

🌍

Generate high-quality speech from text with optional voice cloning
Running on Zero

Agents

Featured

174

AudioX

👀

Generate audio from text, video, or audio prompts
Running on Zero

Agents

821

IndexTTS 2 Demo

🏢

Generate expressive speech from text with voice and emotion control
Running on Zero

Agents

Featured

560

ACE-Step v1.5

🎵

Music Generation Foundation Model v1.5
Running on Zero

Agents

Featured

35

Audio Flamingo Next

🔊

Answer questions about uploaded audio or YouTube videos

Qwen/Qwen2-Audio-7B-Instruct

Audio-Text-to-Text • 8B params • Updated Jan 12, 2025 • 602k downloads • 546 likes

laion/clap-htsat-fused

Audio Classification • 0.2B params • Updated Jan 12 • 12.1M downloads • 114 likes