MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 9 items • Updated 15 days ago • 66
Running on Zero Agents Featured 266 LongCat-Video-Avatar 1.5 🎤 266 Audio-driven talking-head video generation (Meituan LongCat)
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 16 items • Updated Dec 24, 2025 • 244
Llasa Collection TTS foundation model compatible with Llama framework (160k hours tokenized speech data released) • 12 items • Updated 2 days ago • 22
Running on Zero Agents 315 Llasa 3b Tts 🔥 315 Zero Shot voice cloning with llasa 3b (Unofficial Demo)