Qwen3-TTS Demo 🎙 Space Generate realistic speech from text with custom voices or voice cloning • Running on Zero • Featured • 1.16k
LightOnOCR-2 🦉 Collection LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 13 days ago • 20
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 16 items • Updated 4 days ago • 53
Step-Audio-R1 Collection Step-Audio-R1 is the first audio language model to successfully unlock test-time compute scaling. • 4 items • Updated 20 days ago • 18