Small Language Models
microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 224k • 1.56k
TTS
SparkAudio/Spark-TTS-0.5B • Text-to-Speech • Updated Mar 7, 2025 • 908 • 724
sesame/csm-1b • Text-to-Speech • Updated Dec 1, 2025 • 48.7k • 2.33k
hexgrad/Kokoro-82M • Text-to-Speech • Updated Apr 10, 2025 • 2.27M • 5.63k
speaches-ai/Kokoro-82M-v1.0-ONNX-fp16 • Text-to-Speech • Updated Mar 21, 2025 • 2