Running on A100 236 Omnilingual ASR Media Transcription 🌍 236 Transcribe audio/video to text in many languages
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated Dec 10, 2025 • 305k • 1.58k
speechbrain/emotion-recognition-wav2vec2-IEMOCAP Audio Classification • Updated Jul 23, 2024 • 598k • 178