Sound Event Detection MIT/ast-finetuned-audioset-10-10-0.4593 Audio Classification • 86.6M • Updated Sep 6, 2023 • 275k • 357 laion/larger_clap_music_and_speech Feature Extraction • Updated Oct 31, 2023 • 39.4k • • 40
MIT/ast-finetuned-audioset-10-10-0.4593 Audio Classification • 86.6M • Updated Sep 6, 2023 • 275k • 357
Speaker Diarization and Transcripts openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.8M • • 3.08k pyannote/segmentation Voice Activity Detection • Updated May 10, 2024 • 3.43M • 678 pyannote/segmentation-3.0 Voice Activity Detection • Updated May 10, 2024 • 6.92M • 1.14k pyannote/speaker-diarization-3.1 Automatic Speech Recognition • Updated May 10, 2024 • 8.18M • 2.27k
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.8M • • 3.08k
Sound Event Detection MIT/ast-finetuned-audioset-10-10-0.4593 Audio Classification • 86.6M • Updated Sep 6, 2023 • 275k • 357 laion/larger_clap_music_and_speech Feature Extraction • Updated Oct 31, 2023 • 39.4k • • 40
MIT/ast-finetuned-audioset-10-10-0.4593 Audio Classification • 86.6M • Updated Sep 6, 2023 • 275k • 357
Speaker Diarization and Transcripts openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.8M • • 3.08k pyannote/segmentation Voice Activity Detection • Updated May 10, 2024 • 3.43M • 678 pyannote/segmentation-3.0 Voice Activity Detection • Updated May 10, 2024 • 6.92M • 1.14k pyannote/speaker-diarization-3.1 Automatic Speech Recognition • Updated May 10, 2024 • 8.18M • 2.27k
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.8M • • 3.08k