FluidInference/parakeet-realtime-eou-120m-coreml Automatic Speech Recognition β’ Updated Mar 14 β’ 17.2k β’ 4
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ 6B β’ Updated Dec 10, 2025 β’ 410k β’ 1.6k
HuggingFaceTB/SmolVLM2-500M-Video-Instruct Image-Text-to-Text β’ Updated Apr 8, 2025 β’ 474k β’ 135
openai/clip-vit-large-patch14 Zero-Shot Image Classification β’ 0.4B β’ Updated Sep 15, 2023 β’ 24.8M β’ 2k
Runtime error Agents Featured 272 Edit Video By Editing Text β 272 Audio-based video editing using AI-generated transcription