Running on Zero Agents 7 Gemma 4 Object Detection π¦ 7 Detect objects in images with bounding boxes
Paused Agents Featured 1.94k Qwen3-TTS Demo π 1.94k Generate custom speech from text, voice descriptions, or samples
Configuration error Featured 446 FastVLM WebGPU π 446 Real-time video captioning powered by FastVLM
nomic-ai/colnomic-embed-multimodal-3b Visual Document Retrieval β’ Updated Apr 15, 2025 β’ 42.5k β’ 37
illuin-conteb/modernbert-large-insent Sentence Similarity β’ 0.4B β’ Updated Jun 2, 2025 β’ 15 β’ 4
Running Agents 12 Jina Embeddings V4 Retrieval Visual π 12 Generate tokenβwise heatmaps for an image based on a prompt
docling-project/SmolDocling-256M-preview Image-Text-to-Text β’ 0.3B β’ Updated Sep 17, 2025 β’ 30.7k β’ 1.61k
Sleeping Agents 43 YOLOv10 Document Layout Analysis π 43 Analyze scanned documents to detect and label content