Running on Zero • 14 • Qwen3-VL Multimodal Search Engine • Cross-modal text-image search powered by Qwen3-VL
huihui-ai/Huihui-Qwen3-VL-8B-Instruct-abliterated Image-Text-to-Text • 9B • Updated Dec 15, 2025 • 7.46k • 155
fancyfeast/llama-joycaption-beta-one-hf-llava Image-Text-to-Text • 8B • Updated May 16, 2025 • 60.4k • 309
huihui-ai/Huihui-MiniCPM-V-4_5-abliterated Image-Text-to-Text • 9B • Updated Sep 8, 2025 • 3.1k • 29
Running on Zero • 23 • Joy Caption Beta One • Generate descriptive captions for images with various styles and formats
Running on Zero • Featured • 933 • Joy Caption Beta One • Generate detailed captions or tags for any uploaded image
HuggingFaceTB/SmolVLM2-500M-Video-Instruct Image-Text-to-Text • Updated Apr 8, 2025 • 133k • 119
yayayaaa/florence-2-large-ft-moredetailed Image-to-Text • 0.8B • Updated Dec 13, 2025 • 74 • 16
Runtime error • Featured • 198 • Better Florence 2 • Analyze images to detect objects, generate captions, or perform OCR
Running on Zero • Featured • 826 • Florence 2 • Perform image captioning, detection, OCR, and more with Florence-2