Running on Zero Agents 184 Music Flamingo 🎵 184 Analyze music and answer questions from audio or YouTube links
Running on Zero MCP Featured 91 BitDance-14B-64x 🚀 91 Open-source autoregressive model with binary visual tokens.
Running on Zero Agents Featured 276 granite-docling-258M demo 📝 276 Convert images of documents to structured data and answer queries
Configuration error Agents Featured 300 USO FLUX ⚡ 300 Create custom images by blending prompts with style references
Running on Zero Agents Featured 145 ToonComposer 🎨 145 Generate animated videos from images and sketches
Running on Zero Agents Featured 430 OmniGen2 👀 430 OmniGen2: Unified Image Understanding and Generation.
Paused Agents Featured 215 OmniConsistency 🚀 215 Generate styled image from reference image and external LoRA
Runtime error Agents Featured 83 MMaDA 🌍 83 Demo for MMaDA: Multimodal Large Diffusion Language Models
fancyfeast/llama-joycaption-beta-one-hf-llava Image-Text-to-Text • 8B • Updated May 16, 2025 • 119k • 349