NVIDIA OmniDreams Collection NVIDIA OmniDreams model checkpoints and sample datasets. • 3 items • Updated 3 days ago • 5
MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 7 items • Updated May 2 • 66
Chatterbox TTS Collection Chatterbox and Chatterbox Turbo By ResembleAI • 10 items • Updated Mar 2 • 6
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning Paper • 2511.18659 • Published Nov 24, 2025 • 25
Chatterbox Turbo Collection Ultra-Fast, Open-Source Text-to-Speech for Real-Time Voice AI • 3 items • Updated Dec 15, 2025 • 21
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published Dec 4, 2025 • 178
view article Article We Got Claude to Fine-Tune an Open Source LLM burtenshaw, evalstate • Dec 4, 2025 • 629
XVLA Collection X-VLA is a soft-prompted Transformer for cross-embodiment robot learning • 6 items • Updated Dec 4, 2025 • 13
Real-time Vision Models Collection A collection of real-time detectors. • 20 items • Updated Feb 18 • 23
MobileCLIP2 Collection MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B • 30 items • Updated Apr 23 • 62
FastVLM Collection Efficient Vision Encoding for Vision Language Models • 8 items • Updated Mar 2 • 113
view article Article Open-source DeepResearch – Freeing our search agents +3 m-ric, albertvillanova, merve, thomwolf, clefourrier • Feb 4, 2025 • 1.32k
WaveUI Collection WaveUI is a collection of datasets and tools to improve UI object detection • 5 items • Updated Mar 2 • 10
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 3 days ago • 164
view article Article Enjoy the Power of Phi-3 with ONNX Runtime on your device Emma-N • May 22, 2024 • 26