Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC Paper • 2505.24200 • Published May 30, 2025
Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence Paper • 2604.24954 • Published 3 days ago • 6
nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 Any-to-Any • 33B • Updated about 20 hours ago • 9.82k • 142
Running on CPU Upgrade Featured 3.13k The Smol Training Playbook 📚 3.13k The secrets to building world-class LLMs
Open Whisper-style Speech Models (OWSM) Collection Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/ • 22 items • Updated Aug 30, 2025 • 6