Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach Paper • 2410.00025 • Published Sep 16, 2024
fastabx: A library for efficient computation of ABX discriminability Paper • 2505.02692 • Published May 5, 2025
BabyHuBERT: Multilingual Self-Supervised Learning for Segmenting Speakers in Child-Centered Long-Form Recordings Paper • 2509.15001 • Published Sep 18, 2025
SpidR: Learning Fast and Stable Linguistic Units for Spoken Language Models Without Supervision Paper • 2512.20308 • Published Dec 23, 2025
DiscoPhon: Benchmarking the Unsupervised Discovery of Phoneme Inventories With Discrete Speech Units Paper • 2603.18612 • Published Mar 19