neuphonic/neutts-air
Text-to-Speech • 0.7B • Updated • 13.1k • 870
State-of-the-art target speech extractor
Extreme Super-Resolution via Scale Autoregression
Generate gigascale 3D models with simple text prompts
Watch and experiment with realtime AI's with visuals
Voice Activity Detection using MarbleNet model
Filter multilingual data for high-quality language models
Transcribe speech and highlight emphasized words