Try out Step-Audio-EditX
Chat with Xiaomi MiMo-Audio using voice
MOSS-TTSD: Text to Spoken Dialogue Generation
Generate full app code from a simple description