LLMLingua
π
132
Compress prompts to speed up language model inference
A unified multimodal understanding and generation model.
Clone voices and generate speech from text using reference audio
Generate synchronized audio from videos or text prompts
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Huggingface space for JanusFlow-1.3B
What happened in open-source AI this year, and whatβs next?
Transcribe audio and YouTube videos into text instantly