prithivMLmods/proxima-ocr-d.markdown-post3.0.l
Image-Text-to-Text β’ 9B β’ Updated β’ 12 β’ 5
Generate high-quality speech from text with optional voice cloning
Long-form Speech Synthesis with Zonos
Long-Form Speech Synthesis with Zonos and DeepFilterNet
Generate audio from text with customizable emotions
mcp_server
Separate music into vocals and instrumental
Translate text from English to Russian or Chinese
Generate virtual tryβon images of clothes on a person
Generate music from a text description and optional melody
Track your online presence with reverse face search
Swap faces in images or videos
Find image sources by uploading an image