Generate singing voice from lyrics and melody
Segment objects from images using natural language prompts