generate a video from an image with a text prompt
Generate speech from text with emotional tone adjustment