Transcribe audio files into readable text
Generate 3D models and videos from text or images
Create textured 3D meshes from text, images, or models