QwQ-32B-Preview
Scalable and Versatile 3D Generation from images
Transcribe audio to text with speaker diarization