DeepSeek-OCR
Extract text from images and convert to markdown
A unified multimodal understanding and generation model
Edit photos with scribbles and AI-driven color changes
Easily expand image boundaries
Text-to-3D and image-to-3D
Engage in multimedia chat with LLMs and ML models
Generate audio from text descriptions