AI & ML interests
None defined yet.
Recent Activity
Papers
UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory
Table-as-Search: Formulate Long-Horizon Agentic Information Seeking as Table Completion
Our advanced models and datasets for exploring the frontiers of MT
An unified model for multimodal understanding, text-to-image generation, and image editing.
With 29B parameters, Ovis1.6-Gemma2-27B achieves exceptional performance in the OpenCompass benchmark, ranking among the top-tier open-source MLLMs.
-
AIDC-AI/Ovis1.6-Gemma2-27B
Image-Text-to-Text β’ 29B β’ Updated β’ 29 β’ 62 -
AIDC-AI/Ovis1.6-Gemma2-9B
Image-Text-to-Text β’ Updated β’ 23.7k β’ 273 -
AIDC-AI/Ovis1.6-Gemma2-9B-GPTQ-Int4
Image-Text-to-Text β’ Updated β’ 9 β’ 9 -
AIDC-AI/Ovis1.6-Llama3.2-3B
Image-Text-to-Text β’ 4B β’ Updated β’ 23.3k β’ 49
Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering under stringent computational constraints.
Our next-generation MLLMs for native-resolution vision and advanced reasoning
Our latest advancement in multi-modal large language models (MLLMs)
Ovis1.5 is fully open-source: we release training datasets, training & inference codes, and model weights.
Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering under stringent computational constraints.
Our advanced models and datasets for exploring the frontiers of MT
Our next-generation MLLMs for native-resolution vision and advanced reasoning
An unified model for multimodal understanding, text-to-image generation, and image editing.
Our latest advancement in multi-modal large language models (MLLMs)
With 29B parameters, Ovis1.6-Gemma2-27B achieves exceptional performance in the OpenCompass benchmark, ranking among the top-tier open-source MLLMs.
-
AIDC-AI/Ovis1.6-Gemma2-27B
Image-Text-to-Text β’ 29B β’ Updated β’ 29 β’ 62 -
AIDC-AI/Ovis1.6-Gemma2-9B
Image-Text-to-Text β’ Updated β’ 23.7k β’ 273 -
AIDC-AI/Ovis1.6-Gemma2-9B-GPTQ-Int4
Image-Text-to-Text β’ Updated β’ 9 β’ 9 -
AIDC-AI/Ovis1.6-Llama3.2-3B
Image-Text-to-Text β’ 4B β’ Updated β’ 23.3k β’ 49
Ovis1.5 is fully open-source: we release training datasets, training & inference codes, and model weights.