Any-to-Any
Transformers
Safetensors
multilingual
minicpmo
feature-extraction
minicpm-o
omni
vision
ocr
multi-image
video
custom_code
audio
speech
voice cloning
live Streaming
realtime speech conversation
asr
tts
Instructions to use openbmb/MiniCPM-o-2_6 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use openbmb/MiniCPM-o-2_6 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("openbmb/MiniCPM-o-2_6", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
File size: 714 Bytes
c248f01 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 | {
"image_processor_type": "MiniCPMVImageProcessor",
"auto_map": {
"AutoProcessor": "processing_minicpmo.MiniCPMOProcessor",
"AutoImageProcessor": "image_processing_minicpmv.MiniCPMVImageProcessor"
},
"processor_class": "MiniCPMOProcessor",
"max_slice_nums": 9,
"scale_resolution": 448,
"patch_size": 14,
"use_image_id": true,
"image_feature_size": 64,
"im_start": "<image>",
"im_end": "</image>",
"slice_start": "<slice>",
"slice_end": "</slice>",
"unk": "<unk>",
"im_id_start": "<image_id>",
"im_id_end": "</image_id>",
"slice_mode": true,
"norm_mean": [0.5, 0.5, 0.5],
"norm_std": [0.5, 0.5, 0.5],
"version": 2.6
} |