Spatial-MLLM
How to use Diankun/Spatial-MLLM-subset-sft with Transformers:
# Load model directly
from transformers import AutoProcessor, AutoModelForSeq2SeqLM
processor = AutoProcessor.from_pretrained("Diankun/Spatial-MLLM-subset-sft")
model = AutoModelForSeq2SeqLM.from_pretrained("Diankun/Spatial-MLLM-subset-sft")

This repository contains the model described in Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence.
Project page: https://diankun-wu.github.io/Spatial-MLLM/
Base model: Qwen/Qwen2.5-VL-3B-Instruct