How to use from the Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="Jackrong/Qwen3.5-4B-Python-Coder")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)

# Load model directly
from transformers import AutoProcessor, AutoModelForImageTextToText

processor = AutoProcessor.from_pretrained("Jackrong/Qwen3.5-4B-Python-Coder")
model = AutoModelForImageTextToText.from_pretrained("Jackrong/Qwen3.5-4B-Python-Coder")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
inputs = processor.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

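# Generate up to 40 new tokens
outputs = model.generate(**inputs, max_new_tokens=40)
# Decode only the newly generated tokens, skipping the prompt tokens
print(processor.decode(outputs[0][inputs["input_ids"].shape[-1]:]))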
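
The final line of the snippet above slices the output before decoding. For a decoder-only model, `generate` returns the prompt tokens followed by the newly generated ones, so the slice drops the prompt. A minimal sketch of that slicing, using plain Python lists in place of tensors; the helper name `strip_prompt` and the token IDs are hypothetical and only illustrative:

```python
def strip_prompt(output_ids, prompt_length):
    """Return only the tokens that follow the prompt."""
    return output_ids[prompt_length:]


# Hypothetical token IDs: the first three stand in for the prompt.
prompt_ids = [101, 202, 303]
output_ids = prompt_ids + [404, 505]

new_tokens = strip_prompt(output_ids, len(prompt_ids))
print(new_tokens)
```

In the real snippet, `inputs["input_ids"].shape[-1]` plays the role of `prompt_length`.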
⚠️ Preview Status

This repository is currently a preview release.

The model is still under active testing, and I am continuing to explore more suitable training settings and optimization strategies. Because of this, the current version should be considered experimental.

Downloading or using this model for serious evaluation is not recommended yet. A more stable version will be released after further testing and parameter tuning.

Downloads last month: 66
Safetensors
Model size: 5B params
Tensor type: BF16 · F32

Model tree for Jackrong/Qwen3.5-4B-Python-Coder

Finetuned: Qwen/Qwen3.5-4B
Finetuned (91): this model
Quantizations: 1 model