Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf MassivDash/Gemma-4-Rust-Coder
# Run inference directly in the terminal:
llama-cli -hf MassivDash/Gemma-4-Rust-Coder
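Once the server is up, any OpenAI-compatible client can talk to it. A minimal curl sketch, assuming llama-server's default address http://localhost:8080:
# Ask the model for Rust code via the OpenAI-compatible chat endpoint:
curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{"messages": [{"role": "user", "content": "Write a Rust function that reverses a string."}]}'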
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf MassivDash/Gemma-4-Rust-Coder
# Run inference directly in the terminal:
./llama-cli -hf MassivDash/Gemma-4-Rust-Coder
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf MassivDash/Gemma-4-Rust-Coder
# Run inference directly in the terminal:
./build/bin/llama-cli -hf MassivDash/Gemma-4-Rust-Coder
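As a side note on the configure step above, llama.cpp can also be built with a GPU backend. A sketch assuming an NVIDIA GPU and the CUDA toolkit installed:
# Enable the CUDA backend at configure time, then rebuild:
cmake -B build -DGGML_CUDA=ON
cmake --build build -j --target llama-server llama-cli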
Use Docker
docker model run hf.co/MassivDash/Gemma-4-Rust-Coder

Gemma-4-Rust-Coder: GGUF
This model is a specialized fine-tune of Gemma 4, optimized for Rust systems programming, memory safety patterns, and high-performance development. It was trained with Unsloth Studio for efficient fine-tuning.
🦀 Fine-Tuning Focus
The model has been tuned to excel at:
- Idiomatic Rust: Writing clean, "Rusty" code using modern patterns.
- Concurrency: Deep understanding of `Send`, `Sync`, and async runtimes like `Tokio` (see the prompt sketch after this list).
- Vision-to-Code: Using its multimodal capabilities to translate architecture diagrams or UI mockups into functional Rust code.
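To illustrate the concurrency focus, a hypothetical prompt (the output will vary from run to run):
# Ask for an async worker pool; the model is tuned toward idiomatic Tokio patterns:
llama-cli -hf MassivDash/Gemma-4-Rust-Coder --jinja -p "Write an idiomatic Rust worker pool using Tokio tasks and an mpsc channel."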
🤝 Credits & Acknowledgments
Special thanks to Fortytwo-Network for providing the Strandset-Rust-v1 dataset. This model's specialized knowledge of the Rust ecosystem is a direct result of this high-quality data.
🚀 Usage
This model is converted to GGUF format for seamless use with llama.cpp and other compatible runtimes.
Example usage:
- Text-only LLM:
  llama-cli -hf MassivDash/Gemma-4-Rust-Coder --jinja
- Multimodal / Vision:
  llama-mtmd-cli -hf MassivDash/Gemma-4-Rust-Coder --jinja
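For the Vision-to-Code workflow, a sketch passing a local mockup image (mockup.png is a hypothetical file name):
# Describe an image and ask for matching Rust code:
llama-mtmd-cli -hf MassivDash/Gemma-4-Rust-Coder --jinja --image mockup.png -p "Translate this UI mockup into Rust code."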
📂 Available Model files:
- gemma-4-e2b-it.Q3_K_M.gguf
- gemma-4-e2b-it.BF16-mmproj.gguf
⚠️ Ollama Note for Vision Models
Important: Ollama currently does not support separate mmproj files for vision models.
To create an Ollama model from this vision model:
- Place the `Modelfile` in the same directory as the fine-tuned BF16 merged model.
- Run: `ollama create model_name -f ./Modelfile` (replace `model_name` with your desired name). A minimal sketch is shown below.
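A sketch of those two steps, assuming the merged BF16 model was exported to ./gemma-4-rust-coder-bf16.gguf (hypothetical file name):
# Write a one-line Modelfile pointing at the merged model, then create it:
cat > Modelfile <<'EOF'
FROM ./gemma-4-rust-coder-bf16.gguf
EOF
ollama create gemma-4-rust-coder -f ./Modelfile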
🔗 Stay Connected
For more insights on AI development and fine-tuning, visit my blog: 👉 spaceout.pl
This model was trained 2x faster with Unsloth
Available quantizations: 3-bit, 4-bit, 5-bit, 8-bit.
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf MassivDash/Gemma-4-Rust-Coder
# Run inference directly in the terminal:
llama-cli -hf MassivDash/Gemma-4-Rust-Coder