How to use from
llama.cpp
Install (macOS, Linux)
curl -LsSf https://llama.app/install.sh | sh
# Start a local OpenAI-compatible server with a web UI:
llama serve -hf fableforge-ai/ShellWhisperer-1.5B:
# Run inference directly in the terminal:
llama cli -hf fableforge-ai/ShellWhisperer-1.5B:
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama serve -hf fableforge-ai/ShellWhisperer-1.5B:
# Run inference directly in the terminal:
llama cli -hf fableforge-ai/ShellWhisperer-1.5B:
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf fableforge-ai/ShellWhisperer-1.5B:
# Run inference directly in the terminal:
./llama-cli -hf fableforge-ai/ShellWhisperer-1.5B:
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf fableforge-ai/ShellWhisperer-1.5B:
# Run inference directly in the terminal:
./build/bin/llama-cli -hf fableforge-ai/ShellWhisperer-1.5B:
Use Docker
docker model run hf.co/fableforge-ai/ShellWhisperer-1.5B:
Quick Links

ShellWhisperer-1.5B — Ultra-Fast Shell Command Assistant

1,100+ downloads · 986MB · 20+ tok/s · Runs on anything

Ollama HuggingFace Size License Speed Runs On


What Is This?

ShellWhisperer is the fastest shell command assistant on the planet. At just 986MB (Q4_K_M), it runs on everything from a Raspberry Pi to a gaming PC. Fine-tuned from Qwen2.5-0.5B on 100K+ shell command traces, it predicts bash, zsh, PowerShell, and DevOps commands with uncanny accuracy.

Unlike general models that kind of know shell commands, ShellWhisperer was trained exclusively on shell tasks — it doesn't write poetry, it gets you the right command fast.

Quick Start

Ollama (recommended)

ollama run FableForge-AI/shellwhisperer

llama.cpp

./llama-cli -m shellwhisperer-1.5b-Q4_K_M.gguf --prompt "find all files larger than 100MB" -n 512

Python

from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("fableforge-ai/ShellWhisperer-1.5B")

Quantizations

File Size Speed Best For
Q2_K.gguf 645 MB ~25 tok/s Phone, Raspberry Pi
Q3_K_M.gguf 786 MB ~22 tok/s Low-end devices
Q4_0.gguf 895 MB ~28 tok/s Fast inference
Q4_K_M.gguf 940 MB ~22 tok/s Recommended
Q5_K_M.gguf 1.0 GB ~18 tok/s High quality
Q6_K.gguf 1.2 GB ~15 tok/s Pro quality
Q8_0.gguf 1.5 GB ~12 tok/s Near-lossless
f16.gguf 2.9 GB ~8 tok/s Full precision

Hardware Requirements

Hardware Can Run? Best Quant
Phone (2GB+ RAM) CPU Q2_K
Raspberry Pi Zero CPU Q2_K
Old laptop (4GB RAM) CPU Q4_K_M
Gaming PC (RTX 3060+) Full GPU Q5_K_M
Mac M1/M2 Full GPU Q4_K_M
Server (32GB+) Full GPU Q8_0 / F16

Why ShellWhisperer?

Feature ShellWhisperer GPT-4 Llama 3.1 8B
Size 986 MB N/A 4.8 GB
Speed ~22 tok/s CPU API only ~8 tok/s CPU
Shell accuracy 100K+ shell examples General General
Runs on phone ✅ Yes ❌ No ❌ No
Offline ✅ Yes ❌ No ⚠️ Partial

Examples in Action

User: find all files modified in last 7 days
ShellWhisperer: find / -type f -mtime -7 2>/dev/null | head -20

User: check disk usage of all mounted drives sorted by usage
ShellWhisperer: df -h | grep -v tmpfs | sort -k5 -hr

User: list all running Docker containers with ports
ShellWhisperer: docker ps --format "table {{.Names}}\t{{.Ports}}\t{{.Status}}"

User: undo last commit but keep changes in working directory
ShellWhisperer: git reset --soft HEAD~1

User: find which process is listening on port 8080
ShellWhisperer: lsof -i :8080 | grep LISTEN

Training Details

Parameter Value
Base Model Qwen2.5-0.5B
Training Data 100K+ shell command traces from Fable5
Context Window 32K tokens
Quantization imatrix-guided via llama.cpp
License Apache 2.0

FableForge Ecosystem

Model Size Best For
ShellWhisperer 986 MB Shell commands, ultra-fast
ReasonCritic-7B 3.1-16 GB Reasoning + uncensored
NEXUS-Coder 488 MB - 2.9 GB Domain coding
NEXUS-Security 488 MB - 2.9 GB Cybersecurity
NEXUS-Medical 488 MB - 2.9 GB Medical
NEXUS-Legal 488 MB - 2.9 GB Legal
NEXUS-Finance 488 MB - 2.9 GB Finance
NEXUS-Science 488 MB - 2.9 GB Science

License

Apache 2.0 — commercial use allowed.


Part of the FableForge AI ecosystem. Zero Limits. Pure Intelligence.

Downloads last month
2,007
Safetensors
Model size
2B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for fableforge-ai/ShellWhisperer-1.5B

Finetuned
(640)
this model
Finetunes
1 model
Quantizations
7 models