π Stack 2.9 - Pattern-Based AI Coding Assistant
A HuggingFace Spaces demo for Stack 2.9, a pattern-based AI coding assistant powered by Qwen2.5-Coder-7B.
β¨ Features
- π€ Qwen2.5-Coder-7B - State-of-the-art code generation model
- π§ 7 Integrated Tools - File operations, git, web search, shell commands
- π§ Pattern Memory - Learns from each interaction
- β‘ Fast Streaming - Real-time token-by-token generation
- πΎ 4-bit Quantization - Runs on 16GB GPU (~4GB VRAM)
π§ Available Tools
| Tool | Description |
|---|---|
file_read |
Read files from the filesystem |
file_write |
Write content to files |
git_status |
Check git repository status |
web_search |
Search the web for information |
run_command |
Execute shell commands |
create_directory |
Create new directories |
list_directory |
List directory contents |
πββοΈ Quick Start
Local Development
# Clone the repository
git clone https://github.com/your-repo/stack-2.9.git
cd stack-2.9/space
# Install dependencies
pip install -r requirements.txt
# Run the demo
python app.py --share
HuggingFace Spaces
- Create a new Space on HuggingFace
- Select "Gradio" as the SDK
- Upload the files from this directory:
app.pyrequirements.txtREADME.md
- The model will load automatically on startup
π» Usage
Example Prompts
Hello! What can you help me with?
Check git status of this repository
Search for best practices for Python async programming
List the files in the current directory
Write a simple Python function to calculate fibonacci
How do I use Git to create a new branch?
What's your memory of our conversation?
Python API
from app import StackModel, memory
# Initialize model
model = StackModel()
model.load()
# Generate response
response = model.generate("Write a hello world in Python")
print(response)
# Check memory stats
print(memory.get_stats())
π Environment Variables
HF_TOKEN- Your HuggingFace token for private models (optional)MODEL_ID- Override default model (default: Qwen/Qwen2.5-Coder-7B-Instruct)
π Memory System
Stack 2.9 includes a pattern memory system that:
- Tracks Interactions - Records every user-assistant exchange
- Learns Patterns - Identifies frequently used tools
- Stores Code - Saves useful code snippets
- Adapts Behavior - Uses learned context to improve responses
π οΈ Tech Stack
- Model: Qwen2.5-Coder-7B-Instruct
- Quantization: 4-bit (bitsandbytes)
- Framework: Gradio 4.0+
- Backend: Transformers + Accelerate
- GPU: 16GB VRAM recommended
π License
MIT License - see LICENSE file for details.
π Acknowledgments
- Qwen - Base model
- HuggingFace - Spaces hosting
- Gradio - UI framework
Made with β€οΈ by Stack 2.9