# Training on Lightning AI

This guide explains how to run CommitGuard GRPO training on a Lightning AI GPU Studio.
## Recommended Instance

- GPU: An NVIDIA L4 (24 GB) or A10G (24 GB) is sufficient for Llama-3.2-3B with Unsloth 4-bit quantization.
- Image: The default Linux / PyTorch images are fine; the setup script handles all dependencies.
## Setup & Train in One Step

- Open a terminal in your Lightning AI Studio.
- Run the setup script:

```bash
bash scripts/lightning_setup.sh
```
What the script does:

- Installs `uv` for fast dependency management.
- Creates a virtual environment and installs all requirements (Unsloth, TRL, etc.).
- Starts the `commitguard_env` server in the background (via `tmux` if available).
- Runs `scripts/train_grpo.py`.
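The "via `tmux` if available" step above can be sketched as follows. This is an illustrative snippet, not the actual contents of `scripts/lightning_setup.sh`: the session name `env_server` comes from this guide, but the server command is a placeholder.

```bash
# Hypothetical sketch of the server-launch logic: prefer a detached tmux
# session (so logs can be attached to later with `tmux attach`), and fall
# back to nohup when tmux is not installed. The python invocation is a
# placeholder, not the real server entrypoint.
pick_launch_cmd() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "tmux new-session -d -s env_server 'python -m commitguard_env'"
  else
    echo "nohup python -m commitguard_env > env_server.log 2>&1 &"
  fi
}

# With tmux installed, the server would run in a named background session:
pick_launch_cmd tmux
```

Running the server inside a named `tmux` session is what makes the `tmux attach -t env_server` step below possible.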
## Manual Steps (Optional)

### 1. View Training Logs

If you want to see the environment server logs:

```bash
tmux attach -t env_server
```

(Press `Ctrl+B`, then `D` to detach.)
### 2. Hugging Face Integration

To save your model to the Hugging Face Hub, log in before training:

```bash
huggingface-cli login
```
### 3. Checkpoints

Checkpoints and the final merged LoRA adapter are saved to:

```
outputs/commitguard-llama-3b/final
```
## Troubleshooting

- OOM Error: If you hit an Out-Of-Memory error, try reducing `--batch-size` or `--num-generations` in `scripts/train_grpo.py`.
- Server Connection: If training fails with connection errors, check that the server started correctly:

  ```bash
  curl http://localhost:8000/health
  ```
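If a one-off `curl` check is not enough (for example, the server needs a few seconds to come up), a small retry loop can wait for the health endpoint. This helper is hypothetical and not part of the repo; it only assumes the `/health` URL mentioned above.

```bash
# Hypothetical helper: poll a health endpoint until it responds or the
# attempt budget runs out. Returns 0 on success, 1 on timeout.
wait_for_health() {
  url="$1"
  tries="${2:-10}"   # default: 10 attempts, one second apart
  i=0
  while [ "$i" -lt "$tries" ]; do
    # -s: silent, -f: treat HTTP errors (4xx/5xx) as failures
    if curl -sf "$url" >/dev/null 2>&1; then
      return 0
    fi
    i=$((i + 1))
    sleep 1
  done
  return 1
}

# Usage (run before training starts):
#   wait_for_health "http://localhost:8000/health" 30 || echo "server not up"
```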