Spaces:
Configuration error
CommitGuard AI-Paced Security Review (Meta OpenEnv Hackathon)
"Defense is on human time, offense is on AI time. CommitGuard closes that asymmetry."
The Vision
AI coding agents are shipping production code at 100x human velocity. Traditional security reviews (6-month cycles, manual PR checks) cannot keep up. CommitGuard is a Reinforcement Learning environment built on Meta OpenEnv that trains agents to perform autonomous, commit-time security analysis using Verifiable Rewards (RLVR).
The Environment
CommitGuard turns code commits into a multi-step investigation game:
- Analyze: The agent performs Chain-of-Thought reasoning.
- Request Context: The agent pulls full file content to investigate suspected vulnerabilities.
- Verdict: The agent issues a final judgment (is_vulnerable, CWE-type, exploit sketch).
Rewards:
- +1.0 for correct binary verdict.
- +0.5 for correct CWE classification.
- Up to +0.5 (continuous float) for accurate exploit keyword matching.
- Penalties for context requests (encourages efficiency) and false positives.
Results & Learning Curves
We trained Llama-3.2-3B-Instruct using GRPO via TRL and Unsloth.
1. Training Reward Curve
The reward curve shows the model learning to prioritize accuracy while maintaining investigation efficiency.
2. Detection Accuracy: Baseline vs. Trained
*Our trained agent improved detection accuracy from 50% (baseline) to 74%.*
3. Per-CWE Breakdown
*The model showed significant improvements in detecting CWE-89 (SQL Injection) and CWE-119 (Buffer Overflow).*
Demo Video
Watch as a trained CommitGuard agent requests context to identify a complex privilege escalation vulnerability that the baseline model missed.
Links
- HF Space (Env): https://huggingface.co/spaces/Nitishkumar-ai/commitguard
- Training Notebook: Link
- W&B Training Logs: Link
- HF Blog Post: Link
Technical Stack
- Framework: Meta OpenEnv 0.1.13
- RL Algorithm: GRPO (Group Relative Policy Optimization)
- Training: TRL + Unsloth (4-bit LoRA)
- Compute: HF Jobs (A10G)
Developed by Team CommitGuard: Niti, Deepak, Divyank