Text Generation
PEFT
Safetensors
English
cybersecurity
vulnerability-detection
secure-code
codellama
lora
qlora
code
Instructions to use Younis2003/CodeLlama_for_code_security with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use Younis2003/CodeLlama_for_code_security with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("meta-llama/CodeLlama-13b-hf") model = PeftModel.from_pretrained(base_model, "Younis2003/CodeLlama_for_code_security") - Notebooks
- Google Colab
- Kaggle
File size: 3,533 Bytes
f6b5b49 2b02bf2 d14d7d7 f6b5b49 d14d7d7 2b02bf2 d14d7d7 b85f178 f6b5b49 d14d7d7 f6b5b49 0b35aa3 e24ed34 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 7ee72a9 2b02bf2 92db066 7ee72a9 2b02bf2 92db066 7ee72a9 2b02bf2 d14d7d7 7ee72a9 2b02bf2 7ee72a9 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 2b02bf2 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 92db066 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 f6b5b49 d14d7d7 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 | ---
base_model: meta-llama/CodeLlama-13b-hf
library_name: peft
pipeline_tag: text-generation
language:
- en
license: apache-2.0
datasets:
- Younis2003/secure_dataset_cvefixes
tags:
- cybersecurity
- vulnerability-detection
- secure-code
- codellama
- lora
- peft
- qlora
- code
---
# CodeLlama_for_code_security
<img src="https://external-content.duckduckgo.com/iu/?u=https%3A%2F%2Fres.cloudinary.com%2Fmomentum-media-group-pty-ltd%2Fimage%2Fupload%2Fv1693207063%2FCyber%2520Security%2FCode_Llama_csc_mgxscf.jpg&f=1&nofb=1&ipt=3dd5f67c5a258d603fe9b43ceb3ccb4a9fba0e35d17193c1a50dc97e0f3df10c" width="1000"/>
# Overview
CodeLlama_for_code_security is a **LoRA fine-tuned adapter** designed for vulnerability detection and secure code remediation.
The model analyzes vulnerable source code and generates a secure fixed version together with structured vulnerability explanations including CVE and CWE metadata.
The adapter is trained on top of **CodeLlama-13B**.
---
# Model Details
**Developed by:** Younis Alshibli
**Model type:** LoRA Adapter (PEFT)
**Base Model:** CodeLlama-13B
**Language:** English
**License:** Apache 2.0
---
# Intended Use
The model is designed for:
- Vulnerability detection
- Secure code remediation
- Security analysis of source code
- Automated security review
- AI-assisted cybersecurity research
Example applications:
- Secure code assistants
- AI vulnerability scanners
- Cybersecurity research tools
---
# Evaluation
The model was evaluated using **semantic similarity between generated fixes and ground truth secure fixes**.
| Metric | Score |
|------|------|
| Embedding Similarity | **0.9643** |
This corresponds to approximately **96% semantic similarity** between predicted outputs and expected secure code fixes.
---
# Prompt Format
The model expects prompts in the following structured format:
```
### System:
You are a security expert. Be concise. Analyze and fix.
### Task:
1. IDENTIFY: One sentence naming the CWE.
2. STRATEGY: One sentence fix strategy.
3. REMEDIATE: Provide the code under '### Fixed Code:'.
### Programming Language:
{language}
### Vulnerable Code:
{vuln_code}
### Analysis:
1. CWE Identification:
```
The model will generate:
* Fixed Code
* Vulnerability Explanation
* CVE metadata
* CWE metadata
---
# How to Use
This model is a **LoRA adapter** and requires the base model.
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
base_model = "meta-llama/CodeLlama-13b-hf"
adapter = "Younis2003/CodeLlama_for_code_security"
tokenizer = AutoTokenizer.from_pretrained(base_model)
model = AutoModelForCausalLM.from_pretrained(
base_model,
device_map="auto"
)
model = PeftModel.from_pretrained(model, adapter)
```
---
# Training Data
The model was trained using the dataset:
**secure_dataset_cvefixes**
# Dataset source:
```
Younis2003/secure_dataset_cvefixes
```
The dataset contains:
* vulnerable code
* fixed code
* CVE descriptions
* CWE classifications
---
# Limitations
* The model may not detect all vulnerabilities.
* Results should always be reviewed by security experts.
* Complex security flaws may require manual analysis.
---
# Ethical Considerations
This model is intended for **defensive cybersecurity research and secure software development**.
It should not be used for malicious activities.
---
# Author
Developed by **Younis Alshibli** as part of an AI research project on:
* AI vulnerability detection
* automated secure code remediation |