rezaduty's picture
Update README: remove Unsloth branding, add detailed capabilities and usage
efcee03 verified
---
base_model: google/gemma-4-e2b-it
tags:
- text-generation-inference
- transformers
- gemma4
- trl
- peft
- cybersecurity
- devsecops
- security
- lora
license: apache-2.0
language:
- en
---
# Gemma 4 E2B — Cybersecurity Interview Expert
A QLoRA fine-tuned version of [Gemma 4 E2B Instruct](https://huggingface.co/google/gemma-4-e2b-it) specialized in **deep, production-level cybersecurity knowledge**. This model answers technical security interview questions with precision, concrete examples, and actionable recommendations.
---
## Model Details
| Property | Value |
|---|---|
| **Base model** | google/gemma-4-e2b-it (2B parameters) |
| **Fine-tuning method** | QLoRA (rank 16, α 16) |
| **Trainable parameters** | 31M / 5.15B (0.60%) |
| **Training data** | 646 curated cybersecurity interview Q&A pairs |
| **Epochs** | 3 |
| **Final training loss** | 0.574 |
| **License** | Apache 2.0 |
---
## Expertise & Capabilities
This model demonstrates expert-level knowledge across the full spectrum of modern cybersecurity:
### Cloud & Container Security
- Docker security hardening (rootless containers, capabilities, seccomp, AppArmor)
- Kubernetes RBAC, Pod Security Standards, network policies, admission controllers
- AWS IAM least-privilege design, ECR image scanning, Terraform security patterns
- Cloud-native threat modeling and attack surface reduction
### DevSecOps & CI/CD
- Secure pipeline design (ArgoCD, GitHub Actions, GitLab CI)
- Supply chain security: SLSA, SBOM, sigstore/cosign, dependency verification
- Secrets management (Vault, AWS Secrets Manager, SOPS)
- Infrastructure-as-Code security scanning (Checkov, tfsec, Terrascan)
### Application & Secure Coding
- OWASP Top 10 — root cause analysis and remediation
- Injection attacks (SQL, command, LDAP, template), XSS, SSRF, deserialization
- Authentication & authorization: OAuth 2.0, OIDC, JWT, PKCE, session security
- Cryptography: TLS configuration, key management, algorithm selection
### Threat Intelligence & Offensive Security
- SOC operations, SIEM correlation rules, threat hunting
- MITRE ATT&CK mapping and adversary emulation
- Active Directory attack paths (Kerberoasting, Pass-the-Hash, DCSync)
- Red team tactics and purple team collaboration
### Emerging & Specialized Domains
- AI/LLM security: prompt injection, model poisoning, guardrail bypasses
- OT/ICS/SCADA security: Purdue model, IEC 62443, air-gap strategies
- Blockchain & smart contract auditing (reentrancy, overflow, access control)
- Digital forensics, incident response, and malware analysis
---
## Usage
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel
import torch
base_model = "google/gemma-4-e2b-it"
adapter = "rezaduty/gemma4-e2b-cybersecurity-interview"
tokenizer = AutoTokenizer.from_pretrained(adapter)
model = AutoModelForCausalLM.from_pretrained(
base_model,
torch_dtype=torch.bfloat16,
device_map="auto",
)
model = PeftModel.from_pretrained(model, adapter)
messages = [
{
"role": "system",
"content": [{"type": "text", "text": (
"You are an expert cybersecurity engineer specializing in DevSecOps, "
"container security, and cloud-native security. Answer technical interview "
"questions with depth, precision, and concrete examples."
)}]
},
{
"role": "user",
"content": [{"type": "text", "text": "Explain why running Docker containers as root is a security risk and how to fix it."}]
},
]
inputs = tokenizer.apply_chat_template(
messages,
tokenize=True,
add_generation_prompt=True,
return_tensors="pt",
).to(model.device)
output = model.generate(
input_ids=inputs,
max_new_tokens=512,
temperature=0.7,
top_p=0.9,
use_cache=True,
)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```
---
## Training Dataset
Covers 15 curated topic domains across 646 high-quality question/answer pairs:
- Container & Kubernetes security
- Cloud IAM, ECR, Terraform security
- CI/CD and ArgoCD pipeline security
- AI/LLM security
- DevOps patterns and security tooling
- Secure coding (OWASP, injection, crypto)
- SOC operations and threat intelligence
- Active Directory and red team techniques
- Software architecture and design security
- Authentication, identity, and supply chain
- OT/ICS/SCADA security
- Blockchain and smart contract security
- OS hardening, cloud SaaS, and forensics
---
## System Prompt
For best results, use this system prompt:
```
You are an expert cybersecurity engineer specializing in DevSecOps, container security, and cloud-native security. Answer technical interview questions with depth, precision, and concrete examples.
```
---
## Developed by
[rezaduty](https://huggingface.co/rezaduty)