Update README: remove Unsloth branding, add detailed capabilities and usage

efcee03 verified about 15 hours ago

4.84 kB

	---
	base_model: google/gemma-4-e2b-it
	tags:
	- text-generation-inference
	- transformers
	- gemma4
	- trl
	- peft
	- cybersecurity
	- devsecops
	- security
	- lora
	license: apache-2.0
	language:
	- en
	---

	# Gemma 4 E2B — Cybersecurity Interview Expert

	A QLoRA fine-tuned version of [Gemma 4 E2B Instruct](https://huggingface.co/google/gemma-4-e2b-it) specialized in deep, production-level cybersecurity knowledge. This model answers technical security interview questions with precision, concrete examples, and actionable recommendations.

	---

	## Model Details

	\| Property \| Value \|
	\|---\|---\|
	\| Base model \| google/gemma-4-e2b-it (2B parameters) \|
	\| Fine-tuning method \| QLoRA (rank 16, α 16) \|
	\| Trainable parameters \| 31M / 5.15B (0.60%) \|
	\| Training data \| 646 curated cybersecurity interview Q&A pairs \|
	\| Epochs \| 3 \|
	\| Final training loss \| 0.574 \|
	\| License \| Apache 2.0 \|

	---

	## Expertise & Capabilities

	This model demonstrates expert-level knowledge across the full spectrum of modern cybersecurity:

	### Cloud & Container Security
	- Docker security hardening (rootless containers, capabilities, seccomp, AppArmor)
	- Kubernetes RBAC, Pod Security Standards, network policies, admission controllers
	- AWS IAM least-privilege design, ECR image scanning, Terraform security patterns
	- Cloud-native threat modeling and attack surface reduction

	### DevSecOps & CI/CD
	- Secure pipeline design (ArgoCD, GitHub Actions, GitLab CI)
	- Supply chain security: SLSA, SBOM, sigstore/cosign, dependency verification
	- Secrets management (Vault, AWS Secrets Manager, SOPS)
	- Infrastructure-as-Code security scanning (Checkov, tfsec, Terrascan)

	### Application & Secure Coding
	- OWASP Top 10 — root cause analysis and remediation
	- Injection attacks (SQL, command, LDAP, template), XSS, SSRF, deserialization
	- Authentication & authorization: OAuth 2.0, OIDC, JWT, PKCE, session security
	- Cryptography: TLS configuration, key management, algorithm selection

	### Threat Intelligence & Offensive Security
	- SOC operations, SIEM correlation rules, threat hunting
	- MITRE ATT&CK mapping and adversary emulation
	- Active Directory attack paths (Kerberoasting, Pass-the-Hash, DCSync)
	- Red team tactics and purple team collaboration

	### Emerging & Specialized Domains
	- AI/LLM security: prompt injection, model poisoning, guardrail bypasses
	- OT/ICS/SCADA security: Purdue model, IEC 62443, air-gap strategies
	- Blockchain & smart contract auditing (reentrancy, overflow, access control)
	- Digital forensics, incident response, and malware analysis

	---

	## Usage

	```python
	from transformers import AutoTokenizer, AutoModelForCausalLM
	from peft import PeftModel
	import torch

	base_model = "google/gemma-4-e2b-it"
	adapter = "rezaduty/gemma4-e2b-cybersecurity-interview"

	tokenizer = AutoTokenizer.from_pretrained(adapter)
	model = AutoModelForCausalLM.from_pretrained(
	base_model,
	torch_dtype=torch.bfloat16,
	device_map="auto",
	)
	model = PeftModel.from_pretrained(model, adapter)

	messages = [
	{
	"role": "system",
	"content": [{"type": "text", "text": (
	"You are an expert cybersecurity engineer specializing in DevSecOps, "
	"container security, and cloud-native security. Answer technical interview "
	"questions with depth, precision, and concrete examples."
	)}]
	},
	{
	"role": "user",
	"content": [{"type": "text", "text": "Explain why running Docker containers as root is a security risk and how to fix it."}]
	},
	]

	inputs = tokenizer.apply_chat_template(
	messages,
	tokenize=True,
	add_generation_prompt=True,
	return_tensors="pt",
	).to(model.device)

	output = model.generate(
	input_ids=inputs,
	max_new_tokens=512,
	temperature=0.7,
	top_p=0.9,
	use_cache=True,
	)
	print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
	```

	---

	## Training Dataset

	Covers 15 curated topic domains across 646 high-quality question/answer pairs:

	- Container & Kubernetes security
	- Cloud IAM, ECR, Terraform security
	- CI/CD and ArgoCD pipeline security
	- AI/LLM security
	- DevOps patterns and security tooling
	- Secure coding (OWASP, injection, crypto)
	- SOC operations and threat intelligence
	- Active Directory and red team techniques
	- Software architecture and design security
	- Authentication, identity, and supply chain
	- OT/ICS/SCADA security
	- Blockchain and smart contract security
	- OS hardening, cloud SaaS, and forensics

	---

	## System Prompt

	For best results, use this system prompt:

	```
	You are an expert cybersecurity engineer specializing in DevSecOps, container security, and cloud-native security. Answer technical interview questions with depth, precision, and concrete examples.
	```

	---

	## Developed by

	[rezaduty](https://huggingface.co/rezaduty)