Update README.md

d417113 verified 8 days ago

3.86 kB


	---
	tags:
	- autotrain
	- text-classification
	base_model: answerdotai/ModernBERT-base
	widget:
	- text: "I love AutoTrain"
	datasets:
	- RISys-Lab/cybersec-topic-classification-dataset-filtered
	---

	# Model Card: Cybersecurity Text Classifier (ModernBERT-base)

	<p align="center">
	<b> "RedSage: A Cybersecurity Generalist LLM" (ICLR 2026) </b>
	<br>
	<b>Authors:</b> Naufal Suryanto<sup>1*</sup>, Muzammal Naseer<sup>1</sup>, Pengfei Li<sup>1</sup>, Syed Talal Wasim<sup>2</sup>, Jinhui Yi<sup>2</sup>, Juergen Gall<sup>2</sup>, Paolo Ceravolo<sup>3</sup>, Ernesto Damiani<sup>3</sup>
	<br>
	<sup>1</sup>Khalifa University, <sup>2</sup>University of Bonn, <sup>3</sup>University of Milan
	<br>
	<sup>*</sup>Project Lead
	<br>
	<br>
	<a href="https://openreview.net/forum?id=W4FAenIrQ2"><img src="https://img.shields.io/badge/Paper-OpenReview-B31B1B.svg"></a>
	<a href="https://huggingface.co/RISys-Lab"><img src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-RISys--Lab-orange"></a>
	</p>

	---

	## Model Details

	* Model Type: Binary text classification model developed for domain-specific content filtering.
	* Architecture: Based on ModernBERT-base, a bidirectional transformer encoder optimized for efficiency and long-context performance.
	* Domain: Cybersecurity vs. Non-Cybersecurity.
	* License: Released as part of the open-source RedSage project resources.

	## Intended Use

	* Primary Use Case: Identifying cybersecurity-relevant documents within large-scale, unstructured web corpora such as FineWeb.
	* Application: Filtering approximately 17.2 trillion tokens from Common Crawl subsets (2013–2024) to curate the 11.7B-token CyberFineWeb corpus.
	* Intended Users: Researchers and developers focused on domain continual pretraining for cybersecurity LLMs.

	## Training Data

	* Source Dataset: Cybersecurity Topic Classification dataset.
	* Data Origin: Labeled samples collected from Reddit, StackExchange, and arXiv, alongside web articles.
	* Dataset Size:
	* Pre-processing: 9.27M training samples and 459K validation samples.
	* Post-filtering: Reduced to 4.62M training samples and 2.46K validation samples after removing very short texts to minimize ambiguity.
	* Labeling Method: Derived from forum categories, tags, and keyword metadata rather than LLM-generated annotations.

	## Training Procedure

	* Optimizer: Adam optimizer.
	* Learning Rate: 2e-5.
	* Schedule: 10% warmup ratio over 2 training epochs.
	* Hardware: Implementation utilized the ModernBERT-base encoder as the foundation for the binary head.

	## Evaluation Results

	The model was evaluated on a validation set of 2,460 samples derived from web articles, achieving the following metrics:

	\| Metric \| Score \|
	\| :--- \| :--- \|
	\| Accuracy \| 97.3% \|
	\| Precision \| 92.8% \|
	\| Recall \| 90.2% \|
	\| F1 Score \| 91.4% \|

	## Limitations & Risks

	* Context Sensitivity: While highly accurate, the model was specifically filtered to exclude very short texts to avoid context ambiguity.
	* Temporal Bias: The model identifies cybersecurity content based on trends observed in web data up to late 2024; emerging threats post-2024 may not be represented.
	* Dual-Use Concerns: The classifier is designed to identify offensive security technical content, which carries an inherent risk of misuse if applied outside of defensive or educational research.

	---

	## Citation

	```bibtex
	@inproceedings{suryanto2026redsage,
	title={RedSage: A Cybersecurity Generalist {LLM}},
	author={Naufal Suryanto and Muzammal Naseer and Pengfei Li and Syed Talal Wasim and Jinhui Yi and Juergen Gall and Paolo Ceravolo and Ernesto Damiani},
	booktitle={The Fourteenth International Conference on Learning Representations},
	year={2026},
	url={https://openreview.net/forum?id=W4FAenIrQ2}
	}
	```