Jailbreak attack datasets generated against multiple LLMs, one dataset per attack method.
AI & ML interests
None defined yet.
Recent Activity
View all activity
models 7
deepkeep-ai/openai-privacy-filter
Token Classification • 1B • Updated • 52
deepkeep-ai/stable-diffusion-xl-1.0-inpainting-0.1-9
Updated • 56
deepkeep-ai/napguard-patch-detector-3
Updated • 54
deepkeep-ai/sac-patch-segmenter-2
Updated • 66
deepkeep-ai/Ministral-3-8B-Instruct-2512
9B • Updated • 8.99k
deepkeep-ai/sae-guard-gemma3-4b-english-expanded
Feature Extraction • Updated • 1
deepkeep-ai/sae-guard-gemma3-4b-english-research
Feature Extraction • 1 • Updated • 11 • 1
datasets 8
deepkeep-ai/semantic-encoding-data-splits-llm-korean
Viewer • Updated • 16.5k • 15
deepkeep-ai/jigsaw_toxic_not_harmful_5k
Viewer • Updated • 5k • 35
deepkeep-ai/jigsaw_toxic_not_harmful_5k_translated
Viewer • Updated • 5k • 38
deepkeep-ai/notinject_expanded_1k_qwen35_9b_cuda_translated_roleplay
Viewer • Updated • 1k • 126
deepkeep-ai/seq_cls_train_translated_v3
Viewer • Updated • 2.15k • 32
deepkeep-ai/datasets
Updated • 30
deepkeep-ai/AdvBench-gcg
Viewer • Updated • 268 • 9
deepkeep-ai/benchoverflow
Viewer • Updated • 2.98k • 4