Safety classifiers fine-tuned on a bilingual dataset composed of the English QA pairs from BeaverTails and the Italian QA pairs from BeaverTails-IT.
Giuseppe Magazzù
saiteki-kai
AI & ML interests
My research focuses on the developement of safety mitigation strategies and benchmarks for large language models.
Recent Activity
liked a dataset 1 day ago
MIND-Lab/BeaverTails-IT-Evaluation liked a dataset 25 days ago
nvidia/Aegis-AI-Content-Safety-Dataset-2.0 liked a model 27 days ago
deepseek-ai/DeepSeek-V4-Pro