Safety classifiers fine-tuned on a bilingual dataset composed of the English QA pairs from BeaverTails and the Italian QA pairs from BeaverTails-IT.
Giuseppe Magazzù
saiteki-kai
AI & ML interests
My research focuses on the developement of safety mitigation strategies and benchmarks for large language models.
Recent Activity
liked
a model
1 day ago
Qwen/Qwen3.5-397B-A17B
liked
a model
1 day ago
Nanbeige/Nanbeige4.1-3B
upvoted
a
changelog
1 day ago
Community Evals and Benchmark Repositories