ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation Paper • 2310.17389 • Published Oct 26, 2023
NemoGuard Collection Essential datasets and models for content safety, topic-following, and security guardrails • 11 items • Updated about 2 hours ago • 11
gpt-oss-safeguard Collection gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built upon gpt-oss • 2 items • Updated 6 days ago • 52
ShieldGemma Collection ShieldGemma is a family of models for text and image content moderation. • 4 items • Updated Jul 10 • 9
A Holistic Approach to Undesired Content Detection in the Real World Paper • 2208.03274 • Published Aug 5, 2022
eliasalbouzidi/distilbert-nsfw-text-classifier Text Classification • 67M • Updated May 16 • 9.26k • 22