Skip to content

llamaguard

from Orchestra-Research/AI-research-SKILLs

Meta's 7-8B specialized moderation model for LLM input/output filtering. 6 safety categories - violence/hate, sexual content, weapons, substances, self-harm, criminal planning. 94-95% accuracy. Deploy

v1.0.0MIT
338
Lines
977
Words
13
Code Blocks

Languages

bashpython
07-safety-alignment/llamaguard/SKILL.md