Dangerous Capability Evals
Run dangerous-capability evaluations on frontier models before wider deployment. Learn the canonical categories (CBRN uplift, cyber offence, autonomous replication and adaptation, persuasion and manipulation), evaluation design rules, elicitation discipline (the 'we must elicit to the best of our ability' principle), responsible disclosure of results, and the link to responsible-scaling commitments.
6 Lessons | Templates | Practitioner-Ready | 100% Free
Lessons in This Topic
Work through these 6 lessons in order, or jump to whichever is most relevant.
Lilly Tech Systems