Dangerous Capability Evals

Run dangerous-capability evaluations on frontier models before wider deployment. Learn the canonical categories (CBRN uplift, cyber offence, autonomous replication and adaptation, persuasion and manipulation), eval design rules, elicitation discipline (the 'we must elicit to the best of our ability' principle), result disclosure under responsibility, and the link to responsible-scaling commitments.

6
Lessons
📋
Templates
Practitioner-Ready
100%
Free

Lessons in This Topic

Work through these 6 lessons in order, or jump to whichever is most relevant.