Dangerous Capability Evals

Run dangerous-capability evaluations on frontier models before wider deployment. Learn the canonical categories (CBRN uplift, cyber offence, autonomous replication and adaptation, persuasion and manipulation), eval design rules, elicitation discipline (the 'we must elicit to the best of our ability' principle), result disclosure under responsibility, and the link to responsible-scaling commitments.

Start Topic → View All Lessons

Lessons

📋

Templates

✅

Practitioner-Ready

100%

Free