Evals-Based Safety Commitments

Make and keep evals-based safety commitments. Learn commitment design (what is committed, on what conditions), the eval-credibility problem (will the eval actually trigger at the right moment), external evaluation by AISIs and third parties, commitment reporting to regulators and the public, and the failure mode where the eval evolves to always pass.

6
Lessons
📋
Templates
Practitioner-Ready
100%
Free

Lessons in This Topic

Work through these 6 lessons in order, or jump to whichever is most relevant.