Claude Refusals & Behaviour

Reason about Claude refusals. Learn what triggers a refusal (hard categories, gradient categories), the difference between full refusal and partial-answer-with-caveat, system-prompt patterns that legitimately adjust posture (authorised pen-test, security research, defensive use cases), the refusal eval discipline, and the failure modes (over-refusal that breaks legitimate use, under-refusal that causes harm).

6
Lessons
📋
Templates
Practitioner-Ready
100%
Free

Lessons in This Topic

Work through these 6 lessons in order, or jump to whichever is most relevant.