LLM Fairness & Stereotypes

Audit and mitigate fairness issues in LLMs. Learn the evaluation suites (BBQ, BOLD, StereoSet, HolisticBias, MMLU-Pro fairness slice), the limits of benchmark-only evaluation, RLHF effects on fairness, sycophancy and stereotype reinforcement, prompt-engineering mitigations, and red-team patterns for stereotype probing.

6
Lessons
📋
Templates
Practitioner-Ready
100%
Free

Lessons in This Topic

Work through these 6 lessons in order, or jump to whichever is most relevant.