LLM Fairness & Stereotypes

Audit and mitigate fairness issues in LLMs. Learn the evaluation suites (BBQ, BOLD, StereoSet, HolisticBias, MMLU-Pro fairness slice), the limits of benchmark-only evaluation, RLHF effects on fairness, sycophancy and stereotype reinforcement, prompt-engineering mitigations, and red-team patterns for stereotype probing.

Start Topic → View All Lessons

6

Lessons

📋

Templates

✅

Practitioner-Ready

100%

Free

Lessons in This Topic

Work through these 6 lessons in order, or jump to whichever is most relevant.

LLM Fairness Overview

Advanced

Eval Suites: BBQ, BOLD, StereoSet, HolisticBias

Advanced

Limits of Benchmark-Only Eval

Advanced

RLHF Effects on Fairness

Advanced

Prompt-Engineering Mitigations

Advanced

LLM Red-Team Pattern Template

Advanced