Indirect Prompt Injection

Reason about indirect prompt injection where instructions ride into the model via retrieved or fetched content (web pages, emails, RAG documents, file uploads). Learn the attack family conceptually, the agent-trust problem (the model cannot tell instructions from data), canonical scenarios that show up in real products, eval patterns, and the defence stack (provenance tracking, content sanitation, capability constraints, human-in-loop on high-risk actions).

Start Topic → View All Lessons

Lessons

📋

Templates

✅

Practitioner-Ready

100%

Free