Agent Evaluation

Measure agent quality on tool use, planning, task completion, and cost. Build trajectory evaluations, golden tests, and online monitoring for production agents.

6
Lessons
💻
Code Examples
Production-Ready
100%
Free

Lessons in This Skill

Work through these 6 lessons in order, or jump to whichever topic you need most.