Ray on Kubernetes

Deploy and manage Ray clusters on Kubernetes with KubeRay. Learn distributed training with Ray Train, scalable model serving with Ray Serve, and production best practices for running AI workloads at scale.

6
Lessons
Hands-On Projects
🕑
Self-Paced
100%
Free

Your Learning Path

Follow these lessons in order, or jump to any topic that interests you.

What You'll Learn

By the end of this course, you'll be able to:

💻

Deploy Ray on K8s

Install and configure KubeRay with autoscaling Ray clusters on any Kubernetes environment.

Distributed Training

Scale model training across multiple GPUs and nodes using Ray Train with PyTorch and Hugging Face.

🚀

Model Serving

Deploy scalable inference endpoints with Ray Serve featuring batching, composition, and autoscaling.

📊

Production Operations

Monitor, troubleshoot, and optimize Ray clusters for reliability and cost efficiency in production.