Introduction to Azure OpenAI Infrastructure
Understand the architecture, available models, and enterprise capabilities of Azure OpenAI Service — Microsoft's managed platform for deploying OpenAI foundation models.
What is Azure OpenAI Service?
Azure OpenAI Service provides enterprise-grade access to OpenAI's models (GPT-4, GPT-4o, DALL-E, Whisper) with Azure's security, compliance, and networking capabilities. Unlike using OpenAI directly, Azure OpenAI gives you data residency, private networking, and integration with Azure's identity and monitoring ecosystem.
Available Model Families
| Model | Capabilities | Best For |
|---|---|---|
| GPT-4o / GPT-4o mini | Text, vision, audio | General-purpose, multimodal applications |
| GPT-4 | Text generation, reasoning | Complex reasoning, code generation |
| GPT-3.5 Turbo | Fast text generation | High-volume, cost-sensitive workloads |
| Embeddings | Text embeddings | Search, RAG, classification |
| DALL-E 3 | Image generation | Creative content, design |
| Whisper | Speech to text | Transcription, voice interfaces |
Architecture Overview
Resource & Deployments
An Azure OpenAI resource contains one or more model deployments, each with its own quota and configuration.
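The resource-to-deployment relationship can be pictured as a small data model. This is only an illustrative sketch: the class names, the resource name "contoso-openai", and the deployment names are hypothetical, not part of any Azure SDK.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class Deployment:
    name: str        # deployment name you choose (hypothetical values below)
    model: str       # underlying model, e.g. "gpt-4o"
    tpm_limit: int   # tokens-per-minute allocated to this deployment


@dataclass
class Resource:
    name: str
    region: str
    deployments: Dict[str, Deployment] = field(default_factory=dict)

    def add(self, deployment: Deployment) -> None:
        # Deployment names must be unique within one resource in this model.
        if deployment.name in self.deployments:
            raise ValueError(f"deployment {deployment.name!r} already exists")
        self.deployments[deployment.name] = deployment

    def total_tpm(self) -> int:
        # Sum of the TPM allocated across all deployments in the resource.
        return sum(d.tpm_limit for d in self.deployments.values())


# Hypothetical resource with two deployments of different models.
resource = Resource(name="contoso-openai", region="eastus")
resource.add(Deployment(name="chat-prod", model="gpt-4o", tpm_limit=50_000))
resource.add(Deployment(name="chat-batch", model="gpt-4o-mini", tpm_limit=100_000))
```

In this model, API calls would address "chat-prod" or "chat-batch" rather than the model name, which mirrors how each deployment carries its own configuration.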
Quotas & Limits
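Tokens-per-minute (TPM) quota is assigned per subscription, per region, and per model; you allocate a share of that quota to each deployment. A requests-per-minute (RPM) limit is then derived from the deployment's TPM allocation.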
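A quick capacity check makes these limits concrete. The sketch below estimates whether a workload fits a deployment's limits. The helper names are hypothetical, and the default ratio of 6 RPM per 1,000 TPM is an assumption that may differ by model, so check the current quota documentation before relying on it.

```python
def derived_rpm(tpm: int, rpm_per_1k_tpm: int = 6) -> int:
    """RPM limit implied by a TPM allocation (the ratio is an assumption)."""
    return tpm * rpm_per_1k_tpm // 1000


def fits_quota(tpm: int, requests_per_minute: int,
               avg_tokens_per_request: int, rpm_per_1k_tpm: int = 6) -> bool:
    """Return True if the workload stays under both the TPM and RPM limits."""
    needed_tpm = requests_per_minute * avg_tokens_per_request
    within_tokens = needed_tpm <= tpm
    within_requests = requests_per_minute <= derived_rpm(tpm, rpm_per_1k_tpm)
    return within_tokens and within_requests


# 100 requests/min at ~250 tokens each against a 30,000 TPM allocation.
ok = fits_quota(tpm=30_000, requests_per_minute=100, avg_tokens_per_request=250)
```

A workload can fail either check on its own: many small requests can exhaust RPM while staying well under TPM, and a few very large requests can do the reverse.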
Content Safety
Built-in content filtering screens both prompts and completions for harmful content in categories such as hate, sexual content, violence, and self-harm, with configurable severity thresholds.
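To illustrate how severity thresholds work, the sketch below scans per-category filter annotations and reports which categories meet or exceed a chosen severity. The payload shape (category keys with `filtered` and `severity` fields) is an assumption modeled on the service's content filter annotations, and the sample data is invented for illustration.

```python
from typing import Dict, List

# Severity levels ordered from least to most severe (assumed labels).
SEVERITY_ORDER = ["safe", "low", "medium", "high"]


def flagged_categories(results: Dict[str, dict], threshold: str = "medium") -> List[str]:
    """Return categories that were filtered or whose severity meets the threshold."""
    limit = SEVERITY_ORDER.index(threshold)
    flagged = []
    for category, result in results.items():
        severity = result.get("severity", "safe")
        if result.get("filtered") or SEVERITY_ORDER.index(severity) >= limit:
            flagged.append(category)
    return sorted(flagged)


# Invented sample annotations, not real service output.
sample = {
    "hate": {"filtered": False, "severity": "safe"},
    "self_harm": {"filtered": False, "severity": "low"},
    "sexual": {"filtered": False, "severity": "safe"},
    "violence": {"filtered": True, "severity": "medium"},
}

strict = flagged_categories(sample, threshold="low")
default = flagged_categories(sample)
```

Lowering the threshold flags more categories, which is the trade-off a stricter filter configuration makes.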
API Compatibility
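The OpenAI Python SDK also works with Azure OpenAI, so migrating from OpenAI is mostly a change to client setup: you supply your resource endpoint and an API version, and pass your deployment name where OpenAI expects a model name.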
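The main differences show up in where a request is sent and how it is authenticated. The sketch below builds the chat completions URL and auth header for each service using only the standard library, so it runs without the SDK installed. The resource name, deployment name, key placeholder, and API version string are assumptions; in practice the SDK handles these details for you.

```python
from typing import Dict, Tuple
from urllib.parse import urlencode


def openai_request(api_key: str) -> Tuple[str, Dict[str, str]]:
    """URL and auth header for OpenAI; the model goes in the request body."""
    url = "https://api.openai.com/v1/chat/completions"
    return url, {"Authorization": f"Bearer {api_key}"}


def azure_request(resource_name: str, deployment: str, api_key: str,
                  api_version: str = "2024-02-01") -> Tuple[str, Dict[str, str]]:
    """URL and auth header for Azure OpenAI; the deployment goes in the URL path."""
    base = f"https://{resource_name}.openai.azure.com"
    path = f"/openai/deployments/{deployment}/chat/completions"
    query = urlencode({"api-version": api_version})
    return f"{base}{path}?{query}", {"api-key": api_key}


# Hypothetical resource and deployment names; no request is actually sent.
url, headers = azure_request("contoso-openai", "chat-prod", "<your-key>")
```

The request and response bodies stay largely the same between the two services, which is why migration tends to be limited to configuration.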