Bedrock Foundation Models Intermediate

Bedrock provides access to a diverse catalog of foundation models. Each model has different strengths, context windows, and pricing. This lesson compares the available models and provides a framework for selecting the right model for your use case.

Available Models

ProviderModelContextStrengths
AnthropicClaude 3.5 Sonnet/Haiku200K tokensReasoning, code, analysis, safety
MetaLlama 3.1 (8B/70B/405B)128K tokensOpen weights, multilingual, code
AmazonTitan Text/Embeddings32K tokensCost-effective, embeddings
MistralMistral Large/Small32K tokensMultilingual, code, efficiency
CohereCommand R/R+128K tokensRAG, enterprise search

Model Selection Framework

Decision Guide
Complex reasoning, analysis, code:
  Claude 3.5 Sonnet (best quality, higher cost)

High-volume, cost-sensitive:
  Claude 3.5 Haiku or Llama 3.1 8B (fast, cheap)

RAG and enterprise search:
  Cohere Command R+ (optimized for retrieval)

Embeddings for vector search:
  Titan Embeddings or Cohere Embed

Multilingual content:
  Mistral Large or Llama 3.1

Image understanding:
  Claude 3.5 Sonnet (vision capable)

Pricing Considerations

  • On-demand pricing — Pay per input/output token. Best for variable or low-volume workloads.
  • Provisioned throughput — Reserve model capacity for consistent performance. Best for high-volume production.
  • Batch inference — Submit large batches at 50% discount. Best for offline processing.
Cost Strategy: Start with Claude Haiku or Llama 8B for prototyping and development. Upgrade to larger models only for use cases that demonstrate clear quality improvement. Many production workloads work well with smaller, cheaper models.

Ready to Integrate the API?

The next lesson covers how to call Bedrock models programmatically with the AWS SDK.

Next: API Integration →