Practice Exam
30 exam-style questions covering all 4 domains of the AIF-C01 exam. Try to answer each question before revealing the explanation. Aim for 21+ correct (70%) to feel confident for the real exam.
Domain 1: AI/ML Fundamentals (Questions 1-6)
A) Supervised learning - classification
B) Supervised learning - regression
C) Unsupervised learning - clustering
D) Reinforcement learning
Show Answer
B) Supervised learning - regression. The company has labeled historical data (input: dates/prices, output: quantities) and wants to predict a continuous number (units sold). This is regression. Classification (A) predicts categories. Clustering (C) requires no labels. RL (D) involves agents and rewards.
A) Data collection and preparation
B) Model training
C) Network configuration
D) Model monitoring and maintenance
Show Answer
C) Network configuration. While infrastructure is needed, network configuration is not a phase of the ML lifecycle. The lifecycle consists of: problem definition, data collection/preparation, feature engineering, model training, evaluation, deployment, and monitoring/maintenance.
A) Underfitting
B) Overfitting
C) Data drift
D) Bias
Show Answer
B) Overfitting. High training accuracy but low test accuracy is the textbook definition of overfitting. The model memorized the training data instead of learning general patterns. Solutions include regularization, dropout, more training data, or a simpler model.
A) Accuracy
B) Precision
C) Recall
D) RMSE
Show Answer
C) Recall. When missing positives (false negatives) is costly, optimize for recall. Recall measures what percentage of actual positives the model catches. For fraud detection, missing a fraud (false negative) is worse than flagging a legitimate transaction (false positive). Accuracy (A) is misleading for imbalanced data. Precision (B) minimizes false positives. RMSE (D) is for regression.
A) Any AI system that can pass the Turing test
B) A subset of machine learning using multi-layered neural networks
C) A type of reinforcement learning
D) An algorithm that only works with structured data
Show Answer
B) A subset of machine learning using multi-layered neural networks. Deep learning uses neural networks with multiple hidden layers ("deep" networks) to learn complex patterns. It excels at unstructured data (images, text, audio), not only structured data (D). It is not limited to RL (C) or defined by the Turing test (A).
A) Supervised learning
B) Unsupervised learning
C) Semi-supervised learning
D) Reinforcement learning
Show Answer
D) Reinforcement learning. The robot (agent) interacts with the warehouse (environment) and learns from rewards and penalties. This is the defining pattern of RL: agent + environment + rewards. AWS DeepRacer is a similar RL use case.
Domain 2: Generative AI (Questions 7-13)
A) Zero-shot prompting
B) Few-shot prompting
C) Chain-of-thought prompting
D) Fine-tuning
Show Answer
B) Few-shot prompting. Providing examples (shots) in the prompt to show the model the expected output format is few-shot prompting. 3 examples = few-shot. Zero examples = zero-shot (A). Chain-of-thought (C) adds "think step by step" reasoning. Fine-tuning (D) changes the model itself, not the prompt.
A) RAG produces higher quality text
B) RAG does not require any foundation model
C) RAG can be updated with new information without retraining the model
D) RAG eliminates all hallucination
Show Answer
C) RAG can be updated with new information without retraining the model. The key advantage of RAG is that you simply update the knowledge base (add/remove documents), and the model immediately has access to the new information. Fine-tuning requires retraining. RAG does need an FM (B), does not guarantee higher quality (A), and reduces but does not eliminate hallucination (D).
A) Knowledge Bases
B) Agents
C) Guardrails
D) Model evaluation
Show Answer
C) Guardrails. Bedrock Guardrails includes a "denied topics" feature that prevents the model from responding about specific subjects you define. Knowledge Bases (A) is for RAG. Agents (B) is for multi-step task automation. Model evaluation (D) compares model performance.
A) Temperature = 0.0
B) Temperature = 0.1
C) Temperature = 0.8
D) Temperature = 0.0 with top-p = 0.1
Show Answer
C) Temperature = 0.8. Higher temperature (0.7-1.0) increases randomness and creativity in the output, which is ideal for marketing copy. Low temperature (A, B) produces deterministic, factual output. Low top-p with low temperature (D) would make output even more constrained.
A) A model trained on a small, task-specific dataset
B) A large model pre-trained on broad data that can be adapted to many tasks
C) A model that can only perform one specific task
D) A rule-based expert system
Show Answer
B) A large model pre-trained on broad data that can be adapted to many tasks. Foundation models are trained on massive, diverse datasets and serve as a "foundation" that can be adapted (via prompting, fine-tuning, or RAG) for many different downstream tasks. They are general-purpose, not task-specific (A, C), and use ML, not rules (D).
A) To encrypt the documents for security
B) To represent text as numerical vectors for similarity search
C) To compress documents to save storage
D) To translate documents between languages
Show Answer
B) To represent text as numerical vectors for similarity search. In RAG, documents are converted to vector embeddings (numerical representations) so that when a user asks a question, the system can find the most semantically similar document chunks using vector similarity search. Amazon Titan Embeddings and Amazon Bedrock Knowledge Bases handle this process.
A) Zero-shot prompting
B) RAG with medical documents
C) Fine-tuning with domain-specific data
D) Increasing the temperature
Show Answer
C) Fine-tuning with domain-specific data. When the model needs to adopt a specific writing style or terminology that differs from its training, fine-tuning is appropriate. RAG (B) adds knowledge but does not change the model's style. Zero-shot (A) uses the model as-is. Temperature (D) affects randomness, not style.
Domain 3: AWS AI/ML Services (Questions 14-24)
A) Amazon Rekognition
B) Amazon Comprehend
C) Amazon Translate
D) Amazon Lex
Show Answer
B) Amazon Comprehend. Comprehend provides sentiment analysis that classifies text as positive, negative, neutral, or mixed. It is the go-to service for analyzing text meaning. Rekognition (A) is for images. Translate (C) is for language translation. Lex (D) is for building chatbots.
A) Amazon Polly
B) Amazon Transcribe
C) Amazon Comprehend
D) Amazon Translate
Show Answer
B) Amazon Transcribe. Transcribe converts speech to text and supports speaker identification (speaker diarization). Polly (A) does the opposite: text to speech. Comprehend (C) analyzes text meaning. Translate (D) translates between languages.
A) Amazon Bedrock
B) Amazon Comprehend
C) Amazon SageMaker
D) Amazon Rekognition
Show Answer
C) Amazon SageMaker. SageMaker is the comprehensive ML platform for building custom models. The team wants full control over the ML process (model selection, training, deployment), which is exactly what SageMaker provides. Bedrock (A) is for generative AI. Comprehend (B) is for NLP. Rekognition (D) is for images.
A) Amazon Transcribe
B) Amazon Polly
C) Amazon Lex
D) Amazon Translate
Show Answer
B) Amazon Polly. Polly converts text to lifelike spoken audio (text-to-speech). This is the correct service for accessibility features that read text aloud. Transcribe (A) does the opposite: speech to text. Lex (C) is for chatbots. Translate (D) is for language translation.
A) Amazon Textract
B) Amazon Comprehend Medical
C) Amazon Rekognition
D) Amazon Translate
Show Answer
B) Amazon Comprehend Medical. Comprehend Medical is specifically designed to extract medical entities (medications, dosages, conditions, procedures) from clinical text. Textract (A) extracts text from scanned documents but does not understand medical terminology. Rekognition (C) analyzes images. Translate (D) translates languages.
A) SageMaker Studio
B) SageMaker Autopilot
C) SageMaker Canvas
D) SageMaker JumpStart
Show Answer
C) SageMaker Canvas. Canvas is the no-code ML interface designed for business users. It provides visual, point-and-click model building. Studio (A) is a full IDE for data scientists. Autopilot (B) automates ML but still requires some technical knowledge. JumpStart (C) provides pre-trained models but is accessed through Studio.
A) Amazon Kendra
B) Amazon Personalize
C) Amazon Comprehend
D) Amazon Forecast
Show Answer
B) Amazon Personalize. Personalize is the recommendation engine that uses user interaction data (viewing history, ratings) to provide personalized recommendations. Kendra (A) is for enterprise search. Comprehend (C) is for text analysis. Forecast (D) is for time-series prediction.
A) Amazon Personalize
B) Amazon Forecast
C) Amazon Comprehend
D) Amazon Bedrock
Show Answer
B) Amazon Forecast. Forecast is the time-series forecasting service designed for exactly this use case: predicting future values based on historical time-series data. Personalize (A) is for recommendations. Comprehend (C) is for NLP. Bedrock (D) is for generative AI.
A) Amazon Textract
B) Amazon Comprehend
C) Amazon Rekognition
D) Amazon Bedrock
Show Answer
C) Amazon Rekognition. Rekognition provides face comparison capabilities that can compare a face in one image against a collection of faces in another database. This is exactly what identity verification requires. Textract (A) is for documents. Comprehend (B) is for text. Bedrock (D) is for generative AI.
A) Amazon Kendra
B) Amazon Textract
C) Amazon Translate
D) Amazon Polly
Show Answer
A) Amazon Kendra. Kendra is the intelligent enterprise search service that understands natural language queries and finds answers across documents, wikis, and knowledge bases. Textract (B) extracts text from scanned documents. Translate (C) translates languages. Polly (D) converts text to speech.
A) Amazon SageMaker
B) Amazon Bedrock
C) Amazon Comprehend
D) Amazon Kendra
Show Answer
B) Amazon Bedrock. Bedrock is the fully managed service that provides access to foundation models from multiple providers through a unified API. SageMaker JumpStart also offers some FMs, but the "single API, multiple providers" description specifically matches Bedrock.
Domain 4: Responsible AI (Questions 25-30)
A) Overfitting
B) Bias / fairness issue
C) Data drift
D) Underfitting
Show Answer
B) Bias / fairness issue. When a model produces systematically different outcomes for different groups despite similar qualifications, this is a bias/fairness problem. The model may have learned biased patterns from historical data. SageMaker Clarify can detect this type of disparate impact.
A) Amazon SageMaker Model Monitor
B) Amazon SageMaker Clarify
C) Amazon Bedrock Guardrails
D) Amazon CloudWatch
Show Answer
B) Amazon SageMaker Clarify. Clarify provides model explainability through SHAP values, which show how much each feature contributed to a specific prediction. Model Monitor (A) detects data drift. Guardrails (C) filters generative AI content. CloudWatch (D) monitors infrastructure metrics.
A) Knowledge Bases
B) Guardrails with content filters
C) Model evaluation
D) Fine-tuning
Show Answer
B) Guardrails with content filters. Bedrock Guardrails allows you to configure content filters that block harmful content categories including violence, hate speech, sexual content, and more. You can also define denied topics for additional specificity.
A) SQL injection
B) Prompt injection
C) Data poisoning
D) Model extraction
Show Answer
B) Prompt injection. Prompt injection is when an attacker crafts inputs designed to override the model's system prompt and instructions. The input explicitly tries to make the model "ignore previous instructions." Mitigations include input validation, Bedrock Guardrails, and robust system prompts.
A) Amazon SageMaker Model Cards
B) Amazon CloudFormation
C) Amazon S3
D) AWS CloudTrail
Show Answer
A) Amazon SageMaker Model Cards. Model Cards provide a structured framework for documenting ML models, including their intended use, limitations, performance metrics, and bias evaluation results. This supports governance and transparency. CloudFormation (B) manages infrastructure. S3 (C) stores files. CloudTrail (D) logs API calls.
A) Overfitting
B) Underfitting
C) Data drift
D) Prompt injection
Show Answer
C) Data drift. Data drift occurs when the statistical properties of production data change over time compared to the training data. Customer behavior evolving is a classic example. The model was trained on historical patterns that no longer apply. SageMaker Model Monitor detects data drift automatically.
Score Your Results
25-30 correct: Excellent. You are ready for the exam. Review any missed questions.
21-24 correct: Good. You are close. Review the domains where you missed questions.
16-20 correct: Fair. Go back and re-study the weaker domains before scheduling the exam.
Below 16: You need more study time. Re-read all domain lessons and retake this practice exam in a few days.