MLA-C01 Sample Questions Answers

Questions 4

A company is training a deep learning model to detect abnormalities in images. The company has limited GPU resources and a large hyperparameter space to explore. The company needs to test different configurations and avoid wasting computation time on poorly performing models that show weak validation accuracy in early epochs.

Which hyperparameter optimization strategy should the company use?

Options:

Grid search across all possible combinations

Bayesian optimization with early stopping

Manual tuning of each parameter individually

Exhaustive search without early stopping

Buy Now

Questions 5

A company needs to update the model definition of an existing Amazon SageMaker Al endpoint.

Select and order the correct steps from the following list to update the model definition settings with the LEAST interruption of inferences. Select each step one time or not

at all. (Select and order THREE.)

Create a new endpoint configuration that uses the new model definition.

Create a new model definition with updated settings by using the CreateModel action in the SageMaker AI API.

Delete the endpoint that needs to be updated and recreate the endpoint with the new endpoint configuration.

Delete the IAM role and permissions for the ExecutionRoleArn parameter.

Update the endpoint with the new endpoint configuration.

Options:

Buy Now

Questions 6

A company wants to migrate ML models from an on-premises environment to Amazon SageMaker AI. The models are based on the PyTorch algorithm. The company needs to reuse its existing custom scripts as much as possible.

Which SageMaker AI feature should the company use?

Options:

SageMaker AI built-in algorithms

SageMaker Canvas

SageMaker JumpStart

SageMaker AI script mode

Buy Now

Questions 7

A company uses Amazon Athena to query a dataset in Amazon S3. The dataset has a target variable that the company wants to predict.

The company needs to use the dataset in a solution to determine if a model can predict the target variable.

Which solution will provide this information with the LEAST development effort?

Options:

Create a new model by using Amazon SageMaker Autopilot. Report the model ' s achieved performance.

Implement custom scripts to perform data pre-processing, multiple linear regression, and performance evaluation. Run the scripts on Amazon EC2 instances.

Configure Amazon Macie to analyze the dataset and to create a model. Report the model ' s achieved performance.

Select a model from Amazon Bedrock. Tune the model with the data. Report the model ' s achieved performance.

Buy Now

Questions 8

An ML engineer is developing a fraud detection model by using the Amazon SageMaker XGBoost algorithm. The model classifies transactions as either fraudulent or legitimate.

During testing, the model excels at identifying fraud in the training dataset. However, the model is inefficient at identifying fraud in new and unseen transactions.

What should the ML engineer do to improve the fraud detection for new transactions?

Options:

Increase the learning rate.

Remove some irrelevant features from the training dataset.

Increase the value of the max_depth hyperparameter.

Decrease the value of the max_depth hyperparameter.

Buy Now

Questions 9

A company is creating an application that will recommend products for customers to purchase. The application will make API calls to Amazon Q Business. The company must ensure that responses from Amazon Q Business do not include the name of the company ' s main competitor.

Which solution will meet this requirement?

Options:

Configure the competitor ' s name as a blocked phrase in Amazon Q Business.

Configure an Amazon Q Business retriever to exclude the competitor’s name.

Configure an Amazon Kendra retriever for Amazon Q Business to build indexes that exclude the competitor ' s name.

Configure document attribute boosting in Amazon Q Business to deprioritize the competitor ' s name.

Buy Now

Questions 10

A hospital wants to predict patient outcomes for the coming year An ML engineer must improve several existing ML models that currently perform poorly.

Select the correct regularization method from the following list to improve each model Select each regularization method one time, more than one time, or not at all. (Select THREE.)

• L1 regularization

• L2 regularization

• Early stopping

Options:

Buy Now

Questions 11

A company has multiple models that are hosted on Amazon SageMaker Al. The models need to be re-trained. The requirements for each model are different, so the company needs to choose different deployment strategies to transfer all requests to a new model.

Select the correct strategy from the following list for each requirement. Select each strategy one time. (Select THREE.)

. Canary traffic shifting

. Linear traffic shifting guardrail

. All at once traffic shifting

Options:

Buy Now

Questions 12

Case study

An ML engineer is developing a fraud detection model on AWS. The training dataset includes transaction logs, customer profiles, and tables from an on-premises MySQL database. The transaction logs and customer profiles are stored in Amazon S3.

The dataset has a class imbalance that affects the learning of the model ' s algorithm. Additionally, many of the features have interdependencies. The algorithm is not capturing all the desired underlying patterns in the data.

Before the ML engineer trains the model, the ML engineer must resolve the issue of the imbalanced data.

Which solution will meet this requirement with the LEAST operational effort?

Options:

Use Amazon Athena to identify patterns that contribute to the imbalance. Adjust the dataset accordingly.

Use Amazon SageMaker Studio Classic built-in algorithms to process the imbalanced dataset.

Use AWS Glue DataBrew built-in features to oversample the minority class.

Use the Amazon SageMaker Data Wrangler balance data operation to oversample the minority class.

Buy Now

Questions 13

An ML engineer needs to deploy ML models to get inferences from large datasets in an asynchronous manner. The ML engineer also needs to implement scheduled monitoring of the data quality of the models. The ML engineer must receive alerts when changes in data quality occur.

Which solution will meet these requirements?

Options:

Deploy the models by using scheduled AWS Glue jobs. Use Amazon CloudWatch alarms to monitor the data quality and to send alerts.

Deploy the models by using scheduled AWS Batch jobs. Use AWS CloudTrail to monitor the data quality and to send alerts.

Deploy the models by using Amazon Elastic Container Service (Amazon ECS) on AWS Fargate. Use Amazon EventBridge to monitor the data quality and to send alerts.

Deploy the models by using Amazon SageMaker AI batch transform. Use SageMaker Model Monitor to monitor the data quality and to send alerts.

Buy Now

Questions 14

A company uses a batching solution to process data analytics each day. The company wants to build an analytics platform to provide near real-time updates. The company wants to use open source technology and does not want to manage or scale the infrastructure.

Which solution will meet these requirements?

Options:

Create Amazon Managed Streaming for Apache Kafka (Amazon MSK) Serverless clusters to process the data.

Create Amazon Managed Streaming for Apache Kafka (Amazon MSK) Provisioned clusters. Configure the clusters based on data volume.

Create data streams in Amazon Kinesis Data Streams. Use AWS Application Auto Scaling to scale the infrastructure.

Create self-hosted Apache Flink applications on Amazon EC2. Run the applications as containers.

Buy Now

Questions 15

An ML engineer has a custom container that performs k-fold cross-validation and logs an average F1 score during training. The ML engineer wants Amazon SageMaker AI Automatic Model Tuning (AMT) to select hyperparameters that maximize the average F1 score.

How should the ML engineer integrate the custom metric into SageMaker AI AMT?

Options:

Define the average F1 score in the TrainingInputMode parameter.

Define a metric definition in the tuning job that uses a regular expression to capture the average F1 score from the training logs.

Publish the average F1 score as a custom Amazon CloudWatch metric.

Write the F1 score to a JSON file in Amazon S3 and reference it in ObjectiveMetricName.

Buy Now

Questions 16

An ML engineer is deploying a generative AI model-based customer support agent that uses Amazon SageMaker AI for inference. The customer support agent must respond to customer questions about topics such as shipping policies, refund processes, and account management. The generative AI model generates one token at a time.

Customers report dissatisfaction with how long the customer support agent takes to generate lengthy responses to questions. The ML engineer must apply an inference optimization technique to improve the performance of the customer support agent.

Which solution will meet this requirement?

Options:

Compilation

Speculative decoding

Quantization

Fast model loading

Buy Now

Questions 17

An ML engineer needs to organize a large set of text documents into topics. The ML engineer will not know what the topics are in advance. The ML engineer wants to use built-in algorithms or pre-trained models available through Amazon SageMaker AI to process the documents.

Which solution will meet these requirements?

Options:

Use the BlazingText algorithm to identify the relevant text and to create a set of topics based on the documents.

Use the Sequence-to-Sequence algorithm to summarize the text and to create a set of topics based on the documents.

Use the Object2Vec algorithm to create embeddings and to create a set of topics based on the embeddings.

Use the Latent Dirichlet Allocation (LDA) algorithm to process the documents and to create a set of topics based on the documents.

Buy Now

Questions 18

An ML engineer needs to use Amazon SageMaker to fine-tune a large language model (LLM) for text summarization. The ML engineer must follow a low-code no-code (LCNC) approach.

Which solution will meet these requirements?

Options:

Use SageMaker Studio to fine-tune an LLM that is deployed on Amazon EC2 instances.

Use SageMaker Autopilot to fine-tune an LLM that is deployed by a custom API endpoint.

Use SageMaker Autopilot to fine-tune an LLM that is deployed on Amazon EC2 instances.

Use SageMaker Autopilot to fine-tune an LLM that is deployed by SageMaker JumpStart.

Buy Now

Questions 19

An ML engineer is configuring auto scaling for an inference component of a model that runs behind an Amazon SageMaker AI endpoint. The ML engineer configures SageMaker AI auto scaling with a target tracking scaling policy set to 100 invocations per model per minute. The SageMaker AI endpoint scales appropriately during normal business hours. However, the ML engineer notices that at the start of each business day, there are zero instances available to handle requests, which causes delays in processing.

The ML engineer must ensure that the SageMaker AI endpoint can handle incoming requests at the start of each business day.

Which solution will meet this requirement?

Options:

Reduce the SageMaker AI auto scaling cooldown period to the minimum supported value. Add an auto scaling lifecycle hook to scale the SageMaker AI instances.

Change the target metric to CPU utilization.

Modify the scaling policy target value to one.

Apply a step scaling policy that scales based on an Amazon CloudWatch alarm. Apply a second CloudWatch alarm and scaling policy to scale the minimum number of instances from zero to one at the start of each business day.

Buy Now

Questions 20

An ML engineer needs to process thousands of existing CSV objects and new CSV objects that are uploaded. The CSV objects are stored in a central Amazon S3 bucket and have the same number of columns. One of the columns is a transaction date. The ML engineer must query the data based on the transaction date.

Which solution will meet these requirements with the LEAST operational overhead?

Options:

Use an Amazon Athena CREATE TABLE AS SELECT (CTAS) statement to create a table based on the transaction date from data in the central S3 bucket. Query the objects from the table.

Create a new S3 bucket for processed data. Set up S3 replication from the central S3 bucket to the new S3 bucket. Use S3 Object Lambda to query the objects based on transaction date.

Create a new S3 bucket for processed data. Use AWS Glue for Apache Spark to create a job to query the CSV objects based on transaction date. Configure the job to store the results in the new S3 bucket. Query the objects from the new S3 bucket.

Create a new S3 bucket for processed data. Use Amazon Data Firehose to transfer the data from the central S3 bucket to the new S3 bucket. Configure Firehose to run an AWS Lambda function to query the data based on transaction date.

Buy Now

Questions 21

An ML engineer is using AWS CodeDeploy to deploy new container versions for inference on Amazon ECS.

The deployment must shift 10% of traffic initially, and the remaining 90% must shift within 10–15 minutes.

Which deployment configuration meets these requirements?

Options:

CodeDeployDefault.LambdaLinear10PercentEvery10Minutes

CodeDeployDefault.ECSAllAtOnce

CodeDeployDefault.ECSCanary10Percent15Minutes

CodeDeployDefault.LambdaCanary10Percent15Minutes

Buy Now

Questions 22

A company has significantly increased the amount of data that is stored as .csv files in an Amazon S3 bucket. Data transformation scripts and queries are now taking much longer than they used to take.

An ML engineer must implement a solution to optimize the data for query performance.

Which solution will meet this requirement with the LEAST operational overhead?

Options:

Configure an AWS Lambda function to split the .csv files into smaller objects in the S3 bucket.

Configure an AWS Glue job to drop columns that have string type values and to save the results to the S3 bucket.

Configure an AWS Glue extract, transform, and load (ETL) job to convert the .csv files to Apache Parquet format.

Configure an Amazon EMR cluster to process the data that is in the S3 bucket.

Buy Now

Questions 23

A company regularly receives new training data from the vendor of an ML model. The vendor delivers cleaned and prepared data to the company ' s Amazon S3 bucket every 3-4 days.

The company has an Amazon SageMaker pipeline to retrain the model. An ML engineer needs to implement a solution to run the pipeline when new data is uploaded to the S3 bucket.

Which solution will meet these requirements with the LEAST operational effort?

Options:

Create an S3 Lifecycle rule to transfer the data to the SageMaker training instance and to initiate training.

Create an AWS Lambda function that scans the S3 bucket. Program the Lambda function to initiate the pipeline when new data is uploaded.

Create an Amazon EventBridge rule that has an event pattern that matches the S3 upload. Configure the pipeline as the target of the rule.

Use Amazon Managed Workflows for Apache Airflow (Amazon MWAA) to orchestrate the pipeline when new data is uploaded.

Buy Now

Questions 24

A company has used Amazon SageMaker to deploy a predictive ML model in production. The company is using SageMaker Model Monitor on the model. After a model update, an ML engineer notices data quality issues in the Model Monitor checks.

What should the ML engineer do to mitigate the data quality issues that Model Monitor has identified?

Options:

Adjust the model ' s parameters and hyperparameters.

Initiate a manual Model Monitor job that uses the most recent production data.

Create a new baseline from the latest dataset. Update Model Monitor to use the new baseline for evaluations.

Include additional data in the existing training set for the model. Retrain and redeploy the model.

Buy Now

Questions 25

A company uses an Amazon EMR cluster to run a data ingestion process for an ML model. An ML engineer notices that the processing time is increasing.

Which solution will reduce the processing time MOST cost-effectively?

Options:

Use Spot Instances to increase the number of primary nodes.

Use Spot Instances to increase the number of core nodes.

Use Spot Instances to increase the number of task nodes.

Use On-Demand Instances to increase the number of core nodes.

Buy Now

Questions 26

An ML engineer is developing a classification model. The ML engineer needs to use custom libraries in processing jobs, training jobs, and pipelines in Amazon SageMaker AI.

Which solution will provide this functionality with the LEAST implementation effort?

Options:

Manually install the libraries in the SageMaker AI containers.

Build a custom Docker container that includes the required libraries. Host the container in Amazon Elastic Container Registry (Amazon ECR). Use the ECR image in the SageMaker AI jobs and pipelines.

Use a SageMaker AI notebook instance and install libraries at startup.

Run code externally on Amazon EC2 and import results into SageMaker AI.

Buy Now

Questions 27

A company runs an ML model on Amazon SageMaker AI. The company uses an automatic process that makes API calls to create training jobs for the model. The company has new compliance rules that prohibit the collection of aggregated metadata from training jobs.

Which solution will prevent SageMaker AI from collecting metadata from the training jobs?

Options:

Opt out of metadata tracking for any training job that is submitted.

Ensure that training jobs are running in a private subnet in a custom VPC.

Encrypt the training data with an AWS Key Management Service (AWS KMS) customer managed key.

Reconfigure the training jobs to use only AWS Nitro instances.

Buy Now

Questions 28

A company is developing a customer support AI assistant by using an Amazon Bedrock Retrieval Augmented Generation (RAG) pipeline. The AI assistant retrieves articles from a knowledge base stored in Amazon S3. The company uses Amazon OpenSearch Service to index the knowledge base. The AI assistant uses an Amazon Bedrock Titan Embeddings model for vector search.

The company wants to improve the relevance of the retrieved articles to improve the quality of the AI assistant ' s answers.

Which solution will meet these requirements?

Options:

Use auto-summarization on the retrieved articles by using Amazon SageMaker JumpStart.

Use a reranker model before passing the articles to the foundation model (FM).

Use Amazon Athena to pre-filter the articles based on metadata before retrieval.

Use Amazon Bedrock Provisioned Throughput to process queries more efficiently.

Buy Now

Questions 29

A company uses AWS CodePipeline to orchestrate a continuous integration and continuous delivery (CI/CD) pipeline for ML models and applications.

Select and order the steps from the following list to describe a CI/CD process for a successful deployment. Select each step one time. (Select and order FIVE.)

. CodePipeline deploys ML models and applications to production.

· CodePipeline detects code changes and starts to build automatically.

. Human approval is provided after testing is successful.

. The company builds and deploys ML models and applications to staging servers for testing.

. The company commits code changes or new training datasets to a Git repository.

Options:

Buy Now

Questions 30

A gaming company needs to deploy a natural language processing (NLP) model to moderate a chat forum in a game. The workload experiences heavy usage during evenings and weekends but minimal activity during other hours.

Which solution will meet these requirements MOST cost-effectively?

Options:

Use an Amazon SageMaker AI batch transform job with fixed capacity.

Use Amazon SageMaker Serverless Inference.

Use a single Amazon EC2 GPU instance with reserved capacity.

Use Amazon SageMaker Asynchronous Inference.

Buy Now

Questions 31

A company is developing a generative AI conversational interface to assist customers with payments. The company wants to use an ML solution to detect customer intent. The company does not have training data to train a model.

Which solution will meet these requirements?

Options:

Fine-tune a sequence-to-sequence (seq2seq) algorithm in Amazon SageMaker JumpStart.

Use an LLM from Amazon Bedrock with zero-shot learning.

Use the Amazon Comprehend DetectEntities API.

Run an LLM from Amazon Bedrock on Amazon EC2 instances.

Buy Now

Questions 32

A company wants to use Amazon SageMaker AI to host an ML model that runs on CPU for real-time predictions. The model has intermittent traffic during business hours and periods of no traffic after business hours.

Which hosting option will serve inference requests in the MOST cost-effective manner?

Options:

Deploy the model to a real-time endpoint with scheduled auto scaling.

Deploy the model to a SageMaker AI Serverless Inference endpoint with provisioned concurrency during business hours.

Deploy the model to an asynchronous inference endpoint with auto scaling to zero.

Deploy the model to a real-time endpoint and activate it only during business hours using AWS Lambda.

Buy Now

Questions 33

An ML engineer is training an XGBoost regression model in Amazon SageMaker AI. The ML engineer conducts several rounds of hyperparameter tuning with random grid search. After these rounds of tuning, the error rate on the test hold-out dataset is much larger than the error rate on the training dataset.

The ML engineer needs to make changes before running the hyperparameter grid search again.

Which changes will improve the model ' s performance? (Select TWO.)

Options:

Increase the model complexity by increasing the number of features in the dataset.

Decrease the model complexity by reducing the number of features in the dataset.

Decrease the model complexity by reducing the number of samples in the dataset.

Increase the value of the L2 regularization parameter.

Decrease the value of the L2 regularization parameter.

Buy Now

Questions 34

A company uses Amazon SageMaker Studio to develop an ML model. The company has a single SageMaker Studio domain. An ML engineer needs to implement a solution that provides an automated alert when SageMaker compute costs reach a specific threshold.

Which solution will meet these requirements?

Options:

Add resource tagging by editing the SageMaker user profile in the SageMaker domain. Configure AWS Cost Explorer to send an alert when the threshold is reached.

Add resource tagging by editing the SageMaker user profile in the SageMaker domain. Configure AWS Budgets to send an alert when the threshold is reached.

Add resource tagging by editing each user ' s IAM profile. Configure AWS Cost Explorer to send an alert when the threshold is reached.

Add resource tagging by editing each user ' s IAM profile. Configure AWS Budgets to send an alert when the threshold is reached.

Buy Now

Questions 35

A company wants to develop an ML model by using tabular data from its customers. The data contains meaningful ordered features with sensitive information that should not be discarded. An ML engineer must ensure that the sensitive data is masked before another team starts to build the model.

Which solution will meet these requirements?

Options:

Use Amazon Made to categorize the sensitive data.

Prepare the data by using AWS Glue DataBrew.

Run an AWS Batch job to change the sensitive data to random values.

Run an Amazon EMR job to change the sensitive data to random values.

Buy Now

Questions 36

A company ' s ML engineer has deployed an ML model for sentiment analysis to an Amazon SageMaker endpoint. The ML engineer needs to explain to company stakeholders how the model makes predictions.

Which solution will provide an explanation for the model ' s predictions?

Options:

Use SageMaker Model Monitor on the deployed model.

Use SageMaker Clarify on the deployed model.

Show the distribution of inferences from A/В testing in Amazon CloudWatch.

Add a shadow endpoint. Analyze prediction differences on samples.

Buy Now

Questions 37

An ML engineer has developed a binary classification model outside of Amazon SageMaker. The ML engineer needs to make the model accessible to a SageMaker Canvas user for additional tuning.

The model artifacts are stored in an Amazon S3 bucket. The ML engineer and the Canvas user are part of the same SageMaker domain.

Which combination of requirements must be met so that the ML engineer can share the model with the Canvas user? (Choose two.)

Options:

The ML engineer and the Canvas user must be in separate SageMaker domains.

The Canvas user must have permissions to access the S3 bucket where the model artifacts are stored.

The model must be registered in the SageMaker Model Registry.

The ML engineer must host the model on AWS Marketplace.

The ML engineer must deploy the model to a SageMaker endpoint.

Buy Now

Questions 38

An ML engineer needs to deploy ML models to get inferences from large datasets in an asynchronous manner. The ML engineer also needs to implement scheduled monitoring of data quality for the models and must receive alerts when changes in data quality occur.

Which solution will meet these requirements?

Options:

Deploy the models by using scheduled AWS Glue jobs. Use Amazon CloudWatch alarms to monitor the data quality and send alerts.

Deploy the models by using scheduled AWS Batch jobs. Use AWS CloudTrail to monitor the data quality and send alerts.

Deploy the models by using Amazon ECS on AWS Fargate. Use Amazon EventBridge to monitor the data quality and send alerts.

Deploy the models by using Amazon SageMaker AI batch transform. Use SageMaker Model Monitor to monitor the data quality and send alerts.

Buy Now

Questions 39

A company wants to predict the success of advertising campaigns by considering the color scheme of each advertisement. An ML engineer is preparing data for a neural network model. The dataset includes color information as categorical data.

Which technique for feature engineering should the ML engineer use for the model?

Options:

Apply label encoding to the color categories. Automatically assign each color a unique integer.

Implement padding to ensure that all color feature vectors have the same length.

Perform dimensionality reduction on the color categories.

One-hot encode the color categories to transform the color scheme feature into a binary matrix.

Buy Now

Questions 40

A company launches a feature that predicts home prices. An ML engineer trained a regression model using the SageMaker AI XGBoost algorithm. The model performs well on training data but underperforms on real-world validation data.

Which solution will improve the validation score with the LEAST implementation effort?

Options:

Create a larger training dataset with more real-world data and retrain.

Increase the num_round hyperparameter.

Change the eval_metric from RMSE to Error.

Increase the lambda hyperparameter.

Buy Now

Questions 41

A company uses Amazon SageMaker for its ML workloads. The company ' s ML engineer receives a 50 MB Apache Parquet data file to build a fraud detection model. The file includes several correlated columns that are not required.

What should the ML engineer do to drop the unnecessary columns in the file with the LEAST effort?

Options:

Download the file to a local workstation. Perform one-hot encoding by using a custom Python script.

Create an Apache Spark job that uses a custom processing script on Amazon EMR.

Create a SageMaker processing job by calling the SageMaker Python SDK.

Create a data flow in SageMaker Data Wrangler. Configure a transform step.

Buy Now

Questions 42

An ML engineer develops a neural network model to predict whether customers will continue to subscribe to a service. The model performs well on training data. However, the accuracy of the model decreases significantly on evaluation data.

The ML engineer must resolve the model performance issue.

Which solution will meet this requirement?

Options:

Penalize large weights by using L1 or L2 regularization.

Remove dropout layers from the neural network.

Train the model for longer by increasing the number of epochs.

Capture complex patterns by increasing the number of layers.

Buy Now

Questions 43

An ML engineer is tuning an image classification model that performs poorly on one of two classes. The poorly performing class represents an extremely small fraction of the training dataset.

Which solution will improve the model’s performance?

Options:

Optimize for accuracy. Use image augmentation on the less common images.

Optimize for F1 score. Use image augmentation on the less common images.

Optimize for accuracy. Use SMOTE to generate synthetic images.

Optimize for F1 score. Use SMOTE to generate synthetic images.

Buy Now

Questions 44

A company is developing ML models by using PyTorch and TensorFlow estimators with Amazon SageMaker AI. An ML engineer configures the SageMaker AI estimator and now needs to initiate a training job that uses a training dataset.

Which SageMaker AI SDK method can initiate the training job?

Options:

fit method

create_model method

deploy method

predict method

Buy Now

Questions 45

An ML engineer needs to use AWS CloudFormation to create an ML model that an Amazon SageMaker endpoint will host.

Which resource should the ML engineer declare in the CloudFormation template to meet this requirement?

Options:

AWS::SageMaker::Model

AWS::SageMaker::Endpoint

AWS::SageMaker::NotebookInstance

AWS::SageMaker::Pipeline

Buy Now

Questions 46

A company uses an ML model to recommend videos to users. The model is deployed on Amazon SageMaker AI. The model performed well initially after deployment, but the model ' s performance has degraded over time.

Which solution can the company use to identify model drift in the future?

Options:

Create a monitoring job in SageMaker Model Monitor. Then create a baseline from the training dataset.

Create a baseline from the training dataset. Then create a monitoring job in SageMaker Model Monitor.

Create a baseline by using a built-in rule in SageMaker Clarify. Monitor the drift in Amazon CloudWatch.

Retrain the model on new data. Compare the retrained model ' s performance to the original model ' s performance.

Buy Now

Questions 47

A company is developing an application that reads animal descriptions from user prompts and generates images based on the information in the prompts. The application reads a message from an Amazon Simple Queue Service (Amazon SQS) queue. Then the application uses Amazon Titan Image Generator on Amazon Bedrock to generate an image based on the information in the message. Finally, the application removes the message from SQS queue.

Which IAM permissions should the company assign to the application ' s IAM role? (Select TWO.)

Options:

Allow the bedrock:InvokeModel action for the Amazon Titan Image Generator resource.

Allow the bedrock:Get* action for the Amazon Titan Image Generator resource.

Allow the sqs:ReceiveMessage action and the sqs:DeleteMessage action for the SQS queue resource.

Allow the sqs:GetQueueAttributes action and the sqs:DeleteMessage action for the SQS queue resource.

Allow the sagemaker:PutRecord* action for the Amazon Titan Image Generator resource.

Buy Now

Questions 48

A company has an ML model that is deployed to an Amazon SageMaker AI endpoint for real-time inference. The company needs to deploy a new model. The company must compare the new model’s performance to the currently deployed model ' s performance before shifting all traffic to the new model.

Which solution will meet these requirements with the LEAST operational effort?

Options:

Deploy the new model to a separate endpoint. Manually split traffic between the two endpoints.

Deploy the new model to a separate endpoint. Use Amazon CloudFront to distribute traffic between the two endpoints.

Deploy the new model as a shadow variant on the same endpoint as the current model. Route a portion of live traffic to the shadow model for evaluation.

Use AWS Lambda functions with custom logic to route traffic between the current model and the new model.

Buy Now

Questions 49

A company is building a deep learning model on Amazon SageMaker. The company uses a large amount of data as the training dataset. The company needs to optimize the model ' s hyperparameters to minimize the loss function on the validation dataset.

Which hyperparameter tuning strategy will accomplish this goal with the LEAST computation time?

Options:

Hyperbaric!

Grid search

Bayesian optimization

Random search

Buy Now

Questions 50

A company needs to run a batch data-processing job on Amazon EC2 instances. The job will run during the weekend and will take 90 minutes to finish running. The processing can handle interruptions. The company will run the job every weekend for the next 6 months.

Which EC2 instance purchasing option will meet these requirements MOST cost-effectively?

Options:

Spot Instances

Reserved Instances

On-Demand Instances

Dedicated Instances

Buy Now

Questions 51

A company is creating an ML model to identify defects in a product. The company has gathered a dataset and has stored the dataset in TIFF format in Amazon S3. The dataset contains 200 images in which the most common defects are visible. The dataset also contains 1,800 images in which there is no defect visible.

An ML engineer trains the model and notices poor performance in some classes. The ML engineer identifies a class imbalance problem in the dataset.

What should the ML engineer do to solve this problem?

Options:

Use a few hundred images and Amazon Rekognition Custom Labels to train a new model.

Undersample the 200 images in which the most common defects are visible.

Oversample the 200 images in which the most common defects are visible.

Use all 2,000 images and Amazon Rekognition Custom Labels to train a new model.

Buy Now

Questions 52

An ML engineer is using Amazon Quick Suite (previously known as Amazon QuickSight) anomaly detection to detect very high or very low machine operating temperatures compared to normal. The ML engineer sets the Severity parameter to Low and above. The ML engineer sets the Direction parameter to All.

What effect will the ML engineer observe in the anomaly detection results if the ML engineer changes the Direction parameter to Lower than expected?

Options:

Increased anomaly identification frequency and increased recall

Decreased anomaly identification frequency and decreased recall

Increased anomaly identification frequency and decreased recall

Decreased anomaly identification frequency and increased recall

Buy Now

Questions 53

A company is using Amazon SageMaker and millions of files to train an ML model. Each file is several megabytes in size. The files are stored in an Amazon S3 bucket. The company needs to improve training performance.

Which solution will meet these requirements in the LEAST amount of time?

Options:

Transfer the data to a new S3 bucket that provides S3 Express One Zone storage. Adjust the training job to use the new S3 bucket.

Create an Amazon FSx for Lustre file system. Link the file system to the existing S3 bucket. Adjust the training job to read from the file system.

Create an Amazon Elastic File System (Amazon EFS) file system. Transfer the existing data to the file system. Adjust the training job to read from the file system.

Create an Amazon ElastiCache (Redis OSS) cluster. Link the Redis OSS cluster to the existing S3 bucket. Stream the data from the Redis OSS cluster directly to the training job.

Buy Now

Questions 54

Case Study

A company is building a web-based AI application by using Amazon SageMaker. The application will provide the following capabilities and features: ML experimentation, training, a

central model registry, model deployment, and model monitoring.

The application must ensure secure and isolated use of training data during the ML lifecycle. The training data is stored in Amazon S3.

The company must implement a manual approval-based workflow to ensure that only approved models can be deployed to production endpoints.

Which solution will meet this requirement?

Options:

Use SageMaker Experiments to facilitate the approval process during model registration.

Use SageMaker ML Lineage Tracking on the central model registry. Create tracking entities for the approval process.

Use SageMaker Model Monitor to evaluate the performance of the model and to manage the approval.

Use SageMaker Pipelines. When a model version is registered, use the AWS SDK to change the approval status to " Approved. "

Buy Now

Answer:

Explanation:

To implement a manual approval-based workflow ensuring that only approved models are deployed to production endpoints, Amazon SageMaker provides integrated tools such as SageMaker Pipelines and the SageMaker Model Registry.

SageMaker Pipelines is a robust service for building, automating, and managing end-to-end machine learning workflows. It facilitates the orchestration of various steps in the ML lifecycle, including data preprocessing, model training, evaluation, and deployment. By integrating with the SageMaker Model Registry, it enables seamless tracking and management of model versions and their approval statuses.

Implementation Steps:

Define the Pipeline:

Create a SageMaker Pipeline encompassing steps for data preprocessing, model training, evaluation, and registration of the model in the Model Registry.

Incorporate a Condition Step to assess model performance metrics. If the model meets predefined criteria, proceed to the next step; otherwise, halt the process.

Utilize the RegisterModel step to add the trained model to the Model Registry.

Set the ModelApprovalStatus parameter to PendingManualApproval during registration. This status indicates that the model awaits manual review before deployment.

Manual Approval Process:

Notify the designated approver upon model registration. This can be achieved by integrating Amazon EventBridge to monitor registration events and trigger notifications via AWS Lambda functions.

The approver reviews the model ' s performance and, if satisfactory, updates the model ' s status to Approved using the AWS SDK or through the SageMaker Studio interface.

Deploy the Approved Model:

Configure the pipeline to automatically deploy models with an Approved status to the production endpoint. This can be managed by adding deployment steps conditioned on the model ' s approval status.

Advantages of This Approach:

Automated Workflow: SageMaker Pipelines streamline the ML workflow, reducing manual interventions and potential errors.

Governance and Compliance: The manual approval step ensures that only thoroughly evaluated models are deployed, aligning with organizational standards.

Scalability: The solution supports complex ML workflows, making it adaptable to various project requirements.

By implementing this solution, the company can establish a controlled and efficient process for deploying models, ensuring that only approved versions reach production environments.

[References:, Automate the machine learning model approval process with Amazon SageMaker Model Registry and Amazon SageMaker Pipelines, Update the Approval Status of a Model - Amazon SageMaker, , , , ]

Questions 55

A company runs an Amazon SageMaker domain in a public subnet of a newly created VPC. The network is configured properly, and ML engineers can access the SageMaker domain.

Recently, the company discovered suspicious traffic to the domain from a specific IP address. The company needs to block traffic from the specific IP address.

Which update to the network configuration will meet this requirement?

Options:

Create a security group inbound rule to deny traffic from the specific IP address. Assign the security group to the domain.

Create a network ACL inbound rule to deny traffic from the specific IP address. Assign the rule to the default network Ad for the subnet where the domain is located.

Create a shadow variant for the domain. Configure SageMaker Inference Recommender to send traffic from the specific IP address to the shadow endpoint.

Create a VPC route table to deny inbound traffic from the specific IP address. Assign the route table to the domain.

Buy Now

Questions 56

An ML engineer is using a training job to fine-tune a deep learning model in Amazon SageMaker Studio. The ML engineer previously used the same pre-trained model with a similar

dataset. The ML engineer expects vanishing gradient, underutilized GPU, and overfitting problems.

The ML engineer needs to implement a solution to detect these issues and to react in predefined ways when the issues occur. The solution also must provide comprehensive real-time metrics during the training.

Which solution will meet these requirements with the LEAST operational overhead?

Options:

Use TensorBoard to monitor the training job. Publish the findings to an Amazon Simple Notification Service (Amazon SNS) topic. Create an AWS Lambda function to consume the findings and to initiate the predefined actions.

Use Amazon CloudWatch default metrics to gain insights about the training job. Use the metrics to invoke an AWS Lambda function to initiate the predefined actions.

Expand the metrics in Amazon CloudWatch to include the gradients in each training step. Use the metrics to invoke an AWS Lambda function to initiate the predefined actions.

Use SageMaker Debugger built-in rules to monitor the training job. Configure the rules to initiate the predefined actions.

Buy Now

Questions 57

An ML engineer needs to use an Amazon EMR cluster to process large volumes of data in batches. Any data loss is unacceptable.

Which instance purchasing option will meet these requirements MOST cost-effectively?

Options:

Run the primary node, core nodes, and task nodes on On-Demand Instances.

Run the primary node, core nodes, and task nodes on Spot Instances.

Run the primary node on an On-Demand Instance. Run the core nodes and task nodes on Spot Instances.

Run the primary node and core nodes on On-Demand Instances. Run the task nodes on Spot Instances.

Buy Now

Questions 58

An ML engineer has trained an ML model by using Amazon SageMaker AI. The ML engineer determines that the model is overfitting and that the training data contains unnecessary features. The ML engineer must reduce the overfitting and the impact of the unnecessary features.

Which solution will meet these requirements?

Options:

Apply L1 regularization to the training data. Retrain the model.

Use SageMaker Debugger to apply L1 regularization to the running model.

Increase the number of training iterations. Retrain the model.

Decrease the number of training iterations. Retrain the model.

Buy Now

Questions 59

A company needs to combine data from multiple sources. The company must use Amazon Redshift Serverless to query an AWS Glue Data Catalog database and underlying data that is stored in an Amazon S3 bucket.

Select and order the correct steps from the following list to meet these requirements. Select each step one time or not at all. (Select and order three.)

• Attach the IAM role to the Redshift cluster.

• Attach the IAM role to the Redshift namespace.

• Create an external database in Amazon Redshift to point to the Data Catalog schema.

• Create an external schema in Amazon Redshift to point to the Data Catalog database.

• Create an IAM role for Amazon Redshift to use to access only the S3 bucket that contains underlying data.

• Create an IAM role for Amazon Redshift to use to access the Data Catalog and the S3 bucket that contains underlying data.

Options:

Buy Now

Questions 60

An ML engineer wants to deploy a workflow that processes streaming IoT sensor data and periodically retrains ML models. The most recent model versions must be deployed to production.

Which service will meet these requirements?

Options:

Amazon SageMaker Pipelines

Amazon Managed Workflows for Apache Airflow (MWAA)

AWS Lambda

Apache Spark

Buy Now

Questions 61

An ML engineer is building an ML model in Amazon SageMaker AI. The ML engineer needs to load historical data directly from Amazon S3, Amazon Athena, and Snowflake into SageMaker AI.

Which solution will meet this requirement?

Options:

Use AWS Glue DataBrew to import the data into SageMaker AI.

Build a pipeline in SageMaker Pipelines to process the data. Use AWS DataSync to load the processed data into SageMaker AI.

Create a feature store in SageMaker Feature Store. Use an Apache Spark connector to Feature Store to access the data.

Use SageMaker Data Wrangler to query and import the data.

Buy Now

Questions 62

A company has trained an ML model in Amazon SageMaker. The company needs to host the model to provide inferences in a production environment.

The model must be highly available and must respond with minimum latency. The size of each request will be between 1 KB and 3 MB. The model will receive unpredictable bursts of requests during the day. The inferences must adapt proportionally to the changes in demand.

How should the company deploy the model into production to meet these requirements?

Options:

Create a SageMaker real-time inference endpoint. Configure auto scaling. Configure the endpoint to present the existing model.

Deploy the model on an Amazon Elastic Container Service (Amazon ECS) cluster. Use ECS scheduled scaling that is based on the CPU of the ECS cluster.

Install SageMaker Operator on an Amazon Elastic Kubernetes Service (Amazon EKS) cluster. Deploy the model in Amazon EKS. Set horizontal pod auto scaling to scale replicas based on the memory metric.

Use Spot Instances with a Spot Fleet behind an Application Load Balancer (ALB) for inferences. Use the ALBRequestCountPerTarget metric as the metric for auto scaling.

Buy Now

Questions 63

An ML engineer needs to use Amazon SageMaker Feature Store to create and manage features to train a model.

Select and order the steps from the following list to create and use the features in Feature Store. Each step should be selected one time. (Select and order three.)

• Access the store to build datasets for training.

• Create a feature group.

• Ingest the records.

Options:

Buy Now

Questions 64

A company is using an AWS Lambda function to monitor the metrics from an ML model. An ML engineer needs to implement a solution to send an email message when the metrics breach a threshold.

Which solution will meet this requirement?

Options:

Log the metrics from the Lambda function to AWS CloudTrail. Configure a CloudTrail trail to send the email message.

Log the metrics from the Lambda function to Amazon CloudFront. Configure an Amazon CloudWatch alarm to send the email message.

Log the metrics from the Lambda function to Amazon CloudWatch. Configure a CloudWatch alarm to send the email message.

Log the metrics from the Lambda function to Amazon CloudWatch. Configure an Amazon CloudFront rule to send the email message.

Buy Now

Questions 65

A logistics company has installed in-vehicle cameras for basic monitoring of its drivers. The company wants to improve driver safety by identifying distractions that could lead to accidents.

Which solution will meet this requirement with the LEAST operational effort?

Options:

Use Amazon Rekognition eye gaze direction detection to monitor driver behavior and identify distractions.

Use Amazon SageMaker AI to customize an AI model to monitor driver behavior and identify distractions.

Integrate a third-party driver monitoring system with Amazon Rekognition to monitor driver behavior and identify distractions.

Use Amazon Comprehend to analyze text-based driver feedback and identify distractions.

Buy Now

Questions 66

A company collects customer data every day. The company stores the data as compressed files in an Amazon S3 bucket that is partitioned by date. Every month, analysts download the data, process the data to check the data quality, and then upload the data to Amazon QuickSight dashboards.

An ML engineer needs to implement a solution to automatically check the data quality before the data is sent to QuickSight.

Which solution will meet these requirements with the LEAST operational overhead?

Options:

Run an AWS Glue crawler every month to update the AWS Glue Data Catalog. Use AWS Glue Data Quality rules to check the data quality.

Use an AWS Glue trigger to run an AWS Glue crawler every month to update the AWS Glue Data Catalog. Create an AWS Glue job that loads the data into a PySpark DataFrame. Configure the job to apply custom functions and to evaluate the data quality.

Run Python scripts on an AWS Lambda function every month to evaluate data quality. Configure the S3 bucket to invoke the Lambda function when objects are added to the S3 bucket.

Configure the S3 bucket to send event notifications to an Amazon Simple Queue Service (Amazon SQS) queue when objects are uploaded. Use Amazon CloudWatch insights every month for the SQS queue to evaluate the data quality.

Buy Now

Questions 67

An ML engineer needs to use an ML model to predict the price of apartments in a specific location.

Which metric should the ML engineer use to evaluate the model’s performance?

Options:

Accuracy

Area Under the ROC Curve (AUC)

F1 score

Mean absolute error (MAE)

Buy Now

Questions 68

A company ' s ML engineer is creating a classification model. The ML engineer explores the dataset and notices a column named day_of_week. The column contains the following values: Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, and Sunday.

Which technique should the ML engineer use to convert this column’s data to binary values?

Options:

Binary encoding

Label encoding

One-hot encoding

Tokenization

Buy Now

Questions 69

A company is using Amazon SageMaker to create ML models. The company ' s data scientists need fine-grained control of the ML workflows that they orchestrate. The data scientists also need the ability to visualize SageMaker jobs and workflows as a directed acyclic graph (DAG). The data scientists must keep a running history of model discovery experiments and must establish model governance for auditing and compliance verifications.

Which solution will meet these requirements?

Options:

Use AWS CodePipeline and its integration with SageMaker Studio to manage the entire ML workflows. Use SageMaker ML Lineage Tracking for the running history of experiments and for auditing and compliance verifications.

Use AWS CodePipeline and its integration with SageMaker Experiments to manage the entire ML workflows. Use SageMaker Experiments for the running history of experiments and for auditing and compliance verifications.

Use SageMaker Pipelines and its integration with SageMaker Studio to manage the entire ML workflows. Use SageMaker ML Lineage Tracking for the running history of experiments and for auditing and compliance verifications.

Use SageMaker Pipelines and its integration with SageMaker Experiments to manage the entire ML workflows. Use SageMaker Experiments for the running history of experiments and for auditing and compliance verifications.

Buy Now

Questions 70

An ML engineering team is spread across multiple locations. When the lead ML engineer opens an Amazon SageMaker AI notebook, the ML engineer does not see the latest merged notebook made by other team members from a Git repository.

The lead ML engineer must see the latest SageMaker AI notebook updates.

Which solution will meet this requirement?

Options:

Run the !git pull origin master command.

Run the !git commit command.

Run the !git push origin master command.

Run the !git branch command.

Buy Now

Questions 71

An ML engineer uses an Amazon SageMaker AI notebook instance to run a training job that trains a neural network model with an estimator. The training job loads data iteratively from an Amazon S3 path that is configured as an environment variable. The ML engineer viewed a profiling report of the training job. The ML engineer discovered that a substantial amount of the training time is spent during data loading.

How can the ML engineer improve the training speed?

Options:

Provision Amazon Elastic Block Store (Amazon EBS) Provisioned IOPS SSD io1 storage during the estimator initialization. Download the training data from the S3 path to Amazon EBS. Point the data loader to the EBS location.

Provision Amazon Elastic File System (Amazon EFS) storage during the estimator initialization. Download the training data to Amazon EFS by using the S3 path. Point the data loader to the EFS location.

Download the training data to the estimator by using fast file mode. Point the data loader to the location specified by the S3 path.

Configure the path to the S3 bucket that contains the training data as a hyperparameter instead of an environment variable.

Buy Now

Questions 72

A company uses a training job on Amazon SageMaker Al to train a neural network. The job first trains a model and then evaluates the model ' s performance ag

test dataset. The company uses the results from the evaluation phase to decide if the trained model will go to production.

The training phase takes too long. The company needs solutions that can shorten training time without decreasing the model ' s final performance.

Select the correct solutions from the following list to meet the requirements for each description. Select each solution one time or not at all. (Select THREE.)

. Change the epoch count.

. Choose an Amazon EC2 Spot Fleet.

· Change the batch size.

. Use early stopping on the training job.

· Use the SageMaker Al distributed data parallelism (SMDDP) library.

. Stop the training job.

Options:

Buy Now

Exam Code: MLA-C01

Exam Name: AWS Certified Machine Learning Engineer - Associate

Last Update: Jul 27, 2026

Questions: 241

PDF + Testing Engine

$59.99 ~~$171.4~~

Add to Cart

Testing Engine

$44.99 ~~$128.55~~

Add to Cart

PDF (Q&A)

$49.99 ~~$142.82~~

Add to Cart

Summer Sale - Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 65percent

dumpspedia logo

Navigation:

MLA-C01 Sample Questions Answers

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options: