Databricks-Machine-Learning-Professional Sample Questions Answers

Questions 4

A machine learning engineering team wants to build a continuous pipeline for data preparation of a machine learning application. The team would like the data to be fully processed and made ready for inference in a series of equal-sized batches.

Which of the following tools can be used to provide this type of continuous processing?

Options:

Spark UDFs

[Structured Streaming

MLflow

D Delta Lake

AutoML

Buy Now

Questions 5

A machine learning engineer wants to view all of the active MLflow Model Registry Webhooks for a specific model.

They are using the following code block:

Which of the following changes does the machine learning engineer need to make to this code block so it will successfully accomplish the task?

Options:

There are no necessary changes

Replace list with view in the endpoint URL

Replace POST with GET in the call to http request

Replace list with webhooks in the endpoint URL

Replace POST with PUT in the call to http request

Buy Now

Questions 6

A machine learning engineer is attempting to create a webhook that will trigger a Databricks Jobjob_idwhen a model version for modelmodeltransitions into any MLflow Model Registry stage.

They have the following incomplete code block:

Which of the following lines of code can be used to fill in the blank so that the code block accomplishes the task?

Options:

"MODEL_VERSION_CREATED"

"MODEL_VERSION_TRANSITIONED_TO_PRODUCTION"

"MODEL_VERSION_TRANSITIONED_TO_STAGING"

"MODEL_VERSION_TRANSITIONED_STAGE"

"MODEL_VERSION_TRANSITIONED_TO_STAGING", "MODEL_VERSION_TRANSITIONED_TO_PRODUCTION"

Buy Now

Questions 7

A machine learning engineer wants to log and deploy a model as an MLflow pyfunc model. They have custom preprocessing that needs to be completed on feature variables prior to fitting the model or computing predictions using that model. They decide to wrap this preprocessing in a custom model class ModelWithPreprocess, where the preprocessing is performed when calling fit and when calling predict. They then log the fitted model of the ModelWithPreprocess class as a pyfunc model.

Which of the following is a benefit of this approach when loading the logged pyfunc model for downstream deployment?

Options:

The pvfunc model can be used to deploy models in a parallelizable fashion

The same preprocessing logic will automatically be applied when calling fit

The same preprocessing logic will automatically be applied when calling predict

This approach has no impact when loading the logged Pvfunc model for downstream deployment

There is no longer a need for pipeline-like machine learning objects

Buy Now

Questions 8

A data scientist has developed a scikit-learn random forest model model, but they have not yet logged model with MLflow. They want to obtain the input schema and the output schema of the model so they can document what type of data is expected as input.

Which of the following MLflow operations can be used to perform this task?

Options:

mlflow.models.schema.infer_schema

mlflow.models.signature.infer_signature

mlflow.models.Model.get_input_schema

mlflow.models.Model.signature

There is no way to obtain the input schema and the output schema of an unlogged model.

Buy Now

Questions 9

Which of the following describes the purpose of the context parameter in the predict method of Python models for MLflow?

Options:

The context parameter allows the user to specify which version of the registered MLflowModel should be used based on the given application's current scenario

The context parameter allows the user to document the performance of a model after it has been deployed

The context parameter allows the user to include relevant details of the business case to allow downstream users to understand the purpose of the model

The context parameter allows the user to provide the model with completely custom if-else logic for the given application's current scenario

The context parameter allows the user to provide the model access to objects like preprocessing models or custom configuration files

Buy Now

Questions 10

A data scientist is using MLflow to track their machine learning experiment. As a part of each MLflow run, they are performing hyperparameter tuning. The data scientist would like to have one parent run for the tuning process with a child run for each unique combination of hyperparameter values.

They are using the following code block:

The code block is not nesting the runs in MLflow as they expected.

Which of the following changes does the data scientist need to make to the above code block so that it successfully nests the child runs under the parent run in MLflow?

Options:

Indent the child run blocks within the parent run block

Add the nested=True argument to the parent run

Remove the nested=True argument from the child runs

Provide the same name to the run name parameter for all three run blocks

Add the nested=True argument to the parent run and remove the nested=True arguments from the child runs

Buy Now

Questions 11

A data scientist has computed updated feature values for all primary key values stored in the Feature Store table features. In addition, feature values for some new primary key values have also been computed. The updated feature values are stored in the DataFrame features_df. They want to replace all data in features with the newly computed data.

Which of the following code blocks can they use to perform this task using the Feature Store Client fs?

Options:

Option A

Option B

Option C

Option D

Option E

Buy Now

Questions 12

A machine learning engineer has developed a random forest model using scikit-learn, logged the model using MLflow as random_forest_model, and stored its run ID in the run_id Python variable. They now want to deploy that model by performing batch inference on a Spark DataFrame spark_df.

Which of the following code blocks can they use to create a function called predict that they can use to complete the task?

It is not possible to deploy a scikit-learn model on a Spark DataFrame.

Options:

Option A

Option B

Option C

Option D

Option E

Buy Now

Questions 13

Which of the following MLflow operations can be used to delete a model from the MLflow Model Registry?

Options:

client.transition_model_version_stage

client.delete_model_version

client.update_registered_model

client.delete_model

client.delete_registered_model

Buy Now

Questions 14

Which of the following deployment paradigms can centrally compute predictions for a single record with exceedingly fast results?

Options:

Streaming

Batch

Edge/on-device

None of these strategies will accomplish the task.

Real-time

Buy Now

Questions 15

A machine learning engineer needs to deliver predictions of a machine learning model in real-time. However, the feature values needed for computing the predictions are available one week before the query time.

Which of the following is a benefit of using a batch serving deployment in this scenario rather than a real-time serving deployment where predictions are computed at query time?

Options:

Batch servinghas built-in capabilities in Databricks Machine Learning

There is no advantage to using batch serving deployments over real-time serving deployments

Computing predictions in real-time provides more up-to-date results

Testing is not possible in real-time serving deployments

Querying stored predictions can be faster than computing predictions in real-time

Buy Now

Questions 16

Which of the following describes the concept of MLflow Model flavors?

Options:

A convention that deployment tools can use to wrap preprocessing logic into a Model

A convention that MLflow Model Registry can use to version models

A convention that MLflow Experiments can use to organize their Runs by project

A convention that deployment tools can use to understand the model

A convention that MLflow Model Registrycan use to organize its Models by project

Buy Now

Questions 17

A data scientist wants to remove the star_rating column from the Delta table at the location path. To do this, they need to load in data and drop the star_rating column.

Which of the following code blocks accomplishes this task?

Options:

spark.read.format(“delta”).load(path).drop(“star_rating”)

spark.read.format(“delta”).table(path).drop(“star_rating”)

Delta tables cannot be modified

spark.read.table(path).drop(“star_rating”)

spark.sql(“SELECT * EXCEPT star_rating FROM path”)

Buy Now

Questions 18

Which of the following describes concept drift?

Options:

Concept drift is when there is a change in the distribution of an input variable

Concept drift is when there is a change in the distribution of a target variable

Concept drift is when there is a change in the relationship between input variables and target variables

Concept drift is when there is a change in the distribution of the predicted target given by the model

None of these describe Concept drift

Buy Now

Exam Code: Databricks-Machine-Learning-Professional

Exam Name: Databricks Certified Machine Learning Professional

Last Update: May 6, 2024

Questions: 60

PDF + Testing Engine

$64 ~~$159.99~~

Testing Engine (only)

$48 ~~$119.99~~

PDF (only)

$40 ~~$99.99~~

buy now Databricks-Machine-Learning-Professional

Weekend Sale - Limited Time 60% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 575363r9

dumpspedia logo

Navigation:

Databricks-Machine-Learning-Professional Sample Questions Answers

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Options:

Answer:

Explanation:

Quick Links

Why Us

Site Secure