Labour Day Sale - Limited Time 60% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 575363r9

Welcome To DumpsPedia

Databricks-Certified-Professional-Data-Scientist Sample Questions Answers

Questions 4

Spam filtering of the emails is an example of

Options:

A.

Supervised learning

B.

Unsupervised learning

C.

Clustering

D.

1 and 3 are correct

E.

2 and 3 are correct

Buy Now
Questions 5

What describes a true limitation of Logistic Regression method?

Options:

A.

It does not handle redundant variables well.

B.

It does not handle missing values well.

C.

It does not handle correlated variables well.

D.

It does not have explanatory values.

Buy Now
Questions 6

Assume some output variable "y" is a linear combination of some independent input variables "A" plus some independent noise "e". The way the independent variables are combined is defined by a parameter vector B y=AB+e where X is an m x n matrix. B is a vector of n unknowns, and b is a vector of m values. Assuming that m is not equal to n and the columns of X are linearly independent, which expression correctly solves for B?

Options:

A.

Option A

B.

Option B

C.

Option C

D.

Option D

Buy Now
Questions 7

Suppose a man told you he had a nice conversation with someone on the train. Not knowing anything about this conversation, the probability that he was speaking to a woman is 50% (assuming the train had an equal number of men and women and the speaker was as likely to strike up a conversation with a man as with a woman). Now suppose he also told you that his conversational partner had long hair. It is now more

likely he was speaking to a woman, since women are more likely to have long hair than men.____________

can be used to calculate the probability that the person was a woman.

Options:

A.

SVM

B.

MLE

C.

Bayes' theorem

D.

Logistic Regression

Buy Now
Questions 8

A denote the event 'student is female' and let B denote the event 'student is French'. In a class of 100 students suppose 60 are French, and suppose that 10 of the French students are females. Find the probability that if I pick a French student, it will be a girl, that is, find P(A|B).

Options:

A.

1/3

B.

2/3

C.

1/6

D.

2/6

Buy Now
Questions 9

In which of the scenario you can use the linear regression model?

Options:

A.

Predicting Home Price based on the location and house area

B.

Predicting demand of the goods and services based on the weather

C.

Predicting tumor size reduction based on input as number of radiation treatment

D.

Predicting sales of the text book based on the number of students in state

Buy Now
Questions 10

Select the correct option from the below

Options:

A.

If you're trying to predict or forecast a target value^ then you need to look into supervised learning.

B.

If you've chosen supervised learning, with discrete target value like Yes/No. 1/2/3, A/B/C: or Red/Yellow/Black, then look into classification.

C.

If the target value can take on a number of values, say any value from 0.00 to 100.00, or -999 to 999: or +_to -_, then you need to look unsupervised learning

D.

If you're not trying to predict a target value, then you need to look into unsupervised learning

E.

Are you trying to fit your data into some discrete groups? If so and that's all you need, you should look into clustering.

Buy Now
Questions 11

Select the sequence of the developing machine learning applications

A) Analyze the input data

B) Prepare the input data

C) Collect data

D) Train the algorithm

E) Test the algorithm

F) Use It

Options:

A.

A, B, C, D, E, F

B.

C, B, A, D, E, F

C.

C, A, B, D, E, F

D.

C, B, A, D, E, F

Buy Now
Questions 12

Which of the following are point estimation methods?

Options:

A.

MAP

B.

MLE

C.

MMSE

Buy Now
Questions 13

You are creating a model for the recommending the book at Amazon.com, so which of the following recommender system you will use you don't have cold start problem?

Options:

A.

Naive Bayes classifier

B.

Item-based collaborative filtering

C.

User-based collaborative filtering

D.

Content-based filtering

Buy Now
Questions 14

Question-34. Stories appear in the front page of Digg as they are "voted up" (rated positively) by the community. As the community becomes larger and more diverse, the promoted stories can better reflect the average interest of the community members. Which of the following technique is used to make such recommendation engine?

Options:

A.

Naive Bayes classifier

B.

Collaborative filtering

C.

Logistic Regression

D.

Content-based filtering

Buy Now
Questions 15

You are working with the Clustering solution of the customer datasets. There are almost 40 variables are available for each customer and almost 1.00,0000 customer's data is available. You want to reduce the number of variables for clustering, what would you do?

Options:

A.

You will randomly reduce the number of variables

B.

You will find the correlation among the variables and from their variables are not co-related will be discarded.

C.

You will find the correlation among the variables and from the highly co-related variables, you will be considering only one or two variables from it.

D.

You cannot discard any variable for creating clusters.

E.

You can combine several variables in one variable

Buy Now
Questions 16

In which lifecycle stage are appropriate analytical techniques determined?

Options:

A.

Model planning

B.

Model building

C.

Data preparation

D.

Discovery

Buy Now
Questions 17

If E1 and E2 are two events, how do you represent the conditional probability given that E2 occurs given that E1 has occurred?

Options:

A.

P(E1)/P(E2)

B.

P(E1+E2)/P(E1)

C.

P(E2)/P(E1)

D.

P(E2)/(P(E1+E2)

Buy Now
Questions 18

Which of the following could be features?

Options:

A.

Words in the document

B.

Symptoms of a diseases

C.

Characteristics of an unidentified object

D.

0nly 1 and 2

E.

All 1,2 and 3 are possible

Buy Now
Questions 19

You are working on a Data Science project and during the project you have been gibe a responsibility to interview all the stakeholders in the project. In which phase of the project you are?

Options:

A.

Discovery

B.

Data Preparations

C.

Creating Models

D.

Executing Models

E.

Creating visuals from the outcome

F.

Operationnalise the models

Buy Now
Questions 20

Consider flipping a coin for which the probability of heads is p, where p is unknown, and our goa is to estimate p. The obvious approach is to count how many times the coin came up heads and divide by the total number of coin flips. If we flip the coin 1000 times and it comes up heads 367 times, it is very reasonable to estimate p as approximately 0.367. However, suppose we flip the coin only twice and we get heads both times. Is it reasonable to estimate p as 1.0? Intuitively, given that we only flipped the coin twice, it seems a bit

rash to conclude that the coin will always come up heads, and____________is a way of avoiding such rash

conclusions.

Options:

A.

Naive Bayes

B.

Laplace Smoothing

C.

Logistic Regression

D.

Linear Regression

Buy Now
Exam Code: Databricks-Certified-Professional-Data-Scientist
Exam Name: Databricks Certified Professional Data Scientist Exam
Last Update: Apr 26, 2024
Questions: 138
$64  $159.99
$48  $119.99
$40  $99.99
buy now Databricks-Certified-Professional-Data-Scientist