Summer Special Sale - Limited Time 60% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: 575363r9

Welcome To DumpsPedia

DA0-001 Sample Questions Answers

Questions 4

Which of the following is a non-parametric test?

Options:

A.

One-sample t-test

B.

Two-way ANOVA

C.

Correlation coefficient

D.

Spearman's rank correlation

Buy Now
Questions 5

An e-commerce company recently tested a new website layout. The website was tested by a test group of customers, and an old website was presented to a control group. The table below shows the percentage of users in each group who made purchases on the websites:

Which of the following conclusions is accurate at a 95% confidence interval?

Options:

A.

In Germany, the increase in conversion from the new layout was not significant.

B.

In France, the increase in conversion from the new layout was not significant.

C.

In general, users who visit the new website are more likely to make a purchase.

D.

The new layout has the lowest conversion rates in the United Kingdom.

Buy Now
Questions 6

A recurring event is being stored in two databases that are housed in different geographical locations. A data analyst notices the event is being logged three hours earlier in one database than in the other database. Which of the following is the MOST likely cause of the issue?

Options:

A.

The data analyst is not querying the databases correctly.

B.

The databases are recording different events.

C.

The databases are recording the event in different time zones.

D.

The second database is logging incorrectly.

Buy Now
Questions 7

Five dogs have the following heights in millimeters:

300,430, 170, 470, 600

Which of the following is the standard deviation for the five dogs?

Options:

A.

147mm

B.

154mm

C.

394 mm

D.

21,704mm

Buy Now
Questions 8

Which of the following best describes an exploratory analysis?

Options:

A.

Involves the use of descriptive statistics to understand observations

B.

Involves analysis of exploring data sets for performance tracking

C.

Involves the testing of specific hypotheses

D.

Involves the use of arithmetic algebra to determine the distribution

Buy Now
Questions 9

Which of the following is a characteristic of a relational database?

Options:

A.

It utilizes key-value pairs.

B.

It has undefined fields.

C.

It is structured in nature.

D.

It uses minimal memory.

Buy Now
Questions 10

Given the customer table below:

Which of the following chart types is the most appropriate to represent the average spending of active customers vs. inactive customers?

Options:

A.

Pie chart

B.

Heat graph

C.

Scatter plot

D.

Line chart

Buy Now
Questions 11

Joseph is interpreting a left skewed distribution of test scores. Joe scored at the mean, Alfonso scored at the median, and gaby scored and the end of the tail.

Who had the highest score?

Options:

A.

Joseph

B.

Joe

C.

Alfonso

D.

Gaby

Buy Now
Questions 12

A county in Illinois is conducting a survey to determine the mean annual income per household. The county is 427sq mi (2.65q km). Which of the following sampling methods would MOST likely result in a representative sample?

Options:

A.

A stratified phone survey of 100 people that is conducted between 2:00 p.m. and 3:00 p.m.

B.

A systematic survey that is sent to 100 single-family homes in the county

C.

Surveys sent to ten randomly selected homes within 5mi (8km) of the county’s office

D.

Surveys sent to 100 randomly selected homes that are reflective of the population

Buy Now
Questions 13

Which of the following would be used to store unstructured data from different sources?

Options:

A.

A data lake

B.

A database management system

C.

A database

D.

A data warehouse

Buy Now
Questions 14

An analyst needs to provide a chart to identify the composition between the categories of the survey response data set:

Which of the following charts would be BEST to use?

Options:

A.

Histogram

B.

Pie

C.

Line

D.

Scatter pot

E.

Waterfall

Buy Now
Questions 15

A database consists of one fact table that is composed of multiple dimensions. Depending on the dimension, each one can be represented by a denormalized table or multiple normalized tables. This structure is an example of a:

Options:

A.

transactional schema.

B.

star schema.

C.

non-relational schema.

D.

snowflake schema.

Buy Now
Questions 16

A user receives a large custom report to track company sales across various date ranges. The user then completes a series of manual calculations for each date range. Which of the following should an analyst suggest so the user has a dynamic, seamless experience?

Options:

A.

Create multiple reports, one for each needed date range.

B.

Build calculations into the report so they are done automatically.

C.

Add macros to the report to speed up the filtering and calculations process.

D.

Create a dashboard with a date range picker and calculations built in.

Buy Now
Questions 17

A customer survey reveals 90% positive feedback. Which of the following statistical methods would be best to utilize to determine the reliability of a data set and predict how a larger sample of customers over the same time period might respond?

Options:

A.

Calculate a high variance on survey responses.

B.

Calculate the maximum range of the survey responses.

C.

Calculate a low standard deviation on survey responses.

D.

Remove any data more than 4 standard deviation from the mean.

Buy Now
Questions 18

A user imports a data file into the accounts payable system each day. On a regular basis. the field input is not what the system is expecting. so it results in an error for the row and a broken import process. To resolve the issue, the user opens the file, finds the error in the row, and manually corrects it before attempting the import again. The import sometimes breaks on subsequent attempts. though. Which of the following changes should be made to this process to reduce the number of errors?

Options:

A.

Delete all incorrect inputs and upload the corrected file.

B.

Have the user manually review the file for data completeness before loading it

C.

Create a data field to data type validator to run the file through prior to import.

D.

Spot-check the file prior to import to catch and correct field errors.

Buy Now
Questions 19

Which one of the following would not normally be considered a summary statistic?

Options:

A.

z-score.

B.

Mean.

C.

Variance.

D.

Standard deviation.

Buy Now
Questions 20

Which of the following can be used to translate data into another form so it can only be read by a user who has a key or a password?

Options:

A.

Data encryption.

B.

Data transmission.

C.

Data protection.

D.

Data masking.

Buy Now
Questions 21

An analyst is building a new dashboard for a user. After an initial conversation with the user. the analyst created a mock-up of the dashboard. Which of the following best explains why the analyst created the mock-up?

Options:

A.

To identify the dimensions and measures

B.

To send to the client after deploying the dashboard to production

C.

To confirm important details before dashboard development begins

D.

To receive client approval for the final dashboard design

Buy Now
Questions 22

You should always choose the analytics tool that is most appropriate for any given situation, even if that means acquiring a new tool.

Options:

A.

True.

B.

False.

Buy Now
Questions 23

A data analyst needs to perform a full outer join of a customer's orders using the tables below:

Which of the following is the mean of the order quantity?

Options:

A.

73.5

B.

76.5

C.

78.8

D.

81.5

Buy Now
Questions 24

Which of the following is a control measure for preventing a data breach?

Options:

A.

Data transmission

B.

Data attribution

C.

Data retention

D.

Data encryption

Buy Now
Questions 25

Which of the following best describes the process of examining data for statistics and information about the data?

  • Cleansing

Options:

A.

search

B.

Profiling

C.

Governance

Buy Now
Questions 26

Which of the following best describes how discrete data differs from continuous data?

Options:

A.

Discrete data cannot create a sloped line.

B.

Discrete data can only be a finite number of values.

C.

Discrete data can have decimal points.

D.

Discrete data applies only to numbers.

Buy Now
Questions 27

Which of the following techniques is used to quantify data?

Options:

A.

Decoding

B.

Enumeration

C.

Coding

D.

Structure

Buy Now
Questions 28

Emma is working in a data warehouse and finds a finance fact table links to an organization dimension, which in turn links to a currency dimension that not linked to the fact table.

What type of design pattern is the data warehouse using?

Options:

A.

Star.

B.

Sun.

C.

Snowflake.

D.

Comet.

Buy Now
Questions 29

Different people manually type a series of handwritten surveys into an online database. Which of the following issues will MOST likely arise with this data? (Choose two.)

Options:

A.

Data accuracy

B.

Data constraints

C.

Data attribute limitations

D.

Data bias

E.

Data consistency

F.

Data manipulation

Buy Now
Questions 30

Which of the following data sampling methods involves dividing a population into subgroups by similar characteristics?

Options:

A.

Systematic

B.

Simple random

C.

Convenience

D.

Stratified

Buy Now
Questions 31

An analyst is preparing a report that contains weather data. The temperatures are shown in Fahrenheit. but they must be reported in Celsius. Which of the following should the analyst do to fix this issue?

Options:

A.

Normalize the data.

B.

Standardize the data.

C.

Rescale the data.

D.

Aggregate the data.

Buy Now
Questions 32

The director of operations at a power company needs data to help identify where company resources should be allocated in order to monitor activity for outages and restoration of power in the entire state. Specifically, the director wants to see the following:

* County outages

* Status

* Overall trend of outages

INSTRUCTIONS:

Please, select each visualization to fit the appropriate space on the dashboard and choose an appropriate color scheme. Once you have selected all visualizations, please, select the appropriate titles and labels, if applicable. Titles and labels may be used more than once.

If at any time you would like to bring back the initial state of the simulation, please click the Reset All button.

Options:

Buy Now
Questions 33

A data analyst has been asked to derive a new variable labeled “Promotion_flag” based on the total quantity sold by each salesperson. Given the table below:

Which of the following functions would the analyst consider appropriate to flag “Yes” for every salesperson who has a number above 1,000,000 in the Quantity_sold column?

Options:

A.

Date

B.

Mathematical

C.

Logical

D.

Aggregate

Buy Now
Questions 34

A report is scheduled to run and be distributed at the end of business each day. On Mondays, one of the recipients opens the previous week's reports and combines them to calculate the weekly totals and projections for the coming week. This is a tedious process, and the recipient asks an analyst for help. Which of the following should the analyst recommend?

Options:

A.

Add calculation fields to the daily report so the totals are built in.

B.

Create a new report with weekly totals set to run at the end of business on Friday.

C.

Provide a daily summary to the report with totals to save the user the effort of manual calculations.

D.

Reduce the frequency of the report to once a week and change the date range.

Buy Now
Questions 35

A marketing analytics team received customer transaction data from two different sources. The data is complete and accurate; however, the field names appear to be inconsistent. Given the following tables:

Which of the following is considered best practice if the team wants to consolidate the files and conduct further analysis?

Options:

A.

Standardize the field names.

B.

Recode the data values.

C.

Overwrite the field names in one of the tables.

D.

Edit the field names in the data dictionary.

Buy Now
Questions 36

Which of the following describes the method of sampling in which elements of data are selected randomly from each of the small subgroups within a population?

Options:

A.

Simple random

B.

Cluster

C.

Systematic

D.

Stratified

Buy Now
Questions 37

The current date is July 14, 2020. A data analyst has been asked to create a report that shows the company's year-over-year Q2 2020 sales. Which of the following reports should the analyst compare?

Options:

A.

Q2 2020 and Q4 2019

B.

YTD 2020 and YTD 2019

C.

Q2 2020 and Q2 2019

D.

Q2 2020 and Q2 2021

Buy Now
Questions 38

Given the table below:

Which of the following variables can be considered inconsistent, and how many distinct values should the variable have?

Options:

A.

Name, one

B.

Gender, two

C.

Level, three

D.

Code, four

E.

Region, five

Buy Now
Questions 39

Which of the following is used for calculations and pivot tables?

Options:

A.

IBM SPSS

B.

SAS

C.

Microsoft Excel

D.

Domo

Buy Now
Questions 40

Consider the following dataset which contains information about houses that are for sale:

Which of the following string manipulation commands will combine the address and region name columns to create a full address?

full_address------------------------- 85 Turner St, Northern Metropolitan 25 Bloomburg St, Northern Metropolitan 5 Charles St, Northern Metropolitan 40 Federation La, Northern Metropolitan 55a Park St, Northern Metropolitan

Options:

A.

SELECT CONCAT(address, ' , ' , regionname) AS full_address FROM melb LIMIT 5;

B.

SELECT CONCAT(address, '-' , regionname) AS full_address FROM melb LIMIT 5;

C.

SELECT CONCAT(regionname, ' , ' , address) AS full_address FROM melb LIMIT 5

D.

SELECT CONCAT(regionname, '-' , address) AS full_address FROM melb LIMIT 5;

Buy Now
Questions 41

After completing web scraping, which of the following file formats needs to be parsed?

Options:

A.

.html

B.

.txt

C.

.csv

D.

.tsv

Buy Now
Questions 42

Given the below:

Which of the following numbers represents a Type I error?

Options:

A.

1

B.

2

C.

3

D.

4

Buy Now
Questions 43

Which of the following is a process that is used during data integration to collect, blend, and load data?

Options:

A.

MDM

B.

ETL

C.

OLTP

D.

BI

Buy Now
Questions 44

Jenny wants to study the academic performance of undergraduate sophomores and wants to determine the average grade point average at different points during an academic year.

What best describes the data set she needs?

Options:

A.

Sample.

B.

Observation.

C.

Variable.

D.

Population.

Buy Now
Questions 45

Which of the following roles is responsible for ensuring an organization's data quality, security, privacy, and regulatory compliance?

Options:

A.

Data owner.

B.

Data steward.

C.

Data custodian.

D.

Data processor.

Buy Now
Questions 46

A data analyst must fulfill a request for information that is needed weekly and should be automatically emailed to a specific set of users. Which of the following types of reports should the analyst recommend?

Options:

A.

A self-service report

B.

A research report

C.

An ad hoc report

D.

An operational report

Buy Now
Questions 47

Daniel is using the structured Query language to work with data stored in relational database.

He would like to add several new rows to a database table.

What command should he use?

Options:

A.

SELECT.

B.

ALTER.

C.

INSERT.

D.

UPDATE.

Buy Now
Questions 48

Which of the following reports can be used when insight into operational performance is needed each Wednesday?

Options:

A.

Static report

B.

Tactical report

C.

Recurring report

D.

Ad hoc report

Buy Now
Questions 49

Which of the following would be considered non-personally identifiable information?

Options:

A.

Cell phone device name

B.

Customer’s name

C.

Government ID number

D.

Telephone number

Buy Now
Questions 50

Which of the following is the correct data type for text?

Options:

A.

Boolean

B.

String

C.

Integer

D.

Float

Buy Now
Questions 51

A JSON file is an example of:

Options:

A.

structured data.

B.

web data.

C.

machine data.

D.

processed data.

Buy Now
Questions 52

A data analyst is performing a data merge within a spreadsheet using the tables below:

https://www.bing.com/images/blob?bcid=S1XCF9p02M4GjpbGxHj0lrIaj9sw.....4c

The analyst is attempting to pull the addresses from Table 2 into Table 1 using the last names and is receiving an error message. Which of the following steps can the analyst perform to fix the error?

Options:

A.

Use concatenate to combine the tables.

B.

Ensure the formula is pulling from right to left.

C.

Sort the data by the last name field.

D.

Review the spelling and data type.

Buy Now
Questions 53

Amanda needs to create a dashboard that will draw information from many other data sources and present it to business leaders.

Which one of the following tools is least likely to meet her needs?

Options:

A.

QuickSight.

B.

Tableau.

C.

Power BI.

D.

SPSS Modeler.

Buy Now
Questions 54

Which of the following BEST describes standard deviation?

Options:

A.

A measure that is used to establish a relationship between two variables

B.

A measure of how data is distributed

C.

A measure of the amount of dispersion of a set of values

D.

A measure that is used to find the significant difference between variables

Buy Now
Questions 55

Which of the following contains alphanumeric values?

Options:

A.

10.1Ε²

B.

13.6

C.

1347

D.

A3J7

Buy Now
Questions 56

Which of the following BEST describes the issue in which character values are mixed with integer values in a data set column?

Options:

A.

Duplicate data

B.

Missing data

C.

Data outliers

D.

Invalid data type

Buy Now
Questions 57

A database consists of one fact table that is composed of multiple dimensions. Each dimension is represented by a denormalized table. This structure is an example of a:

Options:

A.

non-relational schema.

B.

galaxy schema.

C.

snowflake schema.

D.

star schema.

Buy Now
Questions 58

Which of the following will MOST likely be streamed live?

Options:

A.

Machine data

B.

Key-value pairs

C.

Delimited rows

D.

Flat files

Buy Now
Questions 59

An analyst has been tracking company intranet usage and has been asked to create a chat to show the most-used/most-clicked portions of a homepage that contains more than 30 links. Which of the following visualizations would BEST illustrate this information?

Options:

A.

Scatter plot

B.

Heat map

C.

Pie chart

D.

Infographic

Buy Now
Questions 60

An analyst notices changes in sales ratios when analyzing a quarterly report. Which of the following is the analyst conducting?

Options:

A.

A gap analysis

B.

A link analysis

C.

A trend analysis

D.

A statistical analysis

Buy Now
Questions 61

What would be an example of an acceptable form of primary identification for the Data+ exam?

Options:

A.

Passport.

B.

School ID card.

C.

Employee ID card.

D.

Credit card with photo and signature.

Buy Now
Questions 62

Which of the following best describes the law of large numbers?

Options:

A.

As a sample size decreases, its standard deviation gets closer to the average of the whole population.

B.

As a sample size grows, its mean gets closer to the average of the whole population

C.

As a sample size decreases, its mean gets closer to the average of the whole population.

D.

When a sample size doubles. the sample is indicative of the whole population.

Buy Now
Questions 63

Given the following grocery store orders:

If a query is made to the table with the following logic:

Order_Total > 132 OR (Order Total >= 25 AND Order_Total < 74)

Which of the following is the number of orders that will be returned by the query?

Options:

A.

Four

B.

Five

C.

Six

D.

Seven

Buy Now
Questions 64

A data analyst needs to create a weekly recurring report on sales performance and distribute it to all sales managers. Which of the following would be the BEST method to automate and ensure successful delivery for this task?

Options:

A.

Use scheduled report delivery.

B.

Implement subscription access delivery.

C.

Print out a copy.

D.

Upload the report to the server.

Buy Now
Questions 65

Which of the following value is the measure of dispersion "range" between the scores of ten students in a test.

The scores of ten students in a test are 17, 23, 30, 36, 45, 51, 58, 66, 72, 77.

Options:

A.

90

B.

60

C.

70

D.

80

Buy Now
Questions 66

Which of the following is a KPI metric for tracking sales performance?

Options:

A.

Order status percentage

B.

Customer acquisition percentage

C.

Gross profit percentage

D.

Click-through rate percentage

Buy Now
Questions 67

Which of the following is the most likely reason for a data analyst to optimize a query using parameterization?

Options:

A.

To return a subset of records

B.

To insert a temporary table

C.

To prevent SQL injections

D.

To increase the query speed

Buy Now
Questions 68

The senior management team at a company receives a detailed sales report at the end of each quarter. The report is several pages long and includes data from dozens of offices across the country. The team wants a better way to get a quick snapshot of what is included in the report. Which of the following modifications would best meet this requirement?

Options:

A.

Modifying documentation elements to include reference data sources

B.

Modifying the font size and style so important data points are more visible

C.

Modifying the report to include a summary section with observations and insights

D.

Modifying the report layout so it is easier to follow and understand

Buy Now
Questions 69

Which of the following is the first step an analyst should perform upon receiving a business request for analysis?

Options:

A.

Determine the data needs and sources for analysis.

B.

Initiate the analysis for exploratory data analysis.

C.

Review the business questions to understand the scope.

D.

Finalize the methodology to solve the problem.

Buy Now
Questions 70

A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?

Options:

A.

A real-time monitor that allows the manager to view performance the day the campaign was launched

B.

A sell-service dashboard that allows the manager to look at the company's annual budget performance

C.

A spreadsheet of the raw data from all marketing campaigns and channels

D.

A summary with statistics, conclusions, and recommendations from the data analyst

Buy Now
Questions 71

Which of the following is a difference between a primary key and a unique key?

Options:

A.

A unique key cannot take null values, whereas a primary key can take null values.

B.

There can be only one primary key in a data set, whereas there can be multiple unique keys.

C.

A primary key can take a value more than once, whereas a unique key cannot take a value more than once.

D.

A primary key cannot be a date variable, whereas a unique key can be.

Buy Now
Questions 72

An analyst is currently working on a ticket for revamping a company-wide dashboard that has been in use for five years. Which of the following should be the first step in the development process?

Options:

A.

Talk to the group that made the request to determine the desired goal.

B.

Make changes to a frequently used report that is already in production.

C.

Build an additional dashboard with fewer views that are tailored toward each specific team.

D.

Develop a more streanMined dashboard to roll out by the next delivery date.

Buy Now
Questions 73

Which of the following is a domain-specific language used in programming that is designed for managing data that is held in a relational data stream management system?

Options:

A.

SAS

B.

SQL

C.

Python

D.

R

Buy Now
Questions 74

A development company is constructing a new unit in its apartment complex. The complex has the following floor plans:

Using the average cost per square foot of the original floor plans, which of the following should be the price of the Rose unit?

Options:

A.

$640,900

B.

$690,000

C.

$705,200

D.

$702,500

Buy Now
Questions 75

Which of the following should be accomplished NEXT after understanding a business requirement for a data analysis report?

Options:

A.

Rephrase the business requirement.

B.

Determine the data necessary for the analysis.

C.

Build a mock dashboard/presentation layout.

D.

Perform exploratory data analysis.

Buy Now
Questions 76

Given the following data table:

Which of the following are appropriate reasons to undertake data cleansing? (Select two).

Options:

A.

Non-parametric data

B.

Missing data

C.

Duplicate data

D.

Invalid data

E.

Redundant data

F.

Normalized data

Buy Now
Questions 77

Which of the following database schemas features normalized dimension tables?

Options:

A.

Flat

B.

Snowflake

C.

Hierarchical

D.

Star

Buy Now
Questions 78

A financial institution is reporting on sales performance to a company at the account level. Due to the sensitive nature of the government the does il with, some account information is not shown. Which of the following fields should be masked?

Options:

A.

Sales volume

B.

Start date

C.

Product name

D.

Customer name

Buy Now
Exam Code: DA0-001
Exam Name: CompTIA Data+ Certification Exam
Last Update: May 19, 2024
Questions: 262
$64  $159.99
$48  $119.99
$40  $99.99
buy now DA0-001