New Year Special Limited Time Flat 70% Discount offer - Ends in 0d 00h 00m 00s - Coupon code: 70spcl

Databricks Databricks-Certified-Data-Analyst-Associate Databricks Certified Data Analyst Associate Exam Exam Practice Test

Page: 1 / 5
Total 45 questions

Databricks Certified Data Analyst Associate Exam Questions and Answers

Question 1

How can a data analyst determine if query results were pulled from the cache?

Options:

A.

Go to the Query History tab and click on the text of the query. The slideout shows if the results came from the cache.

B.

Go to the Alerts tab and check the Cache Status alert.

C.

Go to the Queries tab and click on Cache Status. The status will be green if the results from the last run came from the cache.

D.

Go to the SQL Warehouse (formerly SQL Endpoints) tab and click on Cache. The Cache file will show the contents of the cache.

E.

Go to the Data tab and click Last Query. The details of the query will show if the results came from the cache.

Question 2

A data analyst has created a user-defined function using the following line of code:

CREATE FUNCTION price(spend DOUBLE, units DOUBLE)

RETURNS DOUBLE

RETURN spend / units;

Which of the following code blocks can be used to apply this function to the customer_spend and customer_units columns of the table customer_summary to create column customer_price?

Options:

A.

SELECT PRICE customer_spend, customer_units AS customer_price FROM customer_summary

B.

SELECT price FROM customer_summary

C.

SELECT function(price(customer_spend, customer_units)) AS customer_price FROM customer_summary

D.

SELECT double(price(customer_spend, customer_units)) AS customer_price FROM customer_summary

E.

SELECT price(customer_spend, customer_units) AS customer_price FROM customer_summary

Question 3

A business analyst has been asked to create a data entity/object called sales_by_employee. It should always stay up-to-date when new data are added to the sales table. The new entity should have the columns sales_person, which will be the name of the employee from the employees table, and sales, which will be all sales for that particular sales person. Both the sales table and the employees table have an employee_id column that is used to identify the sales person.

Which of the following code blocks will accomplish this task?

A)

Question # 3

B)

Question # 3

C)

Question # 3

D)

Question # 3

Options:

A.

Option

B.

Option

C.

Option

D.

Option

Question 4

A data analyst is processing a complex aggregation on a table with zero null values and their query returns the following result:

Question # 4

Which of the following queries did the analyst run to obtain the above result?

A)

Question # 4

B)

Question # 4

C)

Question # 4

D)

Question # 4

E)

Question # 4

Options:

A.

Option A

B.

Option B

C.

Option C

D.

Option D

E.

Option E

Question 5

Which of the following statements about a refresh schedule is incorrect?

Options:

A.

A query can be refreshed anywhere from 1 minute lo 2 weeks

B.

Refresh schedules can be configured in the Query Editor.

C.

A query being refreshed on a schedule does not use a SQL Warehouse (formerly known as SQL Endpoint).

D.

A refresh schedule is not the same as an alert.

E.

You must have workspace administrator privileges to configure a refresh schedule

Question 6

Delta Lake stores table data as a series of data files, but it also stores a lot of other information.

Which of the following is stored alongside data files when using Delta Lake?

Options:

A.

None of these

B.

Table metadata, data summary visualizations, and owner account information

C.

Table metadata

D.

Data summary visualizations

E.

Owner account information

Question 7

After runningDESCRIBE EXTENDED accounts.customers;, the following was returned:

Question # 7

Now, a data analyst runs the following command:

DROP accounts.customers;

Which of the following describes the result of running this command?

Options:

A.

Running SELECT * FROM delta. `dbfs:/stakeholders/customers` results in an error.

B.

Running SELECT * FROM accounts.customers will return all rows in the table.

C.

All files with the .customers extension are deleted.

D.

The accounts.customers table is removed from the metastore, and the underlying data files are deleted.

E.

The accounts.customers table is removed from the metastore, but the underlying data files are untouched.

Question 8

Which of the following is an advantage of using a Delta Lake-based data lakehouse over common data lake solutions?

Options:

A.

ACID transactions

B.

Flexible schemas

C.

Data deletion

D.

Scalable storage

E.

Open-source formats

Question 9

A data analyst has recently joined a new team that uses Databricks SQL, but the analyst has never used Databricks before. The analyst wants to know where in Databricks SQL they can write and execute SQL queries.

On which of the following pages can the analyst write and execute SQL queries?

Options:

A.

Data page

B.

Dashboards page

C.

Queries page

D.

Alerts page

E.

SQL Editor page

Question 10

A data analyst has created a Query in Databricks SQL, and now they want to create two data visualizations from that Query and add both of those data visualizations to the same Databricks SQL Dashboard.

Which of the following steps will they need to take when creating and adding both data visualizations to the Databricks SQL Dashboard?

Options:

A.

They will need to alter the Query to return two separate sets of results.

B.

They will need to add two separate visualizations to the dashboard based on the same Query.

C.

They will need to create two separate dashboards.

D.

They will need to decide on a single data visualization to add to the dashboard.

E.

They will need to copy the Query and create one data visualization per query.

Question 11

Which of the following is a benefit of Databricks SQL using ANSI SQL as its standard SQL dialect?

Options:

A.

It has increased customization capabilities

B.

It is easy to migrate existingSQL queries to Databricks SQL

C.

It allows for the use of Photon's computation optimizations

D.

It is more performant than other SQL dialects

E.

It is more compatible with Spark's interpreters

Question 12

In which of the following situations will the mean value and median value of variable be meaningfully different?

Options:

A.

When the variable contains no outliers

B.

When the variable contains no missing values

C.

When the variable is of the boolean type

D.

When the variable is of the categorical type

E.

When the variable contains a lot of extreme outliers

Question 13

A data analyst has been asked to use the below tablesales_tableto get the percentage rank of products within region by the sales:

Question # 13

The result of the query should look like this:

Question # 13

Which of the following queries will accomplish this task?

A)

Question # 13

B)

Question # 13

C)

Question # 13

D)Question # 13

Options:

A.

Option A

B.

Option B

C.

Option C

D.

Option D

Page: 1 / 5
Total 45 questions