Winter Sale Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: pass65

Pass the Databricks Data Analyst Databricks-Certified-Data-Analyst-Associate Questions and answers with ValidTests

Exam Databricks-Certified-Data-Analyst-Associate All Questions
Exam Databricks-Certified-Data-Analyst-Associate Premium Access

View all detail and faqs for the Databricks-Certified-Data-Analyst-Associate exam

Viewing page 1 out of 2 pages
Viewing questions 1-10 out of questions
Questions # 1:

The stakeholders.customers table has 15 columns and 3,000 rows of data. The following command is run:

Question # 1

After runningSELECT * FROM stakeholders.eur_customers, 15 rows are returned. After the command executes completely, the user logs out of Databricks.

After logging back in two days later, what is the status of thestakeholders.eur_customersview?

Options:

A.

The view remains available and SELECT * FROM stakeholders.eur_customers will execute correctly.

B.

The view has been dropped.

C.

The view is not available in the metastore, but the underlying data can be accessed with SELECT * FROM delta. `stakeholders.eur_customers`.

D.

The view remains available but attempting to SELECT from it results in an empty result set because data in views are automatically deleted after logging out.

E.

The view has been converted into a table.

Expert Solution
Questions # 2:

A data analyst is attempting to drop a table my_table. The analyst wants to delete all table metadata and data.

They run the following command:

DROP TABLE IF EXISTS my_table;

While the object no longer appears when they run SHOW TABLES, the data files still exist.

Which of the following describes why the data files still exist and the metadata files were deleted?

Options:

A.

The table's data was larger than 10 GB

B.

The table did not have a location

C.

The table was external

D.

The table's data was smaller than 10 GB

E.

The table was managed

Expert Solution
Questions # 3:

Delta Lake stores table data as a series of data files, but it also stores a lot of other information.

Which of the following is stored alongside data files when using Delta Lake?

Options:

A.

None of these

B.

Table metadata, data summary visualizations, and owner account information

C.

Table metadata

D.

Data summary visualizations

E.

Owner account information

Expert Solution
Questions # 4:

In which of the following situations will the mean value and median value of variable be meaningfully different?

Options:

A.

When the variable contains no outliers

B.

When the variable contains no missing values

C.

When the variable is of the boolean type

D.

When the variable is of the categorical type

E.

When the variable contains a lot of extreme outliers

Expert Solution
Questions # 5:

A data engineering team has created a Structured Streaming pipeline that processes data in micro-batches and populates gold-level tables. The microbatches are triggered every 10 minutes.

A data analyst has created a dashboard based on this gold level data. The project stakeholders want to see the results in the dashboard updated within 10 minutes or less of new data becoming available within the gold-level tables.

What is the ability to ensure the streamed data is included in the dashboard at the standard requested by the project stakeholders?

Options:

A.

A refresh schedule with an interval of 10 minutes or less

B.

A refresh schedule with an always-on SQL Warehouse (formerly known as SQL Endpoint

C.

A refresh schedule with stakeholders included as subscribers

D.

A refresh schedule with a Structured Streaming cluster

Expert Solution
Questions # 6:

A data analyst is processing a complex aggregation on a table with zero null values and their query returns the following result:

Which of the following queries did the analyst run to obtain the above result?

A)

B)

C)

D)

E)

Question # 6

Options:

A.

Option A

B.

Option B

C.

Option C

D.

Option D

E.

Option E

Expert Solution
Questions # 7:

Which statement describes descriptive statistics?

Options:

A.

A branch of statistics that uses a variety of data analysis techniques to infer properties of an underlying distribution of probability.

B.

A branch of statistics that uses summary statistics to categorically describe and summarize data.

C.

A branch of statistics that uses summary statistics to quantitatively describe and summarize data.

D.

A branch of statistics that uses quantitative variables that must take on a finite or countably infinite set of values.

Expert Solution
Questions # 8:

A data analyst has created a Query in Databricks SQL, and now they want to create two data visualizations from that Query and add both of those data visualizations to the same Databricks SQL Dashboard.

Which of the following steps will they need to take when creating and adding both data visualizations to the Databricks SQL Dashboard?

Options:

A.

They will need to alter the Query to return two separate sets of results.

B.

They will need to add two separate visualizations to the dashboard based on the same Query.

C.

They will need to create two separate dashboards.

D.

They will need to decide on a single data visualization to add to the dashboard.

E.

They will need to copy the Query and create one data visualization per query.

Expert Solution
Questions # 9:

What is a benefit of using Databricks SQL for business intelligence (Bl) analytics projects instead of using third-party Bl tools?

Options:

A.

Computations, data, and analytical tools on the same platform

B.

Advanced dashboarding capabilities

C.

Simultaneous multi-user support

D.

Automated alerting systems

Expert Solution
Questions # 10:

A data analyst is processing a complex aggregation on a table with zero null values and the query returns the following result:

Which query did the analyst execute in order to get this result?

A)

B)

C)

D)

Question # 10

Options:

A.

Option A

B.

Option B

C.

Option C

D.

Option D

Expert Solution
Viewing page 1 out of 2 pages
Viewing questions 1-10 out of questions