Winter Sale Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: pass65

Exam Databricks-Certified-Professional-Data-Engineer All Questions
Exam Databricks-Certified-Professional-Data-Engineer All Questions

View all questions & answers for the Databricks-Certified-Professional-Data-Engineer exam

Databricks Certification Databricks-Certified-Professional-Data-Engineer Question # 5 Topic 1 Discussion

Databricks-Certified-Professional-Data-Engineer Exam Topic 1 Question 5 Discussion:
Question #: 5
Topic #: 1

A Delta Lake table representing metadata about content posts from users has the following schema:

user_id LONG, post_text STRING, post_id STRING, longitude FLOAT, latitude FLOAT, post_time TIMESTAMP, date DATE

This table is partitioned by the date column. A query is run with the following filter:

longitude < 20 & longitude > -20

Which statement describes how data will be filtered?


A.

Statistics in the Delta Log will be used to identify partitions that might Include files in the filtered range.


B.

No file skipping will occur because the optimizer does not know the relationship between the partition column and the longitude.


C.

The Delta Engine will use row-level statistics in the transaction log to identify the flies that meet the filter criteria.


D.

Statistics in the Delta Log will be used to identify data files that might include records in the filtered range.


E.

The Delta Engine will scan the parquet file footers to identify each row that meets the filter criteria.


Get Premium Databricks-Certified-Professional-Data-Engineer Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.