Winter Sale Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: pass65

Exam Databricks-Machine-Learning-Associate All Questions
Exam Databricks-Machine-Learning-Associate All Questions

View all questions & answers for the Databricks-Machine-Learning-Associate exam

Databricks ML Data Scientist Databricks-Machine-Learning-Associate Question # 14 Topic 2 Discussion

Databricks-Machine-Learning-Associate Exam Topic 2 Question 14 Discussion:
Question #: 14
Topic #: 2

A data scientist has replaced missing values in their feature set with each respective feature variable’s median value. A colleague suggests that the data scientist is throwing away valuable information by doing this.

Which of the following approaches can they take to include as much information as possible in the feature set?


A.

Impute the missing values using each respective feature variable's mean value instead of the median value


B.

Refrain from imputing the missing values in favor of letting the machine learning algorithm determine how to handle them


C.

Remove all feature variables that originally contained missing values from the feature set


D.

Create a binary feature variable for each feature that contained missing values indicating whether each row's value has been imputed


E.

Create a constant feature variable for each feature that contained missing values indicating the percentage of rows from the feature that was originally missing


Get Premium Databricks-Machine-Learning-Associate Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.