A data scientist is working with a dataset containing customer information, including an 'Income' feature with a significant number of missing values. The distribution of the 'Income' feature is heavily right-skewed. Which of the following methods for handling missing data is most appropriate in this situation to minimize the impact of outliers?
-
A
Deleting the rows with missing income values.
-
B
Imputing the missing values with the mean of the 'Income' feature.
-
C
Imputing the missing values with the median of the 'Income' feature.
-
D
Replacing missing values with a constant like zero.