Databricks Certified Data Analyst Associate Exam - Topic 3 Question 51 Discussion

Actual exam question for Databricks's Databricks Certified Data Analyst Associate exam
Question #: 51
Topic #: 3

In which circumstance will there be a substantial difference between the variable's mean and median values?

Suggested Answer: D

The mean is sensitive to extreme values, often called outliers, because every value contributes to the sum; a few very large or very small values can pull the average far from the center of the data. The median is a measure of central tendency that resists such outliers because it depends only on the middle value(s) of the ordered data, not on how far the extremes lie from the rest. Therefore, when a variable contains many extreme outliers, particularly when they fall mostly on one side of the distribution, there will be a substantial difference between the mean and the median. According to Databricks data analysis materials, this is a fundamental concept when choosing summary statistics for reporting.
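A minimal Python sketch can illustrate the effect described above. The numbers are made up for illustration (they are not from the exam or from Databricks material), and `summarize` is a hypothetical helper built on the standard-library `statistics` module.

```python
from statistics import mean, median

# Hypothetical data: five typical values, then the same values plus one extreme outlier.
typical = [48, 50, 51, 52, 49]
with_outlier = typical + [1000]


def summarize(values):
    """Return (mean, median) for a list of numbers."""
    return mean(values), median(values)


typical_mean, typical_median = summarize(typical)
outlier_mean, outlier_median = summarize(with_outlier)

# Without the outlier, mean and median coincide.
print(f"typical:      mean={typical_mean:.2f}, median={typical_median:.2f}")
# With the outlier, the mean jumps while the median barely moves.
print(f"with outlier: mean={outlier_mean:.2f}, median={outlier_median:.2f}")
```

In this sketch a single extreme value moves the mean from 50 to roughly 208, while the median only shifts from 50 to 50.5, which is why the median is often preferred for reporting skewed data.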


Contribute your Thoughts:

Antonio
17 days ago
I'm not entirely sure, but I feel like categorical variables wouldn't really have a mean or median, so A seems unlikely.
upvoted 0 times
Clarence
22 days ago
I remember practicing a question about means and medians, and it mentioned that outliers really affect the mean more than the median.
upvoted 0 times
Jaime
27 days ago
I think the mean and median can differ a lot when there are extreme outliers, so maybe D is the right answer?
upvoted 0 times