New Year Sale 2026! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Microsoft DP-100 Exam - Topic 3 Question 2 Discussion

Actual exam question for Microsoft's DP-100 exam
Question #: 2
Topic #: 3
[All DP-100 Questions]

You are performing a filter based feature selection for a dataset 10 build a multi class classifies by using Azure Machine Learning Studio.

The dataset contains categorical features that are highly correlated to the output label column.

You need to select the appropriate feature scoring statistical method to identify the key predictors. Which method should you use?

Show Suggested Answer Hide Answer
Suggested Answer: D

Pearson's correlation statistic, or Pearson's correlation coefficient, is also known in statistical models as the r value. For any two variables, it returns a value that indicates the strength of the correlation

Pearson's correlation coefficient is the test statistics that measures the statistical relationship, or association, between two continuous variables. It is known as the best method of measuring the association between variables of interest because it is based on the method of covariance. It gives information about the magnitude of the association, or correlation, as well as the direction of the relationship.


https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/filter-based-feature-selection

https://www.statisticssolutions.com/pearsons-correlation-coefficient/

Contribute your Thoughts:

0/2000 characters
Antione
4 months ago
Wait, are we sure Chi-squared is the best choice?
upvoted 0 times
...
Ruby
4 months ago
Kendall correlation? Seems a bit off for this scenario.
upvoted 0 times
...
Mary
5 months ago
I thought Spearman correlation could work too, but not sure.
upvoted 0 times
...
Jettie
5 months ago
Totally agree, Chi-squared fits perfectly here.
upvoted 0 times
...
Apolonia
5 months ago
Chi-squared is the way to go for categorical features!
upvoted 0 times
...
Malinda
5 months ago
I practiced a similar question before, and I think Chi-squared is definitely the right choice for identifying key predictors in this case.
upvoted 0 times
...
Mona
5 months ago
I'm not entirely sure, but I feel like Spearman correlation might be more suited for ordinal data rather than categorical.
upvoted 0 times
...
Elsa
5 months ago
I remember studying feature selection methods, and I think Chi-squared is often used for categorical data.
upvoted 0 times
...
German
5 months ago
I have a vague memory of Kendall correlation being used for ranking, but I don't think it's the best fit here.
upvoted 0 times
...
Annabelle
5 months ago
Hmm, this question is asking about the limitations of the Customer Community License type. I'll need to think carefully about the key differences between this and other license types.
upvoted 0 times
...
Veta
5 months ago
I'm not sure about this one. The options seem pretty technical, so I'll have to guess and hope for the best.
upvoted 0 times
...
Rashad
5 months ago
Hmm, I'm not totally sure about this one. I'll have to think it through carefully. Maybe I should review my notes on the agile principles again.
upvoted 0 times
...

Save Cancel