Microsoft DP-100 Exam - Topic 4 Question 128 Discussion

Actual exam question for Microsoft's DP-100 exam

Question #: 128
Topic #: 4

You are building a binary classification model by using a supplied training set.

The training set is imbalanced between two classes.

You need to resolve the data imbalance.

What are three possible ways to achieve this goal? Each correct answer presents a complete solution. NOTE: Each correct selection is worth one point.

A. Penalize the classification.

B. Resample the data set using undersampling or oversampling.

C. Generate synthetic samples in the minority class.

D. Use accuracy as the evaluation metric of the model.

E. Normalize the training feature set.

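Suggested Answer: A, B, C

Accuracy (D) is an evaluation metric rather than a remedy, and it is misleading on imbalanced data. Normalization (E) rescales features and does not change the class proportions.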

https://machinelearningmastery.com/tactics-to-combat-imbalanced-classes-in-your-machine-learning-dataset/

by Maurine at Jul 08, 2025, 03:41 PM

Contribute your Thoughts:

Quentin

4 months ago

Wait, can resampling really fix the imbalance issue? Sounds too easy.

upvoted 0 times

...

Noelia

4 months ago

Using accuracy as a metric? That's a bad idea for imbalanced data.

upvoted 0 times

...

Myra

5 months ago

Generating synthetic samples is a great strategy!

upvoted 0 times

...

Patrick

5 months ago

I think penalizing the classification can help too.

upvoted 0 times

...

Annett

5 months ago

Resampling is a solid choice!

upvoted 0 times

...

Alysa

5 months ago

I feel like using accuracy as an evaluation metric isn't a good idea for imbalanced datasets, but I can't remember what we should use instead.

upvoted 0 times

...

Lili

5 months ago

Generating synthetic samples for the minority class sounds right, but I can't recall the exact method we discussed. Was it SMOTE or something else?

upvoted 0 times

...
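
For reference, SMOTE (Synthetic Minority Over-sampling Technique) is the usual name for option C. Below is a minimal pure-Python sketch of the core idea, assuming numeric features and Euclidean distance. The function `smote_like` and its parameters are hypothetical names chosen for illustration, not the API of any library, and the sketch is simplified relative to real implementations.

```python
import random

def smote_like(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours (simplified SMOTE)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority samples by squared Euclidean distance to base.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        # Pick a random position on the segment from base to neighbour.
        gap = rng.random()
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
    return synthetic

# Hypothetical toy minority class with two features.
minority = [[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3]]
new_points = smote_like(minority, n_new=6)
print(len(new_points))
```

Each synthetic point lies on a line segment between two real minority samples, so the new data stays inside the region the minority class already occupies instead of duplicating rows exactly.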

Lisha

6 months ago

I think resampling the dataset is definitely one way to handle imbalance, either by undersampling or oversampling. That seems familiar.

upvoted 0 times

...
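
As a rough illustration of option B, here is a minimal sketch of random oversampling and random undersampling using only the Python standard library. The function names and the toy data are assumptions made for this example. In practice, resampling is normally applied to the training split only, so that the test set keeps the real class distribution.

```python
import random

def random_oversample(majority, minority, seed=0):
    """Duplicate randomly chosen minority rows until both classes match in size."""
    rng = random.Random(seed)
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    return majority, minority + extra

def random_undersample(majority, minority, seed=0):
    """Keep a random subset of majority rows, the same size as the minority class."""
    rng = random.Random(seed)
    return rng.sample(majority, len(minority)), minority

# Hypothetical toy data: 90 majority rows and 10 minority rows (row ids only).
majority = list(range(90))
minority = list(range(100, 110))

over_major, over_minor = random_oversample(majority, minority)
under_major, under_minor = random_undersample(majority, minority)
print(len(over_major), len(over_minor))    # 90 90
print(len(under_major), len(under_minor))  # 10 10
```

Oversampling keeps every row but repeats minority rows, which can encourage overfitting. Undersampling avoids repeats but throws away majority rows.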

Lisandra

6 months ago

I remember we talked about penalizing the classification to help with imbalanced data, but I'm not sure if that's a complete solution.

upvoted 0 times

...

Luz

I'm a little confused by this question. Is normalizing the feature set (option E) really a way to handle class imbalance? I was thinking that would be more for improving model performance in general. I'm leaning towards options B and C, but I'll have to double-check my understanding before answering.

upvoted 0 times

...

Virgie

6 months ago

Okay, I've got this! The key here is to not use accuracy as the evaluation metric, since that can be misleading with imbalanced data. I'll go with options B and C - resampling and generating synthetic samples. That should help balance out the classes and give me a more reliable model.

upvoted 0 times

...

Barbra

6 months ago

Hmm, I'm a bit unsure about this one. I know we need to address the class imbalance, but I'm not sure if penalizing the classification (option A) is a good approach. I'll have to think more about the pros and cons of the different options before deciding.

upvoted 0 times

...

Una

6 months ago

This looks like a straightforward question on handling imbalanced datasets. I think I'll go with options B and C - resampling the data using under/oversampling, and generating synthetic samples for the minority class. Those seem like the most common and effective techniques for this problem.

upvoted 0 times

...

Johna

8 months ago

Wait, did they just throw in a completely irrelevant option just to mess with us? D, really? Accuracy? What is this, amateur hour?

upvoted 0 times

...

Javier

8 months ago

A, penalizing the classification? Sounds like a good idea, but I'm not sure how that would work in practice. Hmm, maybe I need to read up on that one.

upvoted 0 times
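
On how penalizing works in practice: one common reading of option A is cost-sensitive training, where errors on the minority class are weighted more heavily in the loss. The sketch below assumes the "balanced" heuristic, weight = n_samples / (n_classes * class_count), and a weighted log loss. The helper names and the toy data are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter

def balanced_class_weights(labels):
    """Weight each class inversely to its frequency."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {c: n_samples / (n_classes * cnt) for c, cnt in counts.items()}

def weighted_log_loss(labels, probs, weights):
    """Log loss where each example counts in proportion to its class weight.
    probs[i] is the predicted probability that example i belongs to class 1."""
    total, norm = 0.0, 0.0
    for y, p in zip(labels, probs):
        w = weights[y]
        total += -w * math.log(p if y == 1 else 1.0 - p)
        norm += w
    return total / norm

# Hypothetical toy labels: 90 negatives and 10 positives.
labels = [0] * 90 + [1] * 10
weights = balanced_class_weights(labels)

# A model that always predicts a low probability for class 1.
probs = [0.1] * 100
unweighted = weighted_log_loss(labels, probs, {0: 1.0, 1: 1.0})
weighted = weighted_log_loss(labels, probs, weights)
print(weights[1] / weights[0], weighted > unweighted)
```

With these weights, a mistake on a minority example costs about nine times as much as a mistake on a majority example, so a model that ignores the minority class is no longer cheap to train.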

Helga

7 months ago

C) Generate synthetic samples in the minority class.

upvoted 0 times

...

Lewis

7 months ago

B) Resample the data set using under sampling or oversampling

upvoted 0 times

...

Rosendo

7 months ago

A) Penalize the classification

upvoted 0 times

...

Elvis

8 months ago

E? Normalize the features? What is this, a trick question? That's got nothing to do with class imbalance.

upvoted 0 times

Ernest

7 months ago

A) Penalize the classification

upvoted 0 times

...

Stephanie

9 months ago

I believe generating synthetic samples in the minority class could also be a good solution.

upvoted 0 times

...

Lucina

9 months ago

Definitely not D. Accuracy is a terrible metric for imbalanced data. You gotta use something like F1-score or area under the ROC curve.

upvoted 0 times
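
To make the point about accuracy concrete, here is a small pure-Python sketch comparing accuracy with F1 for a classifier that always predicts the majority class. The 95/5 split and the function names are assumptions made for this example.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are 0 or undefined
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Hypothetical imbalanced labels: 95 negatives, 5 positives.
y_true = [0] * 95 + [1] * 5
# A useless classifier that always predicts the majority class.
y_pred = [0] * 100

acc = accuracy(y_true, y_pred)
f1 = f1_score(y_true, y_pred)
print(acc)  # 0.95
print(f1)   # 0.0
```

The classifier never finds a single positive example, yet accuracy still reports 95%, which is why option D does not address the imbalance.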

Delmy

8 months ago

C) Generate synthetic samples in the minority class.

upvoted 0 times

...

Willow

8 months ago

B) Resample the data set using under sampling or oversampling

upvoted 0 times

...

Tresa

8 months ago

A) Penalize the classification

upvoted 0 times

...

Nickolas

9 months ago

B and C are the way to go! Oversampling and synthetic samples are classic techniques for imbalanced datasets.

upvoted 0 times

Raelene

9 months ago

Using accuracy as the evaluation metric may not be suitable for imbalanced datasets.

upvoted 0 times

...

Felicitas

9 months ago

Penalizing the classification can also help in balancing the classes.

upvoted 0 times

...

Deangelo

9 months ago

I agree, oversampling and generating synthetic samples are effective methods for handling imbalanced data.

upvoted 0 times

...

Paris

9 months ago

I agree with Linsey. Resampling the data set can help balance the classes.

upvoted 0 times

...

Linsey

10 months ago

I think we should resample the data set using under sampling or oversampling.

upvoted 0 times

...
