Amazon Exam AIF-C01 Topic 2 Question 26 Discussion

Actual exam question for Amazon's AIF-C01 exam

Question #: 26
Topic #: 2

[All AIF-C01 Questions]

Why does overfilting occur in ML models?

AThe training dataset does not reptesent all possible input values.

BThe model contains a regularization method.

CThe model training stops early because of an early stopping criterion.

DThe training dataset contains too many features.

Show Suggested Answer

Suggested Answer: A

Overfitting occurs when an ML model learns the training data too well, including noise and patterns that do not generalize to new data. A key cause of overfitting is when the training dataset does not represent all possible input values, leading the model to over-specialize on the limited data it was trained on, failing to generalize to unseen data.

Exact Extract from AWS AI Documents:

From the Amazon SageMaker Developer Guide:

'Overfitting often occurs when the training dataset is not representative of the broader population of possible inputs, causing the model to memorize specific patterns, including noise, rather than learning generalizable features.'

(Source: Amazon SageMaker Developer Guide, Model Evaluation and Overfitting)

Detailed

Option A: The training dataset does not represent all possible input values.This is the correct answer. If the training dataset lacks diversity and does not cover the range of possible inputs, the model overfits by learning patterns specific to the training data, failing to generalize.

Option B: The model contains a regularization method.Regularization methods (e.g., L2 regularization) are used to prevent overfitting, not cause it. This option is incorrect.

Option C: The model training stops early because of an early stopping criterion.Early stopping is a technique to prevent overfitting by halting training when performance on a validation set degrades. It does not cause overfitting.

Option D: The training dataset contains too many features.While too many features can contribute to overfitting (e.g., by increasing model complexity), this is less directly tied to overfitting than a non-representative dataset. The dataset's representativeness is the primary cause.

Amazon SageMaker Developer Guide: Model Evaluation and Overfitting (https://docs.aws.amazon.com/sagemaker/latest/dg/model-evaluation.html)

AWS AI Practitioner Learning Path: Module on Model Performance and Evaluation

AWS Documentation: Understanding Overfitting (https://aws.amazon.com/machine-learning/)

by Vanna at Jul 08, 2025, 01:17 PM

Limited Time Offer

25%

Off

Get Premium AIF-C01 Questions as Interactive Web-Based Practice Test or PDF

Contribute your Thoughts:

Submit Cancel

Currently there are no comments in this discussion, be the first to comment!