Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

PMI-CPMAI Exam - Topic 5 Question 10 Discussion

Actual exam question for PMI's PMI-CPMAI exam
Question #: 10
Topic #: 5
[All PMI-CPMAI Questions]

A transportation company is preparing data for an AI model to optimize fleet management. The project team is working with large amounts of structured and unstructured data.

If the project manager avoids addressing the variety of data during preparation, what will be the result?

Show Suggested Answer Hide Answer
Suggested Answer: D

PMI-CPMAI explains that modern AI projects often work with high-volume, high-variety data, including both structured (tables, logs, telemetry) and unstructured formats (text, documents, images). A core principle in the data preparation and pipeline design stages is that ''variety must be explicitly addressed through normalization, harmonization, and feature extraction so that models receive coherent, compatible inputs.'' If the project manager ignores the variety dimension---treating all data as if it were homogeneous---this typically leads to misaligned schemas, inconsistent encodings, missing modalities, and improperly handled unstructured content.

The guidance notes that such issues ''manifest as degraded model performance, instability, and reduced generalizability, even when volume and velocity are adequately managed.'' In a fleet management context, failing to harmonize telematics, maintenance records, driver logs, and external data (e.g., traffic or weather) means the model cannot fully capture relevant patterns, and some signals may be effectively unusable or misleading. Rather than improving accuracy or consistency, skipping this work undermines the quality of features, increases noise, and introduces hidden biases.

As a result, PMI-CPMAI indicates that not addressing data variety during preparation will most directly lead to reduced model performance, because the model is trained and evaluated on incomplete, inconsistent, or poorly integrated representations of the underlying operational reality.


Contribute your Thoughts:

0/2000 characters
Susana
2 days ago
This reminds me of a practice question where we learned that not addressing data variety typically leads to reduced model performance. I think D is the right choice.
upvoted 0 times
...
Clement
7 days ago
I remember discussing how ignoring data variety can lead to issues, but I'm not sure if it directly affects processing speed or model performance.
upvoted 0 times
...

Save Cancel