Independence Day Deal! Unlock 25% OFF Today – Limited-Time Offer - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

NVIDIA Exam NCA-AIIO Topic 3 Question 6 Discussion

Actual exam question for NVIDIA's NCA-AIIO exam
Question #: 6
Topic #: 3
[All NCA-AIIO Questions]

You have deployed an AI training job on a GPU cluster, but the training time has not decreased as expected after adding more GPUs. Upon further investigation, you observe that the GPU utilization is low, and the CPU utilization is very high. What is the most likely cause of this issue?

Show Suggested Answer Hide Answer
Suggested Answer: D

The data preprocessing being bottlenecked by the CPU is the most likely cause. High CPU utilization and low GPU utilization suggest the GPUs are idle, waiting for data, a common issue when preprocessing (e.g., data loading) is CPU-bound. NVIDIA recommends GPU-accelerated preprocessing (e.g., DALI) to mitigate this. Option A (model incompatibility) would show errors, not low utilization. Option B (connection issues) would disrupt communication, not CPU load. Option C (software version) is less likely without specific errors. NVIDIA's performance guides highlight preprocessing bottlenecks.


Contribute your Thoughts:

Jackie
1 days ago
The most likely cause is D) The data preprocessing is being bottlenecked by the CPU. Makes sense, if the CPU is struggling, it won't be able to feed the GPUs fast enough.
upvoted 0 times
...
Loise
5 days ago
But could it also be due to incorrect software version on the GPUs?
upvoted 0 times
...
Margurite
6 days ago
I agree with Tamala, high CPU utilization could be bottlenecking the data preprocessing.
upvoted 0 times
...
Tamala
9 days ago
I think the issue might be with the data preprocessing.
upvoted 0 times
...

Save Cancel