Here you can find all the free questions related to the NVIDIA AI Infrastructure (NCP-AII) exam. This page also links to recently updated premium files that you can use to practice for the actual NVIDIA AI Infrastructure exam. The premium versions are provided as NCP-AII practice tests, both as desktop software and as a browser-based application, so you can use whichever suits your style. Feel free to try the AI Infrastructure exam premium files for free. Good luck with your NVIDIA AI Infrastructure exam.
Question No: 1
MultipleChoice
An infrastructure engineer in an AI factory has successfully replaced a power supply unit on an NVIDIA DGX H100. After installation, both the IN and OUT LEDs on the new power supply illuminate solid green. Which NVSM CLI command should the engineer use to quickly verify the overall system status and ensure it is operating as expected?
Options
Answer: C
Explanation
NVIDIA System Management (NVSM) is the primary CLI utility for monitoring the health of DGX platforms. Replacing a power supply unit (PSU) is a common maintenance task, but the engineer must still verify that the new component is correctly integrated into the system's health model. nvsm show power would provide specific data regarding wattage and voltage for the PSU, but the most comprehensive way to confirm that the replacement has not caused secondary issues, and that the system is no longer in a 'Degraded' state, is to run nvsm show health. This command performs a global check across all subsystems: GPUs, NVLink switches, storage, fans, and power. If the PSU replacement was successful and the system is back to full redundancy, nvsm show health will report a 'Healthy' status.
In an AI factory setting, where DGX H100 nodes draw significant power, solid green LEDs are not enough on their own. All six PSUs (in an N+N or N+1 configuration) must also be logically acknowledged by the Baseboard Management Controller (BMC) to prevent unexpected shutdowns during high-load training iterations.
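The check described above can be scripted for post-maintenance runbooks. The following is a minimal sketch, not an official NVSM integration. It assumes the report contains a summary line shaped like "N out of M checks are healthy"; that pattern, the helper names, and the sample text are all illustrative assumptions and should be adjusted to match the output of your NVSM version.

```python
import re
import subprocess

# Assumption: the report contains a summary line shaped like
# "N out of M checks are healthy". Adjust the pattern to your NVSM version.
SUMMARY_RE = re.compile(r"(\d+)\s+out of\s+(\d+)\s+checks are healthy", re.IGNORECASE)


def parse_health_summary(report: str):
    """Return (healthy, total) from the summary line, or None if it is absent."""
    match = SUMMARY_RE.search(report)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def all_checks_healthy(report: str) -> bool:
    """True only when a summary line exists and every check passed."""
    summary = parse_health_summary(report)
    return summary is not None and summary[0] == summary[1]


def run_nvsm_health() -> str:
    """Run the command named in the explanation (usually needs root on a DGX)."""
    result = subprocess.run(
        ["nvsm", "show", "health"], capture_output=True, text=True, check=False
    )
    return result.stdout


# Illustrative sample text, not captured from a real system.
SAMPLE_REPORT = "Health Summary\n--------------\n205 out of 205 checks are healthy\n"
```

On a DGX node, `all_checks_healthy(run_nvsm_health())` would then give a single pass/fail signal after the PSU swap. A missing summary line is treated as a failure, so an unexpected output format does not pass silently.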
Question No: 2
MultipleChoice
A systems administrator is preparing a new DGX server for deployment. What is the most secure approach to configuring the BMC port during initial setup?
Options
Answer: D
Explanation
The Baseboard Management Controller (BMC) allows total control over the DGX system, including the ability to flash firmware, cycle power, and access the serial console. This makes it a high-value target for attackers. The secure approach (Option D) involves two critical layers:
Network Isolation: The BMC port should never be exposed to the public internet (Option A) or even the general production network (Option B). It must reside on a dedicated Out-of-Band (OOB) network that is firewalled and accessible only to authorized administrators.
Credential Management: Any factory-default or well-known credentials must be changed immediately upon first access. As part of the DGX first-boot wizard, the system prompts the administrator to create a strong, unique password for the primary user, which is then synchronized to the BMC.
Leaving the port disconnected (Option C) is impractical for modern data center operations, because the BMC is required for remote monitoring and 'headless' deployment. Following the isolated, firewalled approach keeps the AI factory resilient against both external attacks and internal lateral movement.
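The network-isolation rule can be sanity-checked in an inventory script. The sketch below uses Python's standard ipaddress module and assumes a hypothetical OOB management subnet (10.10.0.0/24); the subnet and the function name are illustrative only and are not taken from NVIDIA documentation.

```python
import ipaddress

# Hypothetical OOB management subnet; replace with your site's ranges.
OOB_SUBNETS = [ipaddress.ip_network("10.10.0.0/24")]


def bmc_address_is_isolated(bmc_ip: str, oob_subnets=OOB_SUBNETS) -> bool:
    """Return True if the BMC address is non-public and inside an OOB subnet."""
    addr = ipaddress.ip_address(bmc_ip)
    # A publicly routable BMC address violates the isolation rule outright.
    if addr.is_global:
        return False
    # A private address on the general production network is also rejected:
    # the address must belong to a dedicated management subnet.
    return any(addr in net for net in oob_subnets)
```

A check like this only validates addressing. It does not prove that firewall rules are in place, so it complements a review of the OOB network's access controls rather than replacing it.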
Question No: 3
MultipleChoice
A cluster administrator needs to validate transceiver firmware versions across 200 ports using UFM. Which GUI-based method provides a consolidated view?
Options
Answer: A
Explanation
Managing a large-scale AI fabric requires centralized visibility into the physical layer. The NVIDIA Unified Fabric Manager (UFM) provides a comprehensive dashboard for InfiniBand networks. To check transceiver firmware, which is critical for ensuring feature parity and stability across the fabric, the administrator can use the UFM Enterprise GUI. Navigating to the 'Devices' section, selecting a specific switch, and opening the 'Cables' tab aggregates telemetry for every occupied port. This view displays the manufacturer, part number, and firmware version of each transceiver (LinkX) or Active Optical Cable (AOC). This consolidated view is far more efficient than manual CLI queries (Option C) for 200+ ports. Maintaining uniform firmware across transceivers ensures that optimizations like Adaptive Routing and Congestion Control perform consistently across the entire 400G or 200G fabric.
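Once the cable data is available outside the GUI (for example, copied into a spreadsheet), the uniformity check itself is simple to automate. The sketch below assumes each port is a dictionary with hypothetical keys 'port' and 'fw_version'; these names and the sample values are illustrative and do not reflect an actual UFM export schema.

```python
from collections import Counter


def find_firmware_outliers(ports):
    """Return (majority_version, outlier_ports) for a list of port records.

    Each record is a dict with hypothetical keys 'port' and 'fw_version'.
    """
    counts = Counter(p["fw_version"] for p in ports)
    if not counts:
        return None, []
    # Treat the most common version as the fabric baseline.
    majority, _ = counts.most_common(1)[0]
    outliers = [p["port"] for p in ports if p["fw_version"] != majority]
    return majority, outliers


# Illustrative records, not real telemetry.
SAMPLE_PORTS = [
    {"port": "sw1/1", "fw_version": "1.0.0"},
    {"port": "sw1/2", "fw_version": "1.0.0"},
    {"port": "sw1/3", "fw_version": "0.9.0"},
    {"port": "sw1/4", "fw_version": "1.0.0"},
]
```

Using the most common version as the baseline is a design choice. If the site mandates a specific target version, compare against that value directly instead.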