Remove an H100/H200 GPU and heat sink module

Follow the instructions in this section to remove an H100/H200 GPU and heat sink module. The procedure must be performed by a trained technician.

About this task

Attention
  • Read Installation Guidelines and Safety inspection checklist to ensure that you work safely.
  • Power off the server and peripheral devices and disconnect the power cords and all external cables. See Power off the server.
  • Two people and one on-site lifting device that can support up to 400 lb (181 kg) are required to perform this procedure. If you do not already have a lifting device available, Lenovo offers the Genie Lift GL-8 material lift that can be purchased at Data Center Solution Configurator. Make sure to include the Foot-release brake and the Load Platform when ordering the Genie Lift GL-8 material lift.
  • Make sure to inspect the connectors and sockets on the GPU and the GPU baseboard. Do not use the GPU or the GPU baseboard if its connectors are damaged or missing, or if there is debris in the sockets. Replace the GPU or the GPU baseboard with a new one before continuing the procedure.
  • The GPU and heat sink are a single part. Do not remove the heat sink from the GPU.
  • The following table shows the mapping information about the physical GPU sockets, slot numbering in XCC, and module IDs in nvidia-smi.


    Physical GPU socket    Slot numbering in XCC    Module ID in nvidia-smi
    SXM 1                  Slot 21                  1
    SXM 2                  Slot 24                  2
    SXM 3                  Slot 22                  3
    SXM 4                  Slot 23                  4
    SXM 5                  Slot 17                  5
    SXM 6                  Slot 20                  6
    SXM 7                  Slot 18                  7
    SXM 8                  Slot 19                  8
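When a fault is reported by XCC slot number, it can help to translate that number into the physical SXM socket before opening the shuttle. The following is a minimal sketch that encodes the table above as a lookup; the names `GPU_SOCKET_MAP` and `socket_for_xcc_slot` are hypothetical and are not part of any Lenovo or NVIDIA tool.

```python
# Values copied from the mapping table above:
# physical SXM socket -> (XCC slot number, module ID in nvidia-smi).
GPU_SOCKET_MAP = {
    1: (21, 1),
    2: (24, 2),
    3: (22, 3),
    4: (23, 4),
    5: (17, 5),
    6: (20, 6),
    7: (18, 7),
    8: (19, 8),
}


def socket_for_xcc_slot(slot):
    """Return the physical SXM socket number for a given XCC slot number."""
    for socket, (xcc_slot, _module_id) in GPU_SOCKET_MAP.items():
        if xcc_slot == slot:
            return socket
    raise ValueError(f"XCC slot {slot} is not a GPU slot in the mapping table")


if __name__ == "__main__":
    # Example: a fault reported against XCC slot 17.
    print(f"XCC slot 17 is physical socket SXM {socket_for_xcc_slot(17)}")
```

In this table the module ID in nvidia-smi matches the physical SXM socket number, so only the XCC slot number needs translating.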
Note
Make sure you have the required tools listed below available to properly replace the component:
  • Torque screwdriver that can be set to 0.6 newton-meters (5.3 inch-pounds)
  • Torx T15 extended bit (200 mm long)
  • H100/H200 jig
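If the torque screwdriver is graduated in only one unit, the two torque values given above can be cross-checked with a short conversion. This is a sketch based on the standard definitions of the inch and the pound-force; the function name `nm_to_inlb` is hypothetical.

```python
# Standard definitions: 1 in = 0.0254 m, 1 lbf = 4.4482216152605 N.
METERS_PER_INCH = 0.0254
NEWTONS_PER_LBF = 4.4482216152605

# One inch-pound expressed in newton-meters (about 0.113 N·m).
NM_PER_INLB = METERS_PER_INCH * NEWTONS_PER_LBF


def nm_to_inlb(newton_meters):
    """Convert a torque from newton-meters to inch-pounds."""
    return newton_meters / NM_PER_INLB


if __name__ == "__main__":
    # 0.6 N·m is the screwdriver setting listed in the tools above.
    print(f"0.6 N·m = {nm_to_inlb(0.6):.1f} in-lb")
```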

Procedure

  1. Prepare for this task.
    1. Remove all the power supply units. See Remove a hot-swap power supply unit.
    2. Remove all the front fans. See Remove a front hot-swap fan.
    3. Remove all the 2.5-inch hot-swap drives and the drive bay fillers (if any) from the drive bays. See Remove a 2.5-inch hot-swap drive.
    4. Pull the 8U GPU shuttle out of the chassis, and place it onto the lift platform. See Remove the 8U GPU shuttle.
    5. Remove the power complex. See Remove the power complex.
    6. (GPU and heat sink module 2, 4, 5, and 7 only) Remove the GPU air duct. See Remove an H100/H200 GPU air duct.
  2. Remove the plastic cover from the GPU and heat sink module.
    Figure 1. Plastic cover removal
  3. Align the jig with the GPU heat sink and carefully install it onto the GPU heat sink.
    Figure 2. Jig installation
  4. Insert the torque screwdriver into the designated holes on the jig, and loosen the four Torx T15 screws in the sequence shown in the illustration below.
    Note
    Loosen the screws with a torque screwdriver set to the proper torque. For reference, the torque required to fully loosen the screws is 0.6 newton-meters (5.3 inch-pounds).
    Figure 3. Screw removal
  5. Remove the jig from the GPU heat sink.
    Figure 4. Jig removal
  6. Use both hands to lift the GPU and heat sink module out of the GPU baseboard.
    Figure 5. GPU and heat sink module removal
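The preparation in step 1 varies by module: the GPU air duct needs to be removed only for GPU and heat sink modules 2, 4, 5, and 7. The sketch below shows one way to generate the preparation checklist for a given module number; the names `AIR_DUCT_MODULES` and `preparation_steps` are hypothetical, and the step text paraphrases the procedure above.

```python
# Modules that require GPU air duct removal, per step 1 of the procedure.
AIR_DUCT_MODULES = {2, 4, 5, 7}


def preparation_steps(module):
    """Return the ordered preparation steps for removing the given module (1-8)."""
    if module not in range(1, 9):
        raise ValueError(f"module must be between 1 and 8, got {module}")
    steps = [
        "Remove all the power supply units",
        "Remove all the front fans",
        "Remove all the 2.5-inch hot-swap drives and drive bay fillers (if any)",
        "Pull the 8U GPU shuttle out of the chassis and place it onto the lift platform",
        "Remove the power complex",
    ]
    if module in AIR_DUCT_MODULES:
        steps.append("Remove the GPU air duct")
    return steps


if __name__ == "__main__":
    # Print a numbered checklist for module 4.
    for number, step in enumerate(preparation_steps(4), start=1):
        print(f"{number}. {step}")
```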

After you finish

If you are instructed to return the component or optional device, follow all packaging instructions, and use any packaging materials for shipping that are supplied to you.