Skip to main content

Remove the H100/H200 GPU baseboard

Follow instructions in this section to remove the H100/H200 GPU baseboard. The procedure must be executed by a trained technician.

About this task

Attention
  • Read Installation Guidelines and Safety inspection checklist to ensure that you work safely.
  • Power off the server and peripheral devices and disconnect the power cords and all external cables. See Power off the server.
  • If the server is installed in a rack, remove the server from the rack. See Remove the server from rack.
  • Two people and one lifting device on site that can support up to 400 lb (181 kg) are required to perform this procedure. If you do not already have a lifting device available, Lenovo offers the Genie Lift GL-8 material lift that can be purchased at Data Center Solution Configurator. Make sure to include the Foot-release brake and the Load Platform when ordering the Genie Lift GL-8 material lift.
Note
Make sure you have the required tools listed below available to properly replace the component:
  • Torx T10 head screwdriver
  • Torx T15 head screwdriver
  • Phillips #1 head screwdriver
  • Phillips #2 head screwdriver
  • Flat head screwdriver
  • Alcohol cleaning pad
  • 2 x H100/H200 PCM Kit
  • 2 x SR780a V3 water loop putty pad kit
  • SR780a V3 water loop service kit
  • NVSwitch PCM Kit
  • NVSwitch putty pad Kit
  • GPU baseboard handles
Note
Make sure you have the required tools listed below available to properly replace the component:
  • Torque screwdriver which can be set to 0.6 newton-meters, 5.3 inch-pounds
The following illustration shows the GPU numbering and corresponding slot numbering in XCC.
Figure 1. GPU numbering
GPU numbering

Procedure

  1. Make preparation for this task.
    1. Remove the front top cover. See Remove the front top cover.
    2. Remove the rear top cover. See Remove the rear top cover.
    3. Remove the fan cage. See Remove the fan cage (trained technician only).
    4. Remove the CPU complex. See Remove the CPU complex.
    5. Remove the power complex. See Remove the power complex.
    6. Disconnect the cables from the GPU baseboard.
    7. Disconnect and remove the cables routed through the GPU complex, if necessary. Before disconnecting the cables, make a list of each cable and record the connectors the cable is connected to. Refer to Internal cable routing.
    8. Remove the rear H100/H200 GPU cold plate module. See Remove the rear H100/H200 GPU cold plate module.
    9. Remove the front H100/H200 GPU cold plate module. See Remove the front H100/H200 GPU cold plate module.
    10. Remove the NVswitch cold plate module. See Remove the NVSwitch cold plate module.
  2. Disengage the PCIe switch shuttle from the chassis.
    1. Press the two blue release latches.
    2. Rotate the two release levers until they are perpendicular to the PCIe switch shuttle.
    3. Pull the PCIe switch shuttle forward until it stops.
      Important
      Push the two release levers back until they lock into place after pulling out the PCIe switch shuttle to avoid damage.
      Figure 2. PCIe switch shuttle removal to stop position
      PCIe switch shuttle removal to stop position
  3. Unfasten the two M3 screws to remove the GPU connector protective bracket.
    Figure 3. Removing the GPU connector protective bracket
    Removing the GPU connector protective bracket
  4. Unfasten the seventeen Torx T15 captive screws on the GPU baseboard.
    Note
    Loosen or tighten the screws with a torque screwdriver set to the proper torque. For reference, the torque required for the screws to be fully loosen or tighten is 0.6 newton-meters, 5.3 inch-pounds.
    Figure 4. Screw removal
    Screw removal
  5. Remove the GPU complex.
    1. Press the button on the side of the handle.
    2. Adjust the handle to create space for screwdriver.
      Figure 5. Adjusting the handle
      Adjusting the handle
    3. Align the handles with the screw holes and lower them onto the GPU baseboard; then, fasten the five M3 screws (5 x M3, 0.5 newton-meters, 4.3 inch-pounds) to secure the handles to the GPU baseboard.
      Figure 6. Installing the handles
      Installing the handles
    4. Hold the two handles (1), and lift the GPU complex out of the chassis.
    Attention
    Make sure two people stand on either side of the GPU complex, and lift it by holding the two handles (1).
    Figure 7. Removing the GPU complex
    Removing the GPU complex
  6. Carefully lay the GPU complex on a flat, static protective surface; then, unfasten the five M3 screws that secure the handles to the baseboard. Lift the handles to remove them from the baseboard.
    Figure 8. Removing handles
    Removing handles
  7. Remove the GPUs from the GPU baseboard.
    1. Carefully lay the GPU complex on a flat, static protective surface.
    2. Unfasten the four Torx T15 screws in the sequence shown in the illustration below.
      Note
      Loosen the screws with a torque screwdriver set to the proper torque. For reference, the torque required for the screws to be fully loosen is 0.6 newton-meters, 5.3 inch-pounds.
    3. Carefully remove the GPU from the GPU baseboard.
      Figure 9. Removing the GPU
      Removing the GPU
    4. Repeat to remove all the GPUs.

After you finish

  1. Install a replacement unit. See Install the H100/H200 GPU baseboard.
  2. If you are instructed to return the component or optional device, follow all packaging instructions, and use any packaging materials for shipping that are supplied to you.