Reliability, availability, and serviceability features

Three of the most important features in compute node design are reliability, availability, and serviceability (RAS). These RAS features help to ensure the integrity of the data that is stored in the compute node, the availability of the compute node when you need it, and the ease with which you can diagnose and correct problems.

The compute node has the following RAS features:

  • Advanced Configuration and Power Interface (ACPI)
  • Automatic server restart (ASR)
  • Built-in diagnostics using DSA Preboot
  • Built-in monitoring for temperature, voltage, and hard disk drives
  • Customer support center 24 hours per day, 7 days a week¹
  • Customer upgrade of flash ROM-resident code and diagnostics
  • Customer-upgradeable Unified Extensible Firmware Interface (UEFI) code and diagnostics
  • ECC-protected DDR4 DIMMs
  • ECC protection on the L2 cache
  • Error codes and messages
  • Integrated management module II (IMM2)
  • Light path diagnostics
  • Memory parity testing
  • Microprocessor built-in self-test (BIST) during power-on self-test (POST)
  • Microprocessor serial number access
  • Processor presence detection
  • ROM-resident diagnostics
  • System-error logging
  • Vital product data (VPD) on memory
  • Wake on LAN capability
  • Wake on PCI (PME) capability
¹ Service availability varies by country. Response time varies depending on the number and nature of incoming calls.
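
As an illustration of the Wake on LAN capability listed above, the following is a minimal Python sketch of how a management workstation might build and send a standard Wake on LAN "magic packet" (six 0xFF bytes followed by the target MAC address repeated 16 times). The function names, the example MAC address, the broadcast address, and UDP port 9 are assumptions chosen for illustration and do not come from this documentation. Whether the compute node responds also depends on firmware and network settings that are not described here.

```python
import socket


def build_magic_packet(mac: str) -> bytes:
    """Return the 102-byte Wake on LAN magic packet for a MAC address.

    Hypothetical helper: accepts MACs written with ':' or '-' separators.
    """
    hex_digits = mac.replace(":", "").replace("-", "")
    if len(hex_digits) != 12:
        raise ValueError(f"expected 12 hex digits, got {mac!r}")
    mac_bytes = bytes.fromhex(hex_digits)  # raises ValueError on non-hex input
    # Synchronization stream (6 x 0xFF), then the target MAC repeated 16 times.
    return b"\xff" * 6 + mac_bytes * 16


def send_magic_packet(mac: str,
                      broadcast: str = "255.255.255.255",
                      port: int = 9) -> None:
    """Broadcast the magic packet over UDP.

    The broadcast address and port are assumed defaults; adjust them
    for the network the compute node is attached to.
    """
    packet = build_magic_packet(mac)
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        # Permit sending to a broadcast address.
        sock.setsockopt(socket.SOL_SOCKET, socket.SO_BROADCAST, 1)
        sock.sendto(packet, (broadcast, port))
```

For example, `send_magic_packet("00:11:22:33:44:55")` would broadcast a wake request for the placeholder MAC address `00:11:22:33:44:55`. Substitute the MAC address of the network adapter that has Wake on LAN enabled.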