Three of the most important features in compute node design
are reliability, availability, and serviceability (RAS). These RAS
features help to ensure the integrity of the data that is stored in
the compute node, the availability of the compute node when you need
it, and the ease with which you can diagnose and correct problems.
The compute node has the following RAS features:
- Advanced Configuration and Power Interface
(ACPI)
- Automatic server restart (ASR)
- Built-in diagnostics using DSA Preboot
- Built-in monitoring for temperature, voltage, and hard disk drives
- Customer support center 24 hours per day, 7 days a week1
- Customer upgrade of flash ROM-resident code and diagnostics
- Customer-upgradeable Unified Extensible Firmware Interface (UEFI)
code and diagnostics
- ECC protected DDR4 DIMMs
- ECC protection on the L2 cache
- Error codes and messages
- Integrated management module II (IMM2)
- Light path diagnostics
- Memory parity testing
- Microprocessor built-in self-test (BIST) during power-on self-test
(POST)
- Microprocessor serial number access
- Processor presence detection
- ROM-resident diagnostics
- System-error logging
- Vital product data (VPD) on memory
- Wake
on LAN capability
- Wake on PCI (PME) capability