Memory problems
See this section to resolve issues related to memory.
Displayed system memory is less than installed physical memory
Complete the following procedure to solve the problem.
- Make sure that:
No error LEDs are lit on the operator information panel.
No memory module error LEDs are lit on the system board.
Memory mirrored channel does not account for the discrepancy.
The memory modules are seated correctly.
You have installed the correct type of memory module (see PMEM rules for requirements).
After changing or replacing a memory module, memory configuration is updated accordingly in the Setup Utility.
All banks of memory are enabled. The server might have automatically disabled a memory bank when it detected a problem, or a memory bank might have been manually disabled.
There is no memory mismatch when the server is at the minimum memory configuration.
When PMEMs are installed:
Refer to PMEM rules and see if the displayed memory fits the mode description.
If the memory is set in App Direct Mode, all saved data has been backed up and all created namespaces have been deleted before any PMEM is replaced or added.
If the PMEMs were recently set to Memory Mode, switch them back to App Direct Mode and check whether any namespace has not been deleted.
Go to the Setup Utility, select
, and make sure security of all the PMEM units is disabled.
Reseat the memory modules, and then restart the server.
Check the POST error log:
If a memory module was disabled by a systems-management interrupt (SMI), replace the memory module.
If a memory module was disabled by the user or by POST, reseat the memory module; then, run the Setup Utility and enable the memory module.
Run memory diagnostics. When you start a solution and press F1, the LXPM interface is displayed by default. (For more information, see the “Startup” section in the LXPM documentation compatible with your server at Lenovo XClarity Provisioning Manager portal page.) You can perform memory diagnostics with this interface. From the Diagnostic page, go to or PMEM test.
Note: When PMEMs are installed, run diagnostics based on the mode that is currently set:
App Direct Mode:
Run Memory Test for DRAM memory modules.
Run PMEM Test for PMEMs.
Memory Mode:
Run both Memory Test and PMEM Test for PMEMs.
- Reverse the modules between the channels (of the same processor), and then restart the server. If the problem is related to a memory module, replace the failing memory module.
Note: When PMEMs are installed, only adopt this method in Memory Mode.
Re-enable all memory modules using the Setup Utility, and then restart the server.
(Trained technician only) Install the failing memory module into a memory module connector for processor 2 (if installed) to verify that the problem is not the processor or the memory module connector.
(Trained technician only) Replace the system board.
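The mirrored-channel check in the list above comes down to simple arithmetic. The following is a minimal sketch, assuming that full memory mirroring leaves half of the installed DRAM capacity visible and that no other mode (such as a PMEM mode) affects the displayed value. The function name `expected_visible_gib` and the mode names `mirror` and `independent` are hypothetical and are not part of any Lenovo tool.

```shell
#!/bin/sh
# Hypothetical helper: expected_visible_gib INSTALLED_GIB MODE
# MODE is "independent" (no mirroring) or "mirror" (full mirroring).
# Assumption: with full mirroring, half of the installed DRAM holds the
# mirror copy, so only the other half is displayed as system memory.
expected_visible_gib() {
    installed=$1
    mode=$2
    case "$mode" in
        mirror)      echo $(( installed / 2 )) ;;
        independent) echo "$installed" ;;
        *)           echo "unknown mode: $mode" >&2; return 1 ;;
    esac
}
```

For example, `expected_visible_gib 256 mirror` yields 128, so a server with 256 GiB installed that displays 128 GiB in mirrored mode is behaving as expected under this assumption.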
Multiple memory modules in a channel identified as failing
Complete the following procedure to solve the problem.
- Reseat the memory modules; then, restart the server.
- Remove the highest-numbered memory module of those that are identified and replace it with an identical known good memory module; then, restart the server. Repeat as necessary. If the failures continue after all identified memory modules are replaced, go to step 4.
- Return the removed memory modules, one at a time, to their original connectors, restarting the server after each memory module, until a memory module fails. Replace each failing memory module with an identical known good memory module, restarting the server after each memory module replacement. Repeat step 3 until you have tested all removed memory modules.
- Replace the highest-numbered memory module of those identified; then, restart the server. Repeat as necessary.
- Reverse the memory modules between the channels (of the same processor), and then restart the server. If the problem is related to a memory module, replace the failing memory module.
- (Trained technician only) Install the failing memory module into a memory module connector for processor 2 (if installed) to verify that the problem is not the processor or the memory module connector.
- (Trained technician only) Replace the system board.
Attempt to change to another PMEM mode fails
If the PMEM mode stays the same after you change it and the system restarts successfully, check whether the DRAM DIMM and PMEM capacities meet the requirements of the new mode (see PMEM rules).
Extra namespace appears in an interleaved region
If there are two created namespaces in one interleaved region, VMware ESXi ignores the created namespaces and creates an extra new namespace during system boot. Delete the created namespaces in either the Setup Utility or the operating system before booting ESXi for the first time.
Migrated PMEMs are not supported
- Intel Optane PMEM interleave set (DIMM X) is migrated from another system (Platform ID: 0x00), these migrated PMEMs are not supported nor warranted in this system.
- Move the modules back to the original system with the exact same configuration as the previous one.
- Back up stored data in PMEM namespaces.
- Disable PMEM security with one of the following options:
LXPM
Go to
, and input passphrase to disable security.
Setup Utility
Go to
, and input passphrase to disable security.
- Delete namespaces with the command that corresponds to the installed operating system:
Linux command:
ndctl destroy-namespace all -f
Windows PowerShell command:
Get-PmemDisk | Remove-PmemDisk
- Clear Platform Configuration Data (PCD) and Namespace Label Storage Area (LSA) with the following ipmctl command (for both Linux and Windows).
ipmctl delete -pcd
Note: See the following links to learn how to download and use ipmctl in different operating systems:
Start the server and press the key specified in the on-screen instructions. (For more information, see the “Startup” section in the LXPM documentation compatible with your server at Lenovo XClarity Provisioning Manager portal page.) The Lenovo XClarity Provisioning Manager interface is displayed by default.
- Power off the system.
- Remove the modules to be reused for a new system or configuration.
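On Linux, the namespace deletion and PCD clearing steps above can be chained in one script. The following is a minimal sketch that uses the two commands exactly as they appear in the steps. The `DRY_RUN` switch and the function names `run` and `erase_pmem_config` are hypothetical conveniences of this sketch. It assumes that data is already backed up and that PMEM security is already disabled.

```shell
#!/bin/sh
# Sketch of the Linux erase sequence described in the steps above.
# DRY_RUN is a hypothetical switch of this sketch: while it is 1 (the default),
# commands are only printed, so nothing is erased by accident.
DRY_RUN=${DRY_RUN:-1}

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

erase_pmem_config() {
    # Delete all namespaces (back up data and disable security first).
    run ndctl destroy-namespace all -f || return 1
    # Clear the PCD and the namespace LSA on the PMEMs.
    run ipmctl delete -pcd || return 1
}
```

Calling `erase_pmem_config` with the default setting only prints the two commands. Set `DRY_RUN=0` and run as root to execute them.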
Invalid memory population detected
- Invalid memory population (unsupported DIMM population) detected. Please verify memory configuration is valid.
- See Memory module installation rules and order to make sure the present memory module population sequence is supported.
- If the present sequence is indeed supported, see if any of the modules is displayed as “disabled” in Setup Utility.
- Reseat the module that is displayed as “disabled,” and reboot the system.
- If the problem persists, replace the memory module.
PMEMs installed in wrong slots after system board replacement
- DIMM X of Intel Optane PMEM persistent interleave set should be moved to DIMM Y.
- Record each PMEM slot change instruction from the XCC events.
- Power off the system, and remove the PMEMs that are mentioned in the warning messages. It is suggested to label these PMEMs to avoid confusion.
- Install the PMEMs in the correct slot number indicated in the warning messages. Remove the labels to avoid blocking airflow and cooling.
- Complete the replacement and power on the system. Make sure there are no similar warning messages in XCC.
After PMEMs are reconfigured, error messages and LEDs continue to indicate that PMEMs are installed in wrong slots
AC cycle the system (disconnect and reconnect AC power) or restart XCC to solve this problem.
Cannot create goal successfully when installing PMEMs to the system for the first time
- ERROR: Cannot retrieve memory resources info
- ERROR: One or more PMEM modules do not have PCD data. A platform reboot is recommended to restore valid PCD data.
- If the PMEMs have been installed in another system with stored data, perform the following steps to erase the data.
- Based on the original population order, install the PMEMs to the original system where they were installed previously, and back up the data from the PMEMs to other storage devices.
- Disable PMEM security with one of the following options:
LXPM
Go to
, and input passphrase to disable security.
Setup Utility
Go to
, and input passphrase to disable security.
- Delete namespaces with the command that corresponds to the installed operating system:
Linux command:
ndctl destroy-namespace all -f
Windows PowerShell command:
Get-PmemDisk | Remove-PmemDisk
- Clear Platform Configuration Data (PCD) and Namespace Label Storage Area (LSA) with the following ipmctl command (for both Linux and Windows).
ipmctl delete -pcd
Note: See the following links to learn how to download and use ipmctl in different operating systems:
- Install the PMEMs back to the target system, and upgrade system firmware to the latest version without entering Setup Utility.
If the problem persists, overwrite PMEMs with the following ndctl command.
ndctl sanitize-dimm --overwrite all
Monitor the overwrite status with the following command.
watch -n 1 "ipmctl show -d OverwriteStatus -dimm"
- When OverwriteStatus=Completed is shown for all PMEMs, reboot the system and check whether the problem persists.
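Instead of watching the status by eye, the completion check can be scripted. The following is a minimal sketch, assuming that `ipmctl show -d OverwriteStatus -dimm` prints one `OverwriteStatus=<value>` line per PMEM. Only the `OverwriteStatus=Completed` form is confirmed by the step above, and the rest of the output layout is an assumption. The function name `all_overwrites_completed` is hypothetical.

```shell
#!/bin/sh
# all_overwrites_completed: reads the output of
#   ipmctl show -d OverwriteStatus -dimm
# on stdin. Succeeds only if at least one OverwriteStatus line is present
# and every such line reports "Completed".
all_overwrites_completed() {
    awk -F= '
        /OverwriteStatus=/ {
            total++
            gsub(/[ \t\r]/, "", $2)      # strip whitespace around the value
            if ($2 == "Completed") done++
        }
        END { exit !(total > 0 && done == total) }
    '
}

# Example polling loop (requires ipmctl and root privileges):
#   until ipmctl show -d OverwriteStatus -dimm | all_overwrites_completed; do
#       sleep 1
#   done
```

Because the function only parses text, it can be checked against saved command output before it is used on a live system.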