Memory problems
See this section to resolve issues related to memory.
Common memory problems
PMEM specific problems
Multiple memory modules identified as failing or disabled
Complete the following procedure to solve the problem.
- Check XCC log to see if one of the disabled modules is malfunctioning. If so, replace the malfunctioning module and start the system to see if all the modules are properly recognized.NoteWhen one memory module is malfunctioning, the system will execute memory downgrade to the closest supported combination, while some installed memory modules will not be recognized by the system.
- Reseat the memory modules; then, restart the server.
- Remove the highest-numbered memory module of those that are identified and replace it with an identical known good memory module; then, restart the server. Repeat as necessary. If the failures continue after all identified memory modules are replaced, go to step 4.
- Return the removed memory modules, one at a time, to their original connectors, restarting the server after each memory module, until a memory module fails. Replace each failing memory module with an identical known good memory module, restarting the server after each memory module replacement. Repeat step 3 until you have tested all removed memory modules.
- Replace the highest-numbered memory module of those identified; then, restart the server. Repeat as necessary.
- Reverse the memory modules between the channels (of the same processor), and then restart the server. If the problem is related to a memory module, replace the failing memory module.
- (Trained technician only) Install the failing memory module into a memory module connector for processor 2 (if installed) to verify that the problem is not the processor or the memory module connector.
- (Trained technician only) Replace the system board.
Displayed system memory is less than installed physical memory
Complete the following procedure to solve the problem.
- Make sure that:
No error LEDs are lit on the operator information panel.
No memory module error LEDs are lit on the system board.
Memory mirrored channel does not account for the discrepancy.
The memory modules are seated correctly.
You have installed the correct type of memory module (see PMEM rules for requirements).
After changing or replacing a memory module, memory configuration is updated accordingly in the Setup Utility.
All banks of memory are enabled. The server might have automatically disabled a memory bank when it detected a problem, or a memory bank might have been manually disabled.
There is no memory mismatch when the server is at the minimum memory configuration.
If PMEMs are installed:
Refer to PMEM rules and see if the displayed memory fits the mode description.
All the saved data have been backed up, and created namespaces are deleted before any PMEM is replaced or added.
Go to the Setup Utility, select
, and make sure security of all the PMEM units is disabled.
Reseat the memory modules, and then restart the server.
Check the POST error log:
If a memory module was disabled by a systems-management interrupt (SMI), replace the memory module.
If a memory module was disabled by the user or by POST, reseat the memory module; then, run the Setup utility and enable the memory module.
Run memory diagnostics. When you start a solution and press the key specified in the on-screen instructions, the Lenovo XClarity Provisioning Manager interface is displayed by default. You can perform memory diagnostics with this interface. From the Diagnostic page, go to or PMEM test.
NoteWhen PMEMs are installed, run diagnostics based on the mode that is set presently:App Direct Mode:
Run Memory Test for DRAM memory modules.
Run PMEM Test for PMEMs.
Reverse the modules between the channels (of the same processor), and then restart the server. If the problem is related to a memory module, replace the failing memory module.
Re-enable all memory modules using the Setup utility, and then restart the server.
(Trained technician only) Install the failing memory module into a memory module connector for processor 2 (if installed) to verify that the problem is not the processor or the memory module connector.
(Trained technician only) Replace the system board.
Invalid memory population detected
Invalid memory population (unsupported DIMM population) detected. Please verify memory configuration is valid.
- See Memory module installation rules and order to make sure the present memory module population sequence is supported.
- If the present sequence is indeed supported, see if any of the modules is displayed as “disabled” in Setup Utility.
- Reseat the module that is displayed as “disabled,” and reboot the system.
- If the problem persists, replace the memory module that is displayed as “disabled.”
Extra namespace appears in an interleaved region
If there are two created namespaces in one interleaved region, VMware ESXi ignores the created namespaces and creates an extra new namespace during system booting. Delete the created namespaces in either the Setup Utility or the operating system before the first booting with ESXi.
Migrated PMEMs are not supported
Intel Optane PMEM interleave set (DIMM X) is migrated from another system (Platform ID: 0x00), these migrated PMEMs are not supported nor warranted in this system.
- Move the modules back to the original system with the exact same configuration as the previous one.
- Back up stored data in PMEM namespaces.
- Disable PMEM security with one of the following options:
LXPM
Go to
, and input passphrase to disable security.Setup Utility
Go to
, and input passphrase to disable security.
- Delete namespaces with command corresponding to the operating system that is installed:
Linux command:
ndctl destroy-namespace all -f
Windows Powershell command
Get-PmemDisk | Remove-PmemDisk
- Clear Platform Configuration Data (PCD) and Namespace Label Storage Area (LSA) with the following ipmctl command (for both Linux and Windows).
ipmctl delete -pcd
NoteSee the following links to learn how to download and use impctl in different operating systems: - Reboot the system, and press F1 to enter Setup Utility.
- Power off the system.
- Remove the modules to be reused for a new system or configuration.
PMEMs installed in wrong slots after system board replacement
DIMM X of Intel Optane PMEM persistent interleave set should be moved to DIMM Y.
- Record each of the instructions of changing PMEM slot from XCC events.
- Power off the system, and remove the PMEMs that are mentioned in the warning messages. It is suggested to label these PMEMs to avoid confusion.
- Install the PMEMs in the correct slot number indicated in the warning messages. Remove the labels to avoid blocking airflow and cooling.
- Complete replacement and power on the system. Make sure there is no similar warning messages in XCC.
After PMEMs are reconfigured, error messages and LEDs persist to indicate PMEMs are installed in wrong slots
AC the system or restart XCC to solve this problem.
Cannot create goal successfully when installing PMEMs to the system for the first time
- ERROR: Cannot retrieve memory resources info
- ERROR: One or more PMEM modules do not have PCD data. A platform reboot is recommended to restore valid PCD data.
- If the PMEMs have been installed in another system with stored data, perform the following steps to erase the data.
- Based on the original population order, install the PMEMs to the original system where they were installed previously, and back up the data from the PMEMs to other storage devices.
- Disable PMEM security with one of the following options:
LXPM
Go to
, and input passphrase to disable security.Setup Utility
Go to
, and input passphrase to disable security.
- Delete namespaces with command corresponding to the operating system that is installed:
Linux command:
ndctl destroy-namespace all -f
Windows Powershell command
Get-PmemDisk | Remove-PmemDisk
- Clear Platform Configuration Data (PCD) and Namespace Label Storage Area (LSA) with the following ipmctl command (for both Linux and Windows).
ipmctl delete -pcd
NoteSee the following links to learn how to download and use impctl in different operating systems:
- Install the PMEMs back to the target system, and upgrade system firmware to the latest version without entering Setup Utility.
- If the problem persists, overwrite PMEMs with the following ndctl command.
ndctl sanitize-dimm --overwrite all
- Monitor the overwrite status with the following command.
watch -n 1 “ipmctl show -d OverwriteStatus -dimm”
- When seeing all PMEM OverwriteStatus=Completed, reboot the system and see if the problem persists.