HP ProLiant Servers: Unexpected system reboot and Uncorrectable Machine Check Exception

I faced with strange problem on ProLiant server (BL460c ) in C7000 enclosure. One of eight servers had unexpected system reboots one or twice in 1-2 days.

All servers were updated with HP Service Pack 09.2013 through HP SUM 6.20 and configured for Hyper-V cluster 2008 R2 (up-to-date too).

From OS I didn’t see any errors/warnings besides Kernel Power with ID 41 , no memory dumps too

Went to Blade administration and noticed in IML logs:

Uncorrectable Machine Check Exception (Board 0, Processor 1, APIC ID 0x00000035, Bank 0x00000005, Status 0xBE000000’00800400,
Address 0x00003800’06D29323, Misc 0x00000000’00007FFF)

Searched at HP support site and found this article . Server’s ROM is 07/02/2013. So, is this article applicable to server with ROM > 2011?

Anyway, I applied offered actions (go to System BIOS ,change options below) and resolved the issue.

  • Minimum Processor Idle Power State  to  “No C-states”
  • Intel QPI Link Power Management to “Disabled”
  • HP Power Profile option to “Maximum Performance”

TIP: If you have warranty/support contract with HP, you do not need to make any changes. Locate your HP local partner and replace CPU /board or open HP support case! It’s not official resolution. According with support article, only HP ProLiant servers with ROMs older than May, 2011 are affected!

HP P2000 G3 MSA Array Systems – Microsoft Windows 2012 Cluster Validation Test Returns Persistent Reservation Errors

Issue

While performing the cluster validation test on Microsoft Windows 2012, the test returns persistent validation warnings like the ones provided below:

msa

Failure. Persistent Reservation not present on Test Disk 0 from node Server.XXXXX.net after successful call to update reservation holder’s registration key 0xb.

Failure. Persistent Reservation not present on Test Disk 1 from node Server.XXXXX.net after successful call to update reservation holder’s registration key 0x10000000b.

Test Disk 0 does not support SCSI-3 Persistent Reservations commands needed to support clustered Storage Pools.
Some storage devices require specific firmware versions or settings to function properly with failover clusters.
Please contact your storage administrator or storage vendor to check the configuration of the storage to allow it to function properly with failover clusters.

Test Disk 1 does not support SCSI-3 Persistent Reservations commands needed to support clustered Storage Pools.
Some storage devices require specific firmware versions or settings to function properly with failover clusters.
Please contact your storage administrator or storage vendor to check the configuration of the storage to allow it to function properly with failover clusters.

Solution

Microsoft Windows 2012 and Microsoft Windows 8 introduce the feature called Storage Spaces (Windows Virtual Disks), the storage spaces feature does not support volumes presented using any kind of RAID controllers including the HP P2000 G3 MSA Array Systems.

As part of the Cluster validation process Microsoft Windows 2012 also runs a Storage Spaces test. As the Storage Spaces feature does not support the P2000 volumes the Cluster Validation test will include the warnings shown on the issue section of this article, however; the warnings do not cause the Cluster Validation test, as the warnings are only related to the Storage Spaces feature.

This is the normal behavior and it is not related to any issues on the HP P2000 G3 MSA Array Systems, it is just a warning to remind that the Storage Spaces feature does not support RAID based storage devices, and can be safely ignored.

Source