Hello,
I'm currently having a hard time troubleshooting the root cause of a random and frequent reboot on a Proliant DL380e Gen8 server.
The errors given by the Integrated Management Log are the following ones :
Uncorrectable Machine Check Exception (Board 0, Processor 2, APIC ID 0x00000025, Bank 0x00000000, Status 0xB2000000'00000005, Address 0x00000000'00000000, Misc 0x00000000'00000000) Uncorrectable Machine Check Exception (Board 0, Processor 2, APIC ID 0x00000024, Bank 0x00000000, Status 0xB2000000'00000005, Address 0x00000000'00000000, Misc 0x00000000'00000000) Uncorrectable Machine Check Exception (Board 0, Processor 2, APIC ID 0x00000025, Bank 0x00000000, Status 0xF2000000'00000005, Address 0x00000000'00000000, Misc 0x00000000'00000000) Uncorrectable Machine Check Exception (Board 0, Processor 2, APIC ID 0x00000024, Bank 0x00000000, Status 0xF2000000'00000005, Address 0x00000000'00000000, Misc 0x00000000'00000000) Uncorrectable Machine Check Exception (Board 0, Processor 1, APIC ID 0x00000000, Bank 0x00000004, Status 0xB2000000'72000402, Address 0x00000000'00000000, Misc 0x00000000'00000000)
At first sight and giving the fact that these errors are concerning both processors, I would say that the problem is coming from somewhere on the system board.
I've tried to run a complete test with the integrated tool (intelligent provisionning) on the whole server, but it doesn't give any result (100% success).
I've also tried to search the forum and the web for a potential fix, but the only things I've got are regarding G7 DL380 servers, so I'm not really sure it's the same problem here.
If somebody could give me a hand on this problem, that would be very much appreciated.
Thanks !