I hope this is posted in the right place.
This DL380 g6 server is equipped with dual X5650 and 64GB ram, 4 disk raid 10, dual 750watt ps.
The machine just shutdown unexpectedly. ILO2 reports:
System Overheating (Zone 19, Location CPU, Temperature 73C)
Informational.
The machine attempted to start back up but just kept starting and then shutting down within a second or 2. Letting it sit awhile, it later booted and ran for 12 or so hours before repeating the same routine. This time it will not boot but ILO2 reports:
Power loss due to overheating. Attempting to restore power.
...then it immediately shuts back down. The health light is amber indicating a temperature caution.
If it is sitting for an hour with the power off and it reports overheating when I try to boot cold, then it must be a faulty reading. The machine is cold.
I noticed that when it booted and ran OK after the 1st failure, temp 19 was around 30C. Just before the second failure, I heard the fans start to spin up (75%) and I knew it was going to happend again, so I recorded temp 19 at around 63C and climbing, then it shut down.
I suspect a faulty temp sensor but have no way of determining which component is going bad. Could this be in the CPU, and if so which one. Is zone 19 on the MB or where? I have found no documentation to tell me.
Has anyone else seen this failure and what did you do to fix it?
Any help would be appreciated!