Quantcast
Channel: ProLiant Servers (ML,DL,SL) topics
Viewing all articles
Browse latest Browse all 10362

Randonm hangs on DL380p G8 HP servers for more than one year with no solution by HP

$
0
0

Hi

We are having problems with 5 "DL80p G8 HP servers" that hangs randomly ( period of month or weeks).

This hangs happen with a periodicity of months or weeks, but and hp don´t know the reason.

We are using these servers in a critical environment and this problem is not acceptable.

 

During the hangs it´s only possible to acces to the servers using ILO, KVM don´t work, no ping. ONLY ILO

 

We have worked with HP for more than one year ans thye have replaced

 

- HD Firmware

- ILO Firmware

- Drive firmware

- Rom

- hpsa firmware

 

They have replaced the system board but the problem persists.

 

 

The OS installed in these servers is SLES11+SP2

 

Nothing related to the OS is obtained after the crash even when we have activatef

 

 

1-      ASR service up and running

We have checked that ASR was running before and after the crash, but the ASR was not automatically activated after hang as expected.

The ASR was in charge of restarting the machine in this kind of situations, so we hat to restart it manually using iLO.

 

2-      NMI was up and running

We have checked that the NMI was up and running before and after the crash, but it was not possible to execute a NMI restart using iLO.

That problem could be the reason why we cannot obtain kernel dumps in these kind of hang.

 

3-      kernel dump configured

kernel dump was configured for being automatically generated after O.S.  hang however it didn´t work.  We are sure that the kernel dump was active because we were able to force a manual kernel dump as indicated by hp procedure (it was executed just before recover the situation of the first hang)

 

The hang seems so hard that the OS is totally blocked and it can generate a log, in fact if we create a hang

running echo c >/proc/sysrq-trigger

 

 

We have checked the status of the server when a hang happens and it says tha all is OK ==> BUT IS TOTALLY HANG

 

We are complelely blocked on this Issue, and HP doesn´t provide any solution even payiing.

We have requested a contract for a consultant, or any kind of service but no solution is provided by hp.

 

We have requeste HP the replacement of the srvers but ihp says that is not imposible, even when hp has replaced firmware, fans, system boards....

 

 

The servers are located in differnt physical places so the environment factor can not explain the hangs

 

The aplication software are different in each server, so it can not explain the hangs.

 

The servers have been bought at the same time, so it seem that it could be some hardware wrong related to them 

 

We have payed different ho care packs ==> HW, Software

 

Any clue will be weelcome, we arre totally nlocked for MORE THAN ONE YEAR,

 

 

 

 

 

 

 


Viewing all articles
Browse latest Browse all 10362

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>