February 1, 2008

Server went down!

So, at about 10:30 my time, the server went down, and I have no idea why.  I was connected over SSH at the time, and I could ping it from my other Linux box.  At first, I thought some sort of spambot was hitting the server at such a high rate that it stopped responding.  Now, I am not so sure anymore.

I logged into the webhost's administration page to reboot the server.  Once I got in, I did the usual reboot.  Five minutes went by, nothing.  Ten, nothing.  It was starting to look not so good.  So, naturally, I looked at the IPMI sensors and found out that Fan #3 was not spinning at all.  Its RPM was exactly 0.

server-down.gif

Fan 3, 0 RPM…not a good sign.  So I submitted a ticket to the webhost.  This was my message:
"Hi, it seems like the server nhim.vietnhim.com is having trouble starting up. According to the IPMI sensors, one of the fans is not spinning. Could you please check up on this and see what is happening with the server?"

Within three minutes, I got this reply:
"To verify this physically, we would need to shutdown the server. When would be a convenient time to do this?"

They must be joking, right?  I said that the server was having trouble starting up, so I wonder why they had to ask me when would be a good time to shut it down.  Probably because of some sort of company policy, or because my original ticket was unclear, but I had hoped they would understand what I submitted.  Should this be a LOL moment or a WTF moment?  I don't know.

11:47PM
The tech guy went off to check the server, and since then, it has been off the IPMI control twice.  Still no replies from him, and no successful pings either.  This is rather nerve-racking as we have a lot of things on this particular server.

12:06AM
Well, the IPMI sensor data is back online and updated.  CPU temperature is higher, system temperature is higher, Fan #3 is still at 0 RPM.  It almost seems like a motherboard error, not good.  I am guessing that he's trying to replace a fan, and the new one still doesn't spin.  Of course, all of this is speculation at this point.

12:13AM
Ping is back!!  Oh my!  Let us see what the tech said the issue was!  LMAO, the system is back on, but Fan 3 is still at 0 RPM.  Maybe I don't have a Fan 3?  I guess too much speculation isn't good.  Got me quite worried :D.

12:25AM
Nothing was wrong with the server.  It had to do a filesystem check and that's why it didn't reboot right away.  Who knew!  My server doesn't have a Fan 3 either, lmao!  Wow, I put myself through a lot more trouble than I should have.
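
For the curious, here is a rough sketch of how a surprise filesystem check like this might be spotted before a reboot.  It assumes an ext2/ext3 filesystem and that you have saved the text printed by `tune2fs -l` for the device.  The function name `fsck_due_on_next_boot` and the sample text are made up for illustration, and the sketch only looks at the mount-count trigger, not the time-based check interval, so treat it as a hint rather than a guarantee.

```python
def fsck_due_on_next_boot(tune2fs_output: str) -> bool:
    """Guess whether the mount-count limit would trigger a check on the next boot.

    Hypothetical helper: it only reads the "Mount count" and
    "Maximum mount count" fields from `tune2fs -l` style text and
    ignores the time-based "Check interval" entirely.
    """
    fields = {}
    for line in tune2fs_output.splitlines():
        # Split on the first colon only, so values containing colons survive.
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()

    try:
        mount_count = int(fields["Mount count"])
        max_mount_count = int(fields["Maximum mount count"])
    except (KeyError, ValueError):
        # Fields missing or not numeric: we cannot tell, so report "not due".
        return False

    # A maximum of 0 or -1 means mount-count checking is switched off.
    if max_mount_count <= 0:
        return False
    return mount_count >= max_mount_count


# Made-up sample text in the shape of `tune2fs -l` output (not from my server).
SAMPLE = """\
Filesystem state:         clean
Mount count:              30
Maximum mount count:      30
"""

if __name__ == "__main__":
    print(fsck_due_on_next_boot(SAMPLE))
```

If this reports a check is due, a reboot that sits silent for ten minutes is a lot less scary.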

Filed under Blog, My computers, Servers by A.K.
