[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Remote health test for servers?



Quoting Wolfgang Lonien <wjlonien@gmx.net>:
> Jacob S. wrote:
> 
> > Does anyone have some good scripts (or know of a progam, though I
> > suspect that's overkill) for having an automated process that
> > periodically runs to make sure a server is still alive?
> > 
> > If the server were running the other os, I think an occasional ping
> > would be good enough, but I've seen a Linux server lose it's hard drive
> > and still answer pings before. :-)
> 

Nagios, Big Brother, Big Sister, and others are good for Go/No Go
checking of hosts' and servers' aliveness.  I am most familiar with
Nagios.  It can take actions on detected problems like e-mail an alert
to your cellphone/pager.  It saved my system once.  I turned off the
A/C to clean the outdoor exchanger.  Forgot to turn it back on.  Went
into town to write and sip a latte.  In about an hour I received the
temperature warning message, halfway home I received the temperature
critical message.  Made it home with the ambient temperature over 90F
and the CPU temperature in the mid 50C's (130F's).  60C is about all
the manufacturer warrants.

Nagios will ping hosts.  Connect to HTTP, SMTP, IMAP, POP,
etc. servers. Query SNMP agents. And run scripts.  Nice.

HTH,
  Jeffrey



Reply to: