Monitoring (moved)

imported>Jesstess
== Monitoring ==
 
=== The basics: ===
 
* <code>linode.openhatch.org</code> is the main OpenHatch box, which runs the website.
* Nagios is running on <code>linode2.openhatch.org</code>. This is also the machine where OpenHatch runs Hudson.
 
=== Access: ===
 
* There is a <code>nagios</code> user on <code>linode2</code>. We use SSH keys for login.
* If you want SSH access to that account, file a bug requesting it and attach your public SSH key. You should hear back within 2 days; if you don't, try to find paulproteus or jesstess on IRC.
* Once access has been granted, you can log in with:
 
 ssh nagios@linode2.openhatch.org
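Once access is granted, an entry in your SSH client configuration can shorten the login command. The following is a minimal sketch, not part of the OpenHatch setup itself: the function name <code>nagios_ssh_stanza</code>, the host alias <code>openhatch-nagios</code>, and the default key path are all illustrative assumptions. Only the host name and the <code>nagios</code> user come from this page.

```shell
# Print an ssh_config stanza for the nagios account on linode2.
# ASSUMPTIONS: the alias "openhatch-nagios" and the default key path are
# made up for illustration; substitute the private key that matches the
# public key you attached to your access request.
nagios_ssh_stanza() {
    key_path="${1:-$HOME/.ssh/id_ed25519}"
    cat <<EOF
Host openhatch-nagios
    HostName linode2.openhatch.org
    User nagios
    IdentityFile $key_path
    IdentitiesOnly yes
EOF
}

# Review the output first, then append it to your configuration:
#   nagios_ssh_stanza >> ~/.ssh/config
#   ssh openhatch-nagios
```

Printing the stanza instead of writing it directly lets you check it before it touches <code>~/.ssh/config</code>.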
 
=== Notifications: ===
 
* Nagios notifications go to <code>monitoring@lists.openhatch.org</code>. Anyone can subscribe to this list.
 
=== Viewing the web interface, and handling the daemon: ===
 
* On <code>linode2</code>, <code>~/nagios/secrets/</code> contains the mailman and Nagios web interface passwords.
* To restart the Nagios daemon, run <code>sudo /etc/init.d/nagios3 restart</code>.
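If you have edited the Nagios configuration, it can help to validate it before restarting, since a daemon that fails to come back up means no monitoring at all. The sketch below wraps the restart command from this page in such a check. The binary path <code>/usr/sbin/nagios3</code> and the config path <code>/etc/nagios3/nagios.cfg</code> are assumed Debian-style defaults and have not been verified against <code>linode2</code>; the function name <code>safe_restart_nagios</code> is hypothetical.

```shell
# Restart Nagios only if its configuration passes the built-in check.
# ASSUMPTIONS: NAGIOS_BIN and NAGIOS_CFG are Debian-style default paths,
# not confirmed for linode2. Override them in the environment if they differ.
NAGIOS_BIN="${NAGIOS_BIN:-/usr/sbin/nagios3}"
NAGIOS_CFG="${NAGIOS_CFG:-/etc/nagios3/nagios.cfg}"
NAGIOS_INIT="${NAGIOS_INIT:-/etc/init.d/nagios3}"
SUDO="${SUDO-sudo}"   # set SUDO="" if you are already root

safe_restart_nagios() {
    # "-v" asks Nagios to verify the configuration file and exit.
    if $SUDO "$NAGIOS_BIN" -v "$NAGIOS_CFG"; then
        $SUDO "$NAGIOS_INIT" restart
    else
        echo "Config check failed; not restarting Nagios." >&2
        return 1
    fi
}
```

After sourcing the snippet, run <code>safe_restart_nagios</code> in place of the plain restart command.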
 
=== In case of emergency: ===
 
* See [[Emergency operations for the openhatch server]]. People with SSH keys set up for the Linode Shell (Lish) can reboot the box and have other limited emergency capabilities.