Difference between revisions of "Monitoring (moved)"

From OpenHatch wiki
Jump to navigation Jump to search
imported>Paulproteus
imported>Paulproteus
Line 2: Line 2:
  
 
== Monitoring ==
 
== Monitoring ==
 +
 +
The basics:
  
 
# <code>linode.openhatch.org</code> is the main OpenHatch box, which runs the website.
 
# <code>linode.openhatch.org</code> is the main OpenHatch box, which runs the website.
 
# Nagios is running on <code>linode2.openhatch.org</code>. This is also the machine where OpenHatch runs Hudson.
 
# Nagios is running on <code>linode2.openhatch.org</code>. This is also the machine where OpenHatch runs Hudson.
 +
 +
Access:
 +
 
# There is a <code>nagios</code> user on <code>linode2</code>. We use ssh keys for login. To get ssh access to <code>linode2</code>, paulproteus will need a public ssh key from you. After he's granted ssh access, you should be able to <code>ssh nagios@linode2.openhatch.org</code>.
 
# There is a <code>nagios</code> user on <code>linode2</code>. We use ssh keys for login. To get ssh access to <code>linode2</code>, paulproteus will need a public ssh key from you. After he's granted ssh access, you should be able to <code>ssh nagios@linode2.openhatch.org</code>.
 +
 +
Notifications:
 +
 
# Nagios notifications go to <code>monitoring@lists.openhatch.org</code>. Anyone can subscribe to this list.
 
# Nagios notifications go to <code>monitoring@lists.openhatch.org</code>. Anyone can subscribe to this list.
# On <code>linode2</code>, <code>~/nagios/secrets/</code> contains the mailman and Nagios web interface passwords.
+
 
 +
Viewing the web interface, and handling the daemon:
 +
 
 +
# On <code>linode2</code>, <code>~nagios/secrets/</code> contains the mailman and Nagios web interface passwords.
 
# View the Nagios web interface at <code>http://linode2.openhatch.org/nagios3/</code>
 
# View the Nagios web interface at <code>http://linode2.openhatch.org/nagios3/</code>
 
# To restart the Nagios daemon, run <code>sudo /etc/init.d/nagios3 restart</code>
 
# To restart the Nagios daemon, run <code>sudo /etc/init.d/nagios3 restart</code>

Revision as of 21:35, 21 January 2011

This is a page about improving or modifying OpenHatch.

We call that "Hacking OpenHatch," and there is a whole category of pages about that.


Monitoring

The basics:

  1. linode.openhatch.org is the main OpenHatch box, which runs the website.
  2. Nagios is running on linode2.openhatch.org. This is also the machine where OpenHatch runs Hudson.

Access:

  1. There is a nagios user on linode2. We use ssh keys for login. To get ssh access to linode2, paulproteus will need a public ssh key from you. After he's granted ssh access, you should be able to ssh nagios@linode2.openhatch.org.

Notifications:

  1. Nagios notifications go to monitoring@lists.openhatch.org. Anyone can subscribe to this list.

Viewing the web interface, and handling the daemon:

  1. On linode2, ~nagios/secrets/ contains the mailman and Nagios web interface passwords.
  2. View the Nagios web interface at http://linode2.openhatch.org/nagios3/
  3. To restart the Nagios daemon, run sudo /etc/init.d/nagios3 restart

TODOs

  1. Send Nagios notifications to IRC (#openhatch-auto?)?
  2. Make the Nagios web interface world-viewable.
  3. Currently, only paulproteus has access to linode, so only he can do things like reboot the machine. We're still working out an access model that makes sense.