Monitoring (moved): Difference between revisions

From OpenHatch wiki
Content added Content deleted
imported>Paulproteus
No edit summary
imported>Paulproteus
No edit summary
Line 2: Line 2:


== Monitoring ==
== Monitoring ==

The basics:


# <code>linode.openhatch.org</code> is the main OpenHatch box, which runs the website.
# <code>linode.openhatch.org</code> is the main OpenHatch box, which runs the website.
# Nagios is running on <code>linode2.openhatch.org</code>. This is also the machine where OpenHatch runs Hudson.
# Nagios is running on <code>linode2.openhatch.org</code>. This is also the machine where OpenHatch runs Hudson.

Access:

# There is a <code>nagios</code> user on <code>linode2</code>. We use ssh keys for login. To get ssh access to <code>linode2</code>, paulproteus will need a public ssh key from you. After he's granted ssh access, you should be able to <code>ssh nagios@linode2.openhatch.org</code>.
# There is a <code>nagios</code> user on <code>linode2</code>. We use ssh keys for login. To get ssh access to <code>linode2</code>, paulproteus will need a public ssh key from you. After he's granted ssh access, you should be able to <code>ssh nagios@linode2.openhatch.org</code>.

Notifications:

# Nagios notifications go to <code>monitoring@lists.openhatch.org</code>. Anyone can subscribe to this list.
# Nagios notifications go to <code>monitoring@lists.openhatch.org</code>. Anyone can subscribe to this list.

# On <code>linode2</code>, <code>~/nagios/secrets/</code> contains the mailman and Nagios web interface passwords.
Viewing the web interface, and handling the daemon:

# On <code>linode2</code>, <code>~nagios/secrets/</code> contains the mailman and Nagios web interface passwords.
# View the Nagios web interface at <code>http://linode2.openhatch.org/nagios3/</code>
# View the Nagios web interface at <code>http://linode2.openhatch.org/nagios3/</code>
# To restart the Nagios daemon, run <code>sudo /etc/init.d/nagios3 restart</code>
# To restart the Nagios daemon, run <code>sudo /etc/init.d/nagios3 restart</code>

Revision as of 21:35, 21 January 2011

This is a page about improving or modifying OpenHatch.

We call that "Hacking OpenHatch," and there is a whole category of pages about that.


Monitoring

The basics:

  1. linode.openhatch.org is the main OpenHatch box, which runs the website.
  2. Nagios is running on linode2.openhatch.org. This is also the machine where OpenHatch runs Hudson.

Access:

  1. There is a nagios user on linode2. We use ssh keys for login. To get ssh access to linode2, paulproteus will need a public ssh key from you. After he's granted ssh access, you should be able to ssh nagios@linode2.openhatch.org.

Notifications:

  1. Nagios notifications go to monitoring@lists.openhatch.org. Anyone can subscribe to this list.

Viewing the web interface, and handling the daemon:

  1. On linode2, ~nagios/secrets/ contains the mailman and Nagios web interface passwords.
  2. View the Nagios web interface at http://linode2.openhatch.org/nagios3/
  3. To restart the Nagios daemon, run sudo /etc/init.d/nagios3 restart

TODOs

  1. Send Nagios notifications to IRC (#openhatch-auto?)?
  2. Make the Nagios web interface world-viewable.
  3. Currently, only paulproteus has access to linode, so only he can do things like reboot the machine. We're still working out an access model that makes sense.