Monitoring (moved): Difference between revisions

deprecation process in favor of sphinx and readthedocs
imported>Paulproteus
imported>Mdaniel
(deprecation process in favor of sphinx and readthedocs)
 
(21 intermediate revisions by 4 users not shown)
Line 1:
{{Hacking OpenHatch}}
{{Hacking_OpenHatch}}
 
We are updating our documentation system. This page is now included in our project package, and is automatically generated by sphinx at openhatch.readthedocs.org [[http://openhatch.readthedocs.org/en/latest/internals/monitoring.html Monitoring]
== Monitoring ==
 
The basics:
 
* <code>linode.openhatch.org</code> is the main OpenHatch box, which runs the website.
* Nagios is running on <code>linode2.openhatch.org</code>. This is also the machine where OpenHatch runs Hudson.
 
Access:
 
* There is a <code>nagios</code> user on <code>linode2</code>. We use ssh keys for login. To get ssh access to <code>linode2</code>, paulproteus will need a public ssh key from you. After he's granted ssh access, you should be able to <code>ssh nagios@linode2.openhatch.org</code>.
 
Notifications:
 
* Nagios notifications go to <code>monitoring@lists.openhatch.org</code>. Anyone can subscribe to this list.
 
Viewing the web interface, and handling the daemon:
 
* On <code>linode2</code>, <code>~nagios/secrets/</code> contains the mailman and Nagios web interface passwords.
* View the Nagios web interface at <code>http://linode2.openhatch.org/nagios3/</code>
* To restart the Nagios daemon, run <code>sudo /etc/init.d/nagios3 restart</code>
 
== TODOs ==
 
# Send Nagios notifications to IRC (<code>#openhatch-auto</code>?)?
# Make the Nagios web interface world-viewable.
# Currently, only paulproteus has access to <code>linode</code>, so only he can do things like reboot the machine. We're still working out an access model that makes sense.
Anonymous user