Monitoring (moved): Difference between revisions
imported>Paulproteus (→Access) |
imported>Mdaniel (deprecation process in favor of sphinx and readthedocs) |
{{Hacking_OpenHatch}}

We are updating our documentation system. This page is now included in our project package and is automatically generated by Sphinx at openhatch.readthedocs.org: [http://openhatch.readthedocs.org/en/latest/internals/monitoring.html Monitoring].
== Monitoring ==

=== The basics ===

* <code>linode.openhatch.org</code> is the main OpenHatch box, which runs the website.
* <code>linode2.openhatch.org</code> is the secondary server for OpenHatch. It hosts the Hudson continuous integration server as well as Nagios.
* The Nagios configuration is owned by a user called ''nagios'' on ''linode2.openhatch.org''.
=== Access ===

* We use SSH keys for login.
* If you want SSH access to the ''nagios'' account, file a bug requesting it and attach your public SSH key. You should hear back within 2 days; if you don't, try to find paulproteus or jesstess on IRC.
* Once your key has been added, you can log in with:
 ssh nagios@linode2.openhatch.org
* If you are logged in without being asked for anything, key authentication is working. If you see a "Password:" prompt, it is not.
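The login check described above can also be run non-interactively. The following is a minimal sketch, not part of the OpenHatch tooling: the function name <code>check_key_login</code> is hypothetical, and it relies only on the standard OpenSSH options <code>BatchMode</code> and <code>ConnectTimeout</code>. With <code>BatchMode=yes</code>, ssh fails instead of falling back to a "Password:" prompt, so the exit status tells you whether the key was accepted.

```shell
# Hypothetical helper (not part of OpenHatch): report whether key-based
# login to the given user@host works without any password prompt.
check_key_login() {
    target="$1"
    # BatchMode=yes disables interactive prompts, so a missing or rejected
    # key makes ssh exit non-zero instead of asking for a password.
    # The remote command "true" does nothing and exits immediately.
    if ssh -o BatchMode=yes -o ConnectTimeout=10 "$target" true; then
        echo "key login OK: $target"
    else
        echo "key login FAILED: $target" >&2
        return 1
    fi
}
```

For example, <code>check_key_login nagios@linode2.openhatch.org</code> should report success once your key has been installed on the account.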
=== Notifications ===

* Nagios notifications go to <code>monitoring@lists.openhatch.org</code>. Anyone can subscribe to this list.
=== Viewing the web interface, and handling the daemon ===

* On <code>linode2</code>, <code>~/nagios/secrets/</code> contains the mailman and Nagios web interface passwords.
* View the Nagios web interface at <code>http://linode2.openhatch.org/nagios3/</code>
* To restart the Nagios daemon, run <code>sudo /etc/init.d/nagios3 restart</code>
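Restarting with a broken configuration can leave the daemon down, so it may be worth validating the configuration first. The sketch below is an illustration only: the function name <code>safe_restart_nagios</code> is hypothetical, and the binary and configuration paths are assumptions based on the usual Debian-style <code>nagios3</code> packaging, not something this page confirms. The <code>-v</code> flag is Nagios's standard configuration check.

```shell
# Assumed Debian-style locations; override via the environment if the
# install on linode2 differs. These paths are NOT confirmed by this page.
NAGIOS_BIN="${NAGIOS_BIN:-/usr/sbin/nagios3}"
NAGIOS_CFG="${NAGIOS_CFG:-/etc/nagios3/nagios.cfg}"

# Hypothetical wrapper: restart Nagios only if the config check passes.
safe_restart_nagios() {
    # "-v" asks Nagios to verify the configuration and exit non-zero on errors.
    if sudo "$NAGIOS_BIN" -v "$NAGIOS_CFG" >/dev/null; then
        sudo /etc/init.d/nagios3 restart
    else
        echo "config check failed; not restarting" >&2
        return 1
    fi
}
```

If the check fails, running the same <code>-v</code> command without the redirect shows the errors Nagios found.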
=== In case of emergency ===

* See [[Emergency operations for the openhatch server]]. People with SSH keys set up for the Linode Shell (Lish) can reboot the box and perform a limited set of other emergency operations.
== TODOs ==

# Send Nagios notifications to IRC (perhaps <code>#openhatch-auto</code>)?
# Make the Nagios web interface world-viewable.
# Version the monitoring configurations.
# Send SMS alerts to people who want them.
# Add historical trending (Munin)?
== Related ==

* See also [[Emergency operations for the openhatch server]]
Latest revision as of 16:41, 25 June 2012
This is a page about improving or modifying OpenHatch.
We call that "Hacking OpenHatch," and there is a whole category of pages about that.