Monitoring (moved)
This is a page about improving or modifying OpenHatch.
We call that "Hacking OpenHatch," and there is a whole category of pages about that.
Monitoring
The basics
linode.openhatch.org
is the main OpenHatch box, which runs the website.linode2.openhatch.org
is the secondary server for OpenHatch. It hosts the Hudson continuous integration server, as well as Nagios!- The Nagios configuration is owned by a user called nagios on linode2.openhatch.org.
Access
- We use ssh keys for login.
- If you want SSH access to that account, file a bug requesting it, and attach an SSH key. You should hear back within 2 days; if you don't hear back by then, try to find paulproteus or jesstess on IRC.
- Then you can do:
ssh nagios@linode2.openhatch.org
- You'll know it's working if you are logged in. If you see a "Password:" prompt, then it is not working.
Notifications
- Nagios notifications go to
monitoring@lists.openhatch.org
. Anyone can subscribe to this list.
Viewing the web interface, and handling the daemon
- On
linode2
,~/nagios/secrets/
contains the mailman and Nagios web interface passwords. - View the Nagios web interface at
http://linode2.openhatch.org/nagios3/
- To restart the Nagios daemon, run
sudo /etc/init.d/nagios3 restart
In case of emergency
- See Emergency operations for the openhatch server. People with ssh keys set up for the Linode Shell (Lish) can reboot the box and have other limited emergency capabilities.
TODOs
- Send Nagios notifications to IRC (
#openhatch-auto
?)? - Make the Nagios web interface world-viewable.
- Version the monitoring configurations.
- Send SMS alerts to people who want them.
- Add historical trending (Munin)?