- Ganglia Monitoring For Sysadmins
- Wikis, Books, Magazines
- Academic Papers on Ganglia
- Slideshows, Talks and Presentations on Ganglia
- Plugin Repositories
- Report and Modular Graph Documentation
- Other Complimentary Applications
- Other Similar Applications
- Google Summer of Code
- Deploying Ganglia
Ganglia Monitoring For Sysadmins
|gmond||The ganglia monitoring daemon (gmond) is a lightweight service that is installed on every machine you'd like to monitor. This daemon uses a simple listen/announce protocol via XDR to collect monitoring state and then shares this information via XML over TCP. Gmond is portable and collects dozens of system metrics: CPU, memory, disk, network and process data.|
|gmetad||The ganglia meta daemon (gmetad) is a service that collects data from other gmetad and gmond sources and stores their state to disk in indexed round-robin databases. Gmetad provides a simple query mechanism for collecting specific information about groups of machines. Gmetad supports hierarchical delegation for creating manageable monitoring domains.|
|gmetric||The ganglia metric tool is a commandline application that you can use to inject custom made metrics about hosts that are being monitored by ganglia. It has the ability to spoof messages as coming from a different host in case you want to capture and report metrics from a device where you don't have gmond running (like a network or other embedded device).|
|gstat||The ganglia stat tool is a commandline application that you can use to query a gmond for metric information directly.|
|web||The ganglia web frontend expresses the data stored by gmetad in a graphical web interface using PHP.|
Ganglia Execution Environment
Ganglia can do more than just monitoring too! GEXEC is a scalable cluster remote execution system which provides fast, RSA authenticated remote execution of parallel and distributed jobs. It provides transparent forwarding of stdin, stdout, stderr, and signals to and from remote processes, provides local environment propagation, and is designed to be robust and to scale to systems over thousands of nodes.
In addition, with authd authentication from RocketCalc, allows SSH to sit on top of GEXEC allowing better performance and easier maintenance than SSH alone.
You can also embed ganglia into your favorite application using embedded gmetric. This project adds a simple embedded version of gmetric that sends ganglia compatible packets. Right now the C/C++ version only supports unicast. The pure-python version supports unicast and multicast. An alternative solution is to use a scripted interface to libganglia and therefore generate metrics from your own scripted application as suggested by Ganglia::Gmetric for perl applications or to use libganglia directly from your application.
If you can't integrate your application with ganglia to generate metrics and your only option is to parse its logs, you might be interested on ganglia logtailer.
The following applications had already an integration hook for ganglia :
Wikis, Books, Magazines
There is a great ganglia Wiki on the IBM website. This wiki has great information about running ganglia in IBM platforms (including AIX) and a very graphic description of how the ganglia architecture works.
The book The Linux Enterprise Cluster: Build a Highly Available Cluster with Commodity Hardware and Free Software by Karl Kopper from No Starch Press has a nice chapter on installing and configuring ganglia.
The Grid2 by Ian Foster and Carl Kesselman has basic information about ganglia including screenshots and explanation of how it related to the overall Grid ecosystem.
The August 2003 edition of Linux Magazine published an article on ganglia in their Extreme Linux section.
Jesse Becker wrote an article for Linux Magazine titled Marching Penguins: Monitoring Your HPC Cluster which was published on September 3rd, 2008.
Academic Papers on Ganglia
The Ganglia Distributed Monitoring System: Design, Implementation, and Experience.
Matthew L. Massie, Brent N. Chun, and David E. Culler.
Parallel Computing, Vol. 30, Issue 7, July 2004.
Wide Area Cluster Monitoring with Ganglia
Federico Sacerdoti, Mason Katz, Matt Massie, David E. Culler.
IEEE Cluster 2003 Conference, Hong Kong. December 2003.
Slideshows, Talks and Presentations on Ganglia
Using the CIM statistical model to monitor your data
Presentation given at Management Developers Conference 2007 by Brad Nicholes
Monitoring Your Data Center using Ganglia
Presentation given at ApacheCon 2007 by Brad Nicholes
Brooks Davis (a ganglia developer) was invited to speak on TechTV about FreeBSD clusters. During the 6 minute segment, he was kind enough to plug ganglia and demonstrate the web interface.
We are currently hosting three plugin repositories at GitHub:
- Gmetric: http://github.com/ganglia/gmetric
- Gmond Python DSO metric modules: http://github.com/ganglia/gmond_python_modules
- Other Ganglia contributions: http://github.com/ganglia/ganglia_contrib
The deprecated repository of user-contributed gmetric scripts can still be found at http://ganglia.sourceforge.net/gmetric/ (most of the scripts have already been imported to the Gmetric repository mentioned above).
Report and Modular Graph Documentation
Documentation on using the new (as of 3.1.x) modular graph code.
Other Complimentary Applications
Most other monitoring solutions with the following probably worth a mention as they are usually found alongside ganglia deployments :
Other Similar Applications
Similar in some ways to ganglia and worth some consideration if for whatever reason ganglia doesn't fit the bill for you :
Google Summer of Code
- Using Chef https://github.com/gchef/ganglia-cookbook