Nagios

Introduction


The nagiostats utility allows you to graph various Nagios performance statistics over time using MRTG. This is important because it can help you:

Sample MRTG Configuration


Sample MRTG configuration file snippets for graphing various Nagios performance statistics can be found in the mrtg.cfg file located in the sample-config/ subdirectory of the Nagios distribution. You can create graphs of other performance information if you'd like - the samples just provide you with a good starting point.

Once you copy these sample entries into your MRTG config file (/etc/mrtg/mrtg.cfg) you should have some new graphs the next time MRTG runs.

Example Graphs


I'll describe what a few of the sample MRTG graphs mean and what they can be used for...

Active Host Checks - This graph shows how may active host checks (regularly scheduled and on-demand) have occurred over time. Useful for understanding:

Active Host Checks


Active Service Checks - This graph shows how may active service checks (reguarly scheduled and on-demand) have occurred over time. Useful for understanding:

Active Service Checks


Cached Host and Service Checks - This graph shows how may cached host and service checks have occurred over time. Useful for understanding:

Cached Host and Service Checks


Passive Host and Service Checks - This graph shows how may passive host and service checks have occurred over time. Useful for understanding:

Passive Host and Service Checks


Hosts/Services Actively Checked - This graph shows how many (of the total number of) hosts and services were last checked actively over time. Useful for understanding:

Hosts/Services Actively Checked


Hosts/Services Passively Checked - This graph shows how many (of the total number of) hosts and services were last checked passively over time. Useful for understanding:

Hosts/Services Passively Checked


Average Service Check Latency and Execution Time - This graph shows average service check latency and execution times over time. Useful for understanding:


Consistently high latencies can be an indication that one of more of the following variables need tweaking:

Average Service Check Latency and Execution Time


Average Service State Change - This graph shows the average percent state change (a measure of volatility) for services over time, broken down by services that were last checked either actively or passively. Useful for understanding:

Average Service State Change


Average Host Check Latency and Execution Time - This graph shows average host check latency and execution times over time. Useful for understanding:


Consistently high latencies can be an indication that one of more of the following variables need tweaking:

Average Host Check Latency and Execution Time


Average Host State Change - This graph shows the average percent state change (a measure of volatility) for hosts over time, broken down by hosts that were last checked either actively or passively. Useful for understanding:

Average Host State Change


External Commands - This graph shows how may external commands have been processed by the Nagios daemon over time. Unless you're processing a large number of external commands (as in the case with distributed monitoring setups), this graph may appear mostly empty. Monitoring external commands can be useful for understanding the impacts of:

External Commands


External Command Buffers - This graph shows how may external command buffer slots are in use over time. If the number of used buffers is near the number of available buffers on a regular basis, it is likely you need to increase the available external command buffer slots. Each buffer slot can hold one external command. Buffers are used for temporarily holding external commands from the time they are read from the external command file to the time they are processed by the Nagios daemon.

External Command Buffers

See Also See Also: Nagiostats Utility, Performance Tuning

English Deutsch 日本語

Table of Contents