Nagios

Introduction


When Nagios receives passive host checks from remote sources (i.e other Nagios instances in distributed or failover setups), the host state reported by the remote source may not accurately reflect the state of the host from Nagios' view. As distributed and failover monitoring installations are fairly common, it is important to provide a mechanism for ensuring accurate host states between different instances of Nagios.

Different World Views


The image below shows a simplified view of a failover monitoring setup.

Passive State Translation

What states are Router-C and Router-D currently in? The answer depends on which Nagios instance you ask.

Each Nagios instance has a different view of the network. The backup monitoring servers should not blindly accept passive host states from the primary monitoring server, or they will have incorrect information on the current state of the network.

Without translating passive host check results from the primary monitoring server (Nagios-A), Nagios-C would see Router-D as UNREACHABLE, when it is really DOWN based on its viewpoint. Similarly, the DOWN/UNREACHABLE states (from the viewpoint of Nagios-A) for Router-C and Router-D should be flipped from the viewpoint of Nagios-B.

Note: There may be some situations where you do not want Nagios to translate DOWN/UNREACHABLE states from remote sources to their "correct" state from the viewpoint of the local Nagios instance. For example, in distributed monitoring environments you may want the central Nagios instance to know how distributed instances see their respective portions of the network.

Enabling State Translation


By default, Nagios will not automatically translate DOWN/UNREACHABLE states from passive check results. You will need to enable this feature if you need and want it.

The automatic translation of passive host check states is controlled by the translate_passive_host_checks variable. Enable it and Nagios will automatically translate DOWN and UNREACHABLE states from remote sources to their correct state for the local instance of Nagios.

See Also See Also: Host Checks, Network Reachability, Passive Checks, Distributed Monitoring, Redundant/Failover Monitoring

English Deutsch 日本語

Table of Contents