The Management Pack for Apache Hadoop creates alerts (and in some cases provides recommended actions) based on various symptoms it detects in your Apache Hadoop Environment. See the table below for the list of alerts available in the Management Pack.

Alerts List

Name Description Symptom Recommendation
Node Managers Unhealthy Immediate Node Manager(s) have failed YARN health checks. Node Managers Unhealthy Check that NodeManagers have adequate disk space available.
Node Managers Unhealthy Critical Node Manager(s) have failed YARN health checks. Node Managers Unhealthy Check that NodeManagers have adequate disk space available.
NameNode High Availability State Changed NameNode is now the active NameNode. NameNode High Availability Failover Occurred If unplanned, check the status of the previously active NameNode.
NameNode Missing Blocks Blocks missing. NameNode may be unhealthy. NameNode Missing Blocks Check for NameNode in maintenance.
NameNode Unreachable NameNode is unreachable. NameNode Unreachable Check the NameNode to ensure it is functional.
NameNode Volume Failures At least one volume has failed. NameNode Volume Failures Replace volume or NameNode to prevent data loss.
NameNode Volume Failures At least one volume has failed. NameNode Volume Failures Replace volume or NameNode to prevent data loss.
NameNode Dead DataNodes At least one DataNode is dead. NameNode Number of DataNodes Dead Replace DataNode to prevent data loss.
DataNode Used Capacity Used disk capacity threshold exceeded. DataNode Used Capacity Add disks to DataNode.
DataNode Unreachable DataNode is unreachable. DataNode Unreachable Check the DataNode to ensure it is functional.
DataNode Dead DataNode is Dead. DataNode Dead Check the DataNode to ensure it is functional.
DataNode Decommissioned DataNode is Decommissioned. DataNode Decommissioned Check the DataNode to ensure it is functional.
ResourceManager Unreachable ResourceManager is unreachable ResourceManager Unreachable Check the ResourceManager to ensure it is functional.
ResourceManager Used Memory Used disk capacity threshold exceeded. ResourceManager Used Memory Add DataNodes or disks to existing DataNodes.
ResourceManager Used Memory Used disk capacity threshold exceeded. ResourceManager Used Memory Add DataNodes or disks to existing DataNodes.
ResourceManager High Availability State Changed ResourceManager is now the active ResourceManager. ResourceManager High Availability Failover Occurred If unplanned, check the status of the previously active ResourceManager.
NodeManager Unreachable NodeManager is unreachable. NodeManager Unreachable Check the NodeManager to ensure it is functional.