The Management Pack for Apache Hadoop creates alerts (and in some cases provides recommended actions) based on various symptoms it detects in your Apache Hadoop Environment. See the table below for the list of alerts available in the Management Pack.

Alerts List

Name

Description

Symptom

Recommendation

Node Managers Unhealthy Immediate

Node Manager(s) have failed YARN health checks.

Node Managers Unhealthy

Check that NodeManagers have adequate disk space available.

Node Managers Unhealthy Critical

Node Manager(s) have failed YARN health checks.

Node Managers Unhealthy

Check that NodeManagers have adequate disk space available.

NameNode High Availability State Changed

NameNode is now the active NameNode.

NameNode High Availability Failover Occurred

If unplanned, check the status of the previously active NameNode.

NameNode Missing Blocks

Blocks missing. NameNode may be unhealthy.

NameNode Missing Blocks

Check for NameNode in maintenance.

NameNode Unreachable

NameNode is unreachable.

NameNode Unreachable

Check the NameNode to ensure it is functional.

NameNode Volume Failures

At least one volume has failed.

NameNode Volume Failures

Replace volume or NameNode to prevent data loss.

NameNode Volume Failures

At least one volume has failed.

NameNode Volume Failures

Replace volume or NameNode to prevent data loss.

NameNode Dead DataNodes

At least one DataNode is dead.

NameNode Number of DataNodes Dead

Replace DataNode to prevent data loss.

DataNode Used Capacity

Used disk capacity threshold exceeded.

DataNode Used Capacity

Add disks to DataNode.

DataNode Unreachable

DataNode is unreachable.

DataNode Unreachable

Check the DataNode to ensure it is functional.

DataNode Dead

DataNode is Dead.

DataNode Dead

Check the DataNode to ensure it is functional.

DataNode Decommissioned

DataNode is Decommissioned.

DataNode Decommissioned

Check the DataNode to ensure it is functional.

ResourceManager Unreachable

ResourceManager is unreachable

ResourceManager Unreachable

Check the ResourceManager to ensure it is functional.

ResourceManager Used Memory

Used disk capacity threshold exceeded.

ResourceManager Used Memory

Add DataNodes or disks to existing DataNodes.

ResourceManager Used Memory

Used disk capacity threshold exceeded.

ResourceManager Used Memory

Add DataNodes or disks to existing DataNodes.

ResourceManager High Availability State Changed

ResourceManager is now the active ResourceManager.

ResourceManager High Availability Failover Occurred

If unplanned, check the status of the previously active ResourceManager.

NodeManager Unreachable

NodeManager is unreachable.

NodeManager Unreachable

Check the NodeManager to ensure it is functional.