The Kubernetes objects raise alerts and alert symptom for instances that are available in the cluster.
Alert Definition |
Symptoms |
Severity |
Recommendation |
|---|---|---|---|
Container CPU limit is set to unlimited |
Container CPU limit is not defined |
Info |
A container running without CPU limit may claim all of Node's resources. Modify your Pod configuration with a CPU limit on the affected container. A quick glimpse at the CPU usage trend can help you set the limit. |
Container CPU usage is high |
Container CPU usage is higher than 90% Container CPU usage is higher than 80% Container CPU usage is higher than 70% |
Critical Immediate Warning |
Consider the option of increasing CPU limit on the affected container if Node's resources permit. Else, you may have to add a new Node to the cluster to ease out the CPU crunch. |
Container Memory limit is set to unlimited |
Container Memory limit is not defined |
Info |
A container running without Memory limit may claim all of Node's resources. Modify your Pod configuration with a Memory limit on the affected container. A quick glimpse at the Memory usage trend can help you set the limit. |
Container is not available |
Container is not available |
Immediate |
Redeploy the Pod and make sure it goes to Ready state |
Container Memory usage is high |
Container Process CPU usage is higher than 90% Container Process CPU usage is higher than 80% Container Process CPU usage is higher than 70% |
Critical Immediate Warning |
Consider the option of increasing Memory limit on the affected container if Node's resources permit. Else, you may have to add a new Node to the cluster to ease out the Memory crunch. |
Container Process has high Memory Usage |
Container Process Memory usage is higher than 90% Container Process Memory usage is higher than 80% Container Process Memory usage is higher than 70% |
Critical Immediate Warning |
Consider increasing CPU limit of the container. |
Container Process has high Memory Usage |
Node Process Memory usage is higher than 90% Node Process Memory usage is higher than 80% Node Process Memory usage is higher than 70% |
Critical Immediate Warning |
Consider increasing Memory limit of the container. |
Master Node is not available |
Master Node is not available |
Immediate |
Ensure that the Master Node is reachable and API server is up and running. |
Namespace is not Available |
Namespace is not Available |
Immediate |
Check if the namespace has been deleted |
Node has high CPU Usage |
Node CPU usage is higher than 90% Node CPU Memory usage is higher than 80% Node CPU Memory usage is higher than 70% Node CPU Memory usage is higher than 60% |
Critical Immediate Warning Info |
Consider increasing CPU resource of the Node OR add a new Node to the cluster |
Node is not available |
Node is not available |
Immediate |
Verify if the Node is reachable and in Ready state |
Node has high Memory Usage |
Node Memory usage is higher than 90% NodeMemory usage is higher than 80% NodeMemory usage is higher than 70% NodeMemory usage is higher than 60% |
Critical Immediate Warning Info |
Consider increasing Memory resource of the Node OR add a new new Node to the cluster |
One of the Pods has highest CPU usage on Namespace |
Pod with highest utilization on namespace has CPU usage higher than 90% Pod with highest utilization on namespace has CPU usage higher than 80% Pod with highest utilization on namespace has CPU usage higher than 70% Descendant pod object (OR Operation) Pod memory usage is higher than 70% Pod memory usage is higher than 80% Pod memory usage is higher than 90% |
Critical Immediate Warning Info |
Consider modifying the affected Pod configurations to increase CPU limits |
One of the pods has high Memory usage on Namespace |
Pod with highest utilization on namespace has memory usage higher than 90% Pod with highest utilization on namespace has memory usage higher than 80% Pod with highest utilization on namespace has memory usage higher than 70% Descendant pod object (OR Operation) Pod CPU Usage is higher than 70% Pod CPU Usage is higher than 80% Pod CPU Usage is higher than 90% |
Critical Immediate Warning Info |
Consider modifying the affected Pod configurations to increase Memory limits |
One of the pods has high CPU usage on Service |
Pod with highest utilization on Service has CPU usage higher than 70% Pod with highest utilization on Service has CPU usage higher than 80% Pod with highest utilization on Service has CPU usage higher than 90% Descendant pod object (OR Operation) Pod CPU Usage is higher than 70% Pod CPU Usage is higher than 80% Pod CPU Usage is higher than 90% |
Critical Immediate Warning |
Consider modifying the affected Pod configurations to increase CPU limits |
One of the pods has high Memory usage on Service |
Pod with highest utilization on Service has memory usage higher than 70% Pod with highest utilization on Service has memory usage higher than 80% Pod with highest utilization on Service has memory usage higher than 90% Descendant pod object (OR Operation) Pod memory usage is higher than 70% Pod memory usage is higher than 80% Pod memory usage is higher than 90% |
Critical Immediate Warning |
Consider modifying the affected Pod configurations to increase Memory limits |
Pod has high CPU Usage |
Pod CPU Usage is higher than 90% Pod CPU Usage is higher than 80% Pod CPU Usage is higher than 70% Pod CPU Usage is higher than 60% |
Critical Immediate Warning Info |
Go through the individual usage of the affected Pod's containers and balance their CPU limits. |
Pod has high Memory Usage |
Pod memory usage is higher than 90% Pod memory usage is higher than 80% Pod memory usage is higher than 70% Pod memory usage is higher than 60% |
Critical Immediate Warning Info |
Go through the individual usage of the affected Pod's containers and balance their Memory limits. |
Pod is not available |
Pod is not available |
Critical |
Redeploy the Pod and make sure it goes to Ready state |
ReplicaSet is not available |
ReplicaSet is not available |
Immediate |
Make sure that the Replica Set is present. |
Service is not available |
Service is not available |
Immediate |
Make sure that the Service is present. |
Sum of Resource Requests of Pods exceed Node Capacity |
CPU Requests greater than node capacity Memory Requests greater than node capacity |
Critical |
Minimum CPU/Memory resources required to run the Pods of the affected node has exceeded Node capacity. Consider increasing Node resources OR add more Nodes to distribute the workload. |