Use these monitoring standards to ensure server health.

Metrics to Capture

Hardware Monitors
CPU Usage
Memory Usage
Hard Disk Free Space
Network Usage

Alerts and Thresholds

VMware recommends analyzing each use case to determine the correct thresholds for each environment.

Hardware Alerts, Samples, Thresholds
CPU

Samples: 5-minute samples

Threshold: 90% over 1 hour, 95% over 1 hour

Alerts: 90% load is a warning, 95% is critical

Memory

Samples: 5-minute samples

Threshold: 90% over 1 hour, 95% over 1 hour

Alerts: 90% used is a warning, 95% used is critical

Hard Disk

Samples: 5-minute samples

Threshold: 90% over 1 hour, 95% over 1 hour

Alerts: 90% used is a warning, 95% used is critical

Network

Samples: 5-minute samples

Threshold: 90% over 1 hour, 95% over 1 hour

Alerts: 90% load is a warning, 95% is critical
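
The same sample, threshold, and alert pattern applies to all four hardware metrics, so it can be illustrated with one small Python sketch. This is not a VMware tool or API. The function name classify and the constants are hypothetical, and the sketch assumes that "over 1 hour" means every 5-minute sample in the most recent hour (12 samples) is at or above the threshold. If your environment interprets the threshold as an hourly average instead, replace the minimum with a mean.

```python
from typing import Sequence

# Values taken from the table above.
SAMPLE_INTERVAL_MINUTES = 5
WINDOW_MINUTES = 60
SAMPLES_PER_WINDOW = WINDOW_MINUTES // SAMPLE_INTERVAL_MINUTES  # 12 samples per hour

WARNING_PCT = 90.0
CRITICAL_PCT = 95.0


def classify(samples: Sequence[float]) -> str:
    """Classify utilization samples (percent used, oldest first).

    Assumption: an alert fires only when every sample in the most
    recent one-hour window is at or above the threshold.
    """
    if len(samples) < SAMPLES_PER_WINDOW:
        # Less than a full hour of data, so no alert can be raised yet.
        return "ok"

    window = samples[-SAMPLES_PER_WINDOW:]
    lowest = min(window)  # the whole window is at or above this value

    if lowest >= CRITICAL_PCT:
        return "critical"
    if lowest >= WARNING_PCT:
        return "warning"
    return "ok"
```

For the hard disk metric, which is captured as free space, convert to percent used (100 minus percent free) before passing samples to a check like this one.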

Strategy for Capture

The metrics are captured by the underlying virtual infrastructure, using tools such as vSphere or vRealize Operations.