Use the inbuilt problem and alert signatures in vRealize Log Insight for storage monitoring.

About this task

For monitoring storage in the Software-Defined Data Center, you can use the following alerts in vRealize Log Insight:

Table 1. Storage Alerts in vRealize Log Insight

Alert Name

Purpose

Severity

*** CRITICAL *** Storage: All Paths Down (APD)

One or more datastores has experienced an All Paths Down (APD) outage situation. This indicates that one or more datastores is or was unavailable. As a result of this issue, VMs are or were unavailable and ESX/ESXi hosts may have been disconnected from vCenter Server. This issue requires immediate attention.

Critical

*** CRITICAL *** Storage: VSAN device offline

A Virtual SAN storage device that backs up the datastores might fail.

This occurs due to a faulty device firmware, physical media, or storage controller or when certain storage devices are not readable or writeable.

Typically, such failures are irreversible. In some instances, permanent data loss might also occur, especially when data is not replicated on other nodes before failure. Virtual SAN automatically recovers data when new devices are added to the storage cluster, unless data lost is permanent.

Critical

Storage: NFS connectivity issue

The purpose of this alert is to notify when an NFS connectivity issue was detected. This means an NFS datastore is or was unavailable. Do to this issue, one or more VMs may be unavailable.

Critical

Storage: NFS lock file issue

The purpose of this alert is to notify when an NFS lock file issue has been detected. Stale NFS lock files can prevent VMs from powering on.

Storage SCSI Path dead

The purpose of this alert is to notify when a SCSI path has become unavailable. Assuming multiple paths are in use and the other paths are online this means reduced redundancy and performance. If all paths to a storage device become unavailable then VMs running on the storage device will become unavailable.

Critical

Storage: Snapshot consolidation required

The purpose of this alert is to notify when a snapshot consolidation is required. A failed snapshot consolidation operation that is not manually addressed can lead to a full datastore.

Critical

Procedure

  1. Open the vRealize Log Insight user interface.
    1. Open a Web browser and go to the following URL.

      Region

      vRealize Log Insight URL

      Region A

      https://vrli-cluster-01.sfo01.rainpole.local

      Region B

      https://vrli-cluster-51.lax01.rainpole.local

    2. Log in using the following credentials.

      Setting

      Value

      User name

      admin

      Password

      vrli_admin_password

  2. In the vRealize Log Insight user interface, click Interactive Analytics.
  3. Click the icon and select Manage Alerts.
  4. Select the alerts that are storage related.
    1. In the search box of the Alerts dialog box, enter storage as a search phrase.
    2. Select the following alerts from the results.

      Alert

      *** CRITICAL *** Storage: All Paths Down (APD)

      *** CRITICAL *** Storage: VSAN device offline

      Storage: NFS connectivity issue

      Storage: NFS lock file issue

      Storage SCSI Path dead

      Storage: Snapshot consolidation required





  5. Enable the alerts.
    1. In the Alerts dialog box, click Enable.
    2. In the Enable Alerts dialog box, configure the following alert settings and click Enable.

      Setting

      Region A

      Region B

      Email

      Email address to send alerts to

      Email address to send alerts to

      Send to vRealize Operations Manager

      Selected

      Selected

      Fallback Object

      SFO01

      LAX01

      Criticality

      critical

      critical





  6. In the Alerts dialog box, set the Raise an alert option for each enabled alert.
    1. Click the Edit button on the first enabled Storage Resources alert.



    2. In the Edit Alert dialog box, under Raise an alert, select On any match, and click Save.




    3. Repeat the steps for the other enabled alerts.
    4. Close the Alerts dialog box.
  7. Repeat the steps on https://vrli-cluster-51.lax01.rainpole.local to enable alerts for the LAX01 data center.