When your workloads are not allocated enough resources in a cluster, you must identify the hosts that are not fully operational to keep the workloads working according to business requirements. You examine the Cumulative Up-time of all Clusters widget in the Operations Overview dashboard, identify a disconnected host and take an action to bring the disconnected host back online.

About this task

Use the predefined dashboards of vRealize Operations Manager to troubleshoot capacity issues in a data center and resolve them using generated recommendations.

For this scenario for an SDDC that implements VMware Validated Design Intelligent Operations, assume that a sfo01-w01dc data center is not operating at its full capacity because a host, for example sfo01m01esx03.sfo01.rainpole.local, is disconnected. The sfo01-w01dc data center runs tenant workloads that are provisioned directly on virtual infrastructure or by using some cloud management platform.

Procedure

  1. Log in to vRealize Operations Manager by using the operations interface.
    1. Open a Web browser and go to https://vrops01svr01.rainpole.local.
    2. Log in using the following credentials.

      Setting

      Value

      User name

      admin

      Password

      vrops_admin_password

  2. On the main navigation bar, click Dashboards.
  3. Select All Dashboards > Getting Started.
  4. On the Getting Started page, click Operations Overview under Operations.
  5. On the Operations Overview dashboard, select the sfo01-w01dc data center in the Select a Datacenter (DC) widget.
  6. Review the Cumulative Up-time of all Clusters (in selected DC) widget.

    The data center availability is not 100%.

  7. Locate the problematic data center object sfo01-w01dc by entering its name in the Search box on the navigation bar of vRealize Operations Manager and pressing Enter.

    The Summary tab for the sf01-w01dc object opens in the operations user interface.

  8. Knowing that the cluster availability is not 100% and an issue with one or more of the hosts in the problematic data center exists, in the Recommended Actions widget scroll through the tabs and go directly to Host System.
  9. In the Recommended Actions widget, select Alert Type from the All Filters from the drop-down menu, select Virtualization/Hypervisor in the Select Alert Type drop-down dialog box, and click OK.
  10. Locate the Host has lost connection to vCenter Server alert that is raised for the problematic host.
  11. Select the alert to view and apply the Recommendations for the disconnected host.
  12. Verify that the cluster availability is running at 100%.
    1. Wait for the appropriate number of collection cycles for the data to be updated in vRealize Operations Manager.
    2. Select All Dashboards > Getting Started > Operations Overview.
    3. On the Operations Overview dashboard, select the sfo01-w01dc data center in the Select a Datacenter (DC) widget.
    4. Verify that the Cumulative Up-time of all Clusters (in selected DC) widget shows that the data center is operating at its full capacity.