vRealize Operations Manager Restore Process

A system administrator uses backups to restore vRealize Operations Manager to a functional state after a system failure.

You must start the vRealize Operations Manager components in a specified order after the restore is complete.

Common Restore Scenarios

The common restore scenarios for vRealize Operations Manager systems include a full restore of a single-node virtual appliance system and restore of multiple-node virtual appliance clusters.

Satisfy the following prerequisites for the vRealize Operations Manager cluster.

Power off the virtual machines in the multi-node cluster that you want to restore.
Before restoring to a different host, power off the virtual machines at the original location, and then bring up the environment on the new host. This order avoids hostname or IP address conflicts. Verify that the datastore on the new host has sufficient capacity for the new cluster.
Verify that all VMDK files have been assigned to the same datastore.
Back up the vRealize Operations Manager multi-node cluster.

Single-Node Virtual Appliance

This scenario restores a single-node system on the same host.

After the restore is complete, power on the VM.
Verify that you have set up a static IP address for the node and that the IP address is restored.
Log in to the node to verify that all your data is preserved and that all vmware-vcops services are running.
If your root password was reset, you must change it now.

For incremental backups, a user must create a cumulative differential policy. Every backup after a full backup is an incremental backup.

Note:

If you restore to another host, you must power off the environment at the original location and start the environment on the new host.

Multiple-Node Virtual Appliance Clusters

This scenario restores multiple-node virtual appliance clusters.

After the restore is complete, you must power on the nodes in the following order:
1. Primary node
2. Replica node
3. Witness node
4. Data nodes
5. Remote collectors
Ensure that each node is online before you power on the next node.
Verify that you have set up a static IP address for each node and that the IP addresses are restored.
Log in to the administrator interface of the Primary node and verify that all the nodes are online and running. Then log in to each node and verify that all your data is preserved and that all of the vmware-vcops services are running by using the service vmware-vcops status command.
Log in to the administrator interface of the Primary node.
1. Verify that high availability is activated. If the Activate High Availability button appears, high availability was deactivated during the backup and restore process. Activate high availability.
2. Verify that all nodes are collecting metrics.
If your root password was reset, you must change it now.
(Optional) If a node in a non-high availability cluster does not restart, you must take it offline with the administrator interface.
(Optional) If the replica node in a high availability cluster does not restart, you must assign a new Primary-replica node.
(Optional) You can run an incremental backup only if a full backup already exists. After an incremental backup, you can choose to restore to either the full or the incremental backup.
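
The power-on order in this procedure can be sketched as a small shell helper. This is an illustration only, not a vRealize Operations Manager tool. The function names role_rank and power_on_order, the role keywords, and the "name role" input format are assumptions made for the example.

```shell
# Sketch: sort "<node_name> <role>" lines into the documented power-on order.
# The input format and the role keywords below are hypothetical, chosen for this example.

# Map a role keyword to its position in the power-on order.
role_rank() {
  case "$1" in
    primary)          echo 1 ;;
    replica)          echo 2 ;;
    witness)          echo 3 ;;
    data)             echo 4 ;;
    remote-collector) echo 5 ;;
    *)                echo 9 ;;  # unknown roles are placed last
  esac
}

# Read node lines from standard input and print node names in power-on order.
# Nodes with the same role keep their input order (stable sort).
power_on_order() {
  while read -r name role; do
    [ -n "$name" ] || continue
    printf '%s %s\n' "$(role_rank "$role")" "$name"
  done | sort -s -n -k1,1 | cut -d' ' -f2-
}
```

For example, piping a hand-maintained inventory file with one "name role" pair per line into power_on_order prints the node names in the order to power them on. The helper only orders names. It does not power on anything or wait for a node to come online.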

Note:

If you restore to another host, you must power off the environment at the original location before starting the environment on the new host.

Verify the Restore of vRealize Operations Manager Systems

After the restore operation of the VM is finished, verify that vRealize Operations Manager is in a functional state.

Note:

Do not power on any vRealize Operations Manager nodes during the restore operation. Wait until the entire cluster restore has finished before you power on any node.

For a non-HA cluster, power on the Primary node followed by the data nodes. For an HA environment, power on the Primary node followed by the replica node, data nodes, and remote collectors.

Use SSH to log in to the vRealize Operations Manager Primary node to verify the vRealize Operations Manager service status.

Open an SSH session to the vRealize Operations Manager Primary node and run the service vmware-vcops status command.

# service vmware-vcops status
Slice Online-true
admin Role Enabled-true
        vRealize Operations vPostgres Replication Database is running (31810).
        vRealize Operations Gemfire Locator is running (31893).
data Role Enabled-true
        vRealize Operations vPostgres Database is running (32013).
        vRealize Operations Cassandra Distributed Database is running (21062).
        vRealize Operations Analytics is running (32142).
        vRealize Operations Collector is running (32225).
        vRealize Operations API is running (32331).
ui Role Enabled-true
remote collector Role Enabled-false

Confirm that the admin, data, and UI roles are running.
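
The manual check of the status output can be approximated with a short script. The sketch below assumes the output format shown in the sample above. The function name check_vcops_status is hypothetical, and the assumption that a stopped service line lacks the text "is running" is not confirmed by this document.

```shell
# Sketch: read `service vmware-vcops status` output on standard input and
# report whether the slice is online, the admin, data, and ui roles are
# enabled, and every listed service reports "is running".
# Assumption: a stopped service prints a line without the text "is running".
check_vcops_status() {
  local status role
  status="$(cat)"

  # The slice must be online.
  printf '%s\n' "$status" | grep -q '^Slice Online-true' \
    || { echo "slice not online"; return 1; }

  # The admin, data, and ui roles must be enabled on the Primary node.
  for role in admin data ui; do
    printf '%s\n' "$status" | grep -q "^${role} Role Enabled-true" \
      || { echo "role not enabled: $role"; return 1; }
  done

  # Every indented service line must contain "is running".
  if printf '%s\n' "$status" \
      | grep -E '^[[:space:]]+vRealize Operations ' \
      | grep -qv 'is running'; then
    echo "service not running"
    return 1
  fi

  echo "ok"
}
```

On the Primary node, the function could be used as: service vmware-vcops status | check_vcops_status. The role list is specific to the Primary node in the sample. Other node types enable different roles, so adjust the list before using the check elsewhere.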

Log in to the administration UI of the Primary node and verify that all nodes in the cluster are up and collecting data.
1. Open a browser and go to the administration UI of the Primary node: https://<Primary_Node_IP>/admin/login.action.
2. Log in as administrator.
3. Verify that each node is in the Online status.
4. Click each node and verify that adapter instances are in the Data receiving status.
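
Before opening a browser, you can check from a shell that the administration UI responds. The sketch below only builds the login URL given in step 1 and requests it with curl. The function names are hypothetical, and the -k option assumes that the appliance presents a self-signed certificate. It does not log in or read node status.

```shell
# Sketch: reachability check for the administration UI login page.

# Build the administration UI login URL for a Primary node address.
admin_login_url() {
  printf 'https://%s/admin/login.action\n' "$1"
}

# Print the HTTP status code returned by the login page.
# -k skips certificate verification (assumption: self-signed certificate).
# --max-time 10 stops the request after ten seconds.
admin_ui_http_code() {
  curl -k -s -o /dev/null -w '%{http_code}' --max-time 10 "$(admin_login_url "$1")"
}
```

For example, admin_ui_http_code 192.0.2.10 prints the status code for a Primary node at that placeholder address. A successful response only shows that the page is reachable. The node and adapter status still must be verified in the UI as described in the steps above.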
If the High Availability state shows "Activated, degraded" after the restore, one or more nodes are inaccessible. Power on the inaccessible nodes or restart the cluster.
1. Open a browser and go to the administration UI of the Primary node: https://<Primary_Node_IP>/admin/login.action.
2. Verify that all nodes are in the Running state and Online.
  1. If a node is not in the Running state and Online, power on the node and bring it online.
  2. If all nodes are in the Running state and Online but the High Availability state is still "Activated, degraded", restart the cluster.
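
The decision in the steps above can be summarized as a small shell sketch. It does not query the product. The function name ha_recovery_actions, the action labels it prints, and the "name state status" input format with single-word values are assumptions made for the example. You would transcribe the node states from the administration UI by hand.

```shell
# Sketch: decide the recovery action for a degraded HA cluster.
# Argument 1: the High Availability state as shown in the administration UI.
# Standard input: one "<node_name> <state> <status>" line per node
# (hypothetical format, for example "data-1 Running Online").
ha_recovery_actions() {
  local ha_state="$1" name state status pending=0

  while read -r name state status; do
    [ -n "$name" ] || continue
    # Any node that is not both Running and Online must be handled first.
    if [ "$state" != "Running" ] || [ "$status" != "Online" ]; then
      echo "power-on-and-bring-online $name"
      pending=1
    fi
  done

  # Restart the cluster only when every node is healthy but HA is still degraded.
  if [ "$pending" -eq 0 ]; then
    if [ "$ha_state" = "Activated, degraded" ]; then
      echo "restart-cluster"
    else
      echo "no-action"
    fi
  fi
}
```

The sketch mirrors the order of the procedure: individual nodes are brought online first, and a cluster restart is suggested only when no node remains to be fixed.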