You can replace the capacity drive in a host when you see an Operation status is down for storage device alert. The alert description says SSD_DOWN_ALERT or HDD_DOWN_ALERT.

Procedure

  1. Expand the alert and note the rack number, host name, and disk type displayed in the Description field.
  2. On the SDDC Manager Dashboard, click View Details in the Physical Resources section.
  3. Click the affected rack and then click the host name.

    The Host Details page displays host details.

  4. If the host does not belong to a workload domain (the Workload Domain field is blank), pull the disk out of the host and replace it with a new disk. For details, refer to the vendor documentation.
  5. If the host is part of a workload domain (the Workload Domain field displays the domain name), follow the steps below.
    1. Note the ESX Cluster name.

    2. In vCenter Server, click the vCenter link.

    3. Navigate to the ESX Cluster name you had noted earlier.

    4. In the Manage tab, select vSAN > General.

    5. In the vSAN is Turned On field, click Edit.

    6. In Add disks to Storage, select Manual and click OK.

    7. Select the host with the failed disk, click the Manage tab and select Disk Management in the vSAN section.

    8. Select the disk group with the failed capacity drive.

    9. Select the failed capacity drive and click Remove selected disk(s) from disk group.

    10. Wait for the disk to be deleted and then remove the disk from the host.

      Note:

      For a cache drive, the corresponding disk group is also deleted.

    11. Add the new disk to the host and wait for vCenter Server to detect it.

      If vCenter Server is unable to detect the drive, confirm that the disk is seated properly in the slot and perform a device re-scan.

    12. Select the host with the newly replaced disk, click the Manage tab and select Disk management in the vSAN section.

    13. For a cache drive, re-create the disk group.

      1. In vCenter, select the host with the replaced cache drive.

      2. In the Manage tab, select vSAN > Disk Management and select the host that had the drive replaced.

      3. Click Create a new disk group.

      4. Select a flash device under cache tier and select 4 HDD/SSD devices under Capacity tier.

      5. Click OK.

      6. Wait for the task to complete and then verify that the new disk group was created for the host.

    14. In the vSAN is Turned On field, click Edit.

    15. In Add disks to Storage, select Automatic and click OK.

    16. Log out of all open systems.

  6. If the host you just replaced contains all flash storage, mark the disk as capacity.
    1. SSH to the ESXi host and run the esxcli storage core device list command. Locate the diskID of the newly added SSD.

    2. Run the esxcli vsan storage tag add -d diskID -t capacityFlash command.

    3. SSH to the SDDC Manager Controller VM and copy the /opt/vmware/scripts/capacityflash.py script to the host on which you replaced the SSD.

    4. Run the capacityflash.py script on the host.

Results

The drive on the host is successfully replaced.