A storage device is considered to be in the all paths down (APD) state when it becomes unavailable to your ESXi host for an unspecified period of time.

The reasons for an APD state can be, for example, a failed switch or a disconnected storage cable.

In contrast with the permanent device loss (PDL) state, the host treats the APD state as transient and expects the device to be available again.

The host indefinitely continues to retry issued commands in an attempt to reestablish connectivity with the device. If the host's commands fail the retries for a prolonged period of time, the host and its virtual machines might be at risk of having performance problems and potentially becoming unresponsive.

To avoid these problems, your host uses a default APD handling feature. When a device enters the APD state, the system immediately turns on a timer and allows your host to continue retrying nonvirtual machine commands for a limited time period.

By default, the APD timeout is set to 140 seconds, which is typically longer than most devices need to recover from a connection loss. If the device becomes available within this time, the host and its virtual machine continue to run without experiencing any problems.

If the device does not recover and the timeout ends, the host stops its attempts at retries and terminates any nonvirtual machine I/O. Virtual machine I/O will continue retrying. The vSphere Web Client displays the following information for the device with the expired APD timeout:

  • The operational state of the device changes to Dead or Error.

  • All paths are shown as Dead.

  • Datastores on the device are dimmed.

Even though the device and datastores are unavailable, virtual machines remain responsive. You can power off the virtual machines or migrate them to a different datastore or host.

If later one or more device paths becomes operational, subsequent I/O to the device is issued normally and all special APD treatment ends.