Some controllers in an NSX Controller cluster report incorrect status for one of the controllers.

Problem

After a controller is powered off and on a number of times, the other controllers report that it is inactive when it is up and running.

Cause

An internal error involving the ZooKeeper module sometimes occurs when a controller is powered off and on and causes a communication failure between this controller and the other controllers in the cluster.

Solution

  1. Remove the controller node that is reported to be inactive from the cluster, remove the cluster configuration from the node and rejoin the node to the cluster. For more information, see the section "Replace a Member of the NSX Controller Cluster" in the NSX-T Administration Guide.