NSX Advanced Load Balancer’s behaviour after a failure in a GSLB deployment depends on whether the failure is
At a leader site or one of the followers
Of the entire site or just the NSX Advanced Load Balancer Controller
Follower Site Failures
In our follower-site failure examples, we focus on infrastructure deployed in Santa Clara, Chicago, and New York.
- Full-Site Failure
-
A full-site failure occurs at the NY-1 follower site, as shown below.
The leader in Santa Clara and Chicago are active sites and therefore detect the failure.
Administrative changes to the GSLB configuration continue to be possible on the leader, but they will not make it to the NY-1 site.
Both control-plane and data-plane health monitors will mark NY-1's GS members Down. Refer toNSX Advanced Load Balancer GSLB Service and Health Monitors for more details.
DNS service for the GSLB configuration remains operational at the two surviving sites.
Global application service will continue on the surviving sites (Santa Clara, Chicago, and NY-2).
- Partial-site Failure
-
If only the NSX Advanced Load Balancer Controller at the NY-1 site fails, the SEs continue to serve applications in headless mode.
The leader and Chicago Controllers detect the failure using their control-plane monitors.
Any administrative changes made on the leader do not propagate to the NY-1 site.
Data-plane health monitors running in Santa Clara and Chicago continue to perceive NY-1's members as Up.
DNS service for the GSLB configuration remains operational at all three sites (because it comes from SEs, none of which have failed).
Global application service continues on all four sites (Santa Clara, Chicago, NY-1, and NY-2).
- Follower Site Recovery
-
The following holds true for either full-site or partial-site failures.
The leader Controller in Santa Clara detects connectivity to the (newly rebooted) follower Controller at NY-1. The latest GSLB configuration is pushed to it.
Other active sites likewise detect successful connectivity to the NY-1 follower Controller as a result of their control-plane health monitors.
If the data-plane never went down (partial-site failure), no more action is required.
If data-plane monitors for NY-1's GS members had been configured and previously marked NY-1's GS members as Down, NY-1's members will be marked Up and traffic to them will resume only after those data-plane monitors once again perceive good health.
Leader Site Failures
- Full-Site Failure
-
A full-site failure occurs at the Santa Clara leader site, as shown below.
As they are active sites, both Chicago and NY-1 detect the failure.
No administrative changes to the GSLB configuration can be made.
Both control-plane and data-plane health monitors mark Santa Clara's GS members as Down.
DNS service for the GSLB configuration remains operational at the two surviving active sites (Chicago and NY-1).
Global application service continues on the three surviving sites (Chicago, NY-1, and NY-2).
- Partial-Site Failure
-
If only the NSX Advanced Load Balancer Controller at the Santa Clara site fails, the site’s SEs continue to serve applications in headless mode.
As they are active sites, both Chicago and NY detect the Controller failure using their control-plane health monitors.
No administrative changes to the GSLB configuration can be made.
Data-plane health monitors running in Chicago and NY continue to perceive Santa Clara's members as Up.
DNS service for the GSLB configuration remains operational at Santa Clara, Chicago, and NY-1.
Global application service will continue on all sites.
Site Configuration Errors
Errors in site configuration related to IP address and credentials show up when the site information is saved. Some sample error screens are as follows:
Authentication Failure

Max Retry Login Failure

Redress
When defining a new GSLB configuration or adding a GSLB site to an existing configuration, one specifies account credentials to be associated with the site. It is a best practice to define the same GSLB administrative account (e.g., gslbadmin) for all participating GSLB sites. By associating with that account the No-Lockout-User-Account-Profile as shown below, one can eliminate max retry login failures.

HTTP 400 Error
