NSX Manager 集中式 CLI 中,您可以检查每个 ESXi 主机的运行状况。

运行状况将报告为严重、不正常或正常。还可以通过 NSX Manager API 调用 host health-check 命令。

例如:

nsxmgr> show host host-30 health-status
status: HEALTHY

nsxmgr> show host host-29 health-status
UNHEALTHY, Standard Switch vSwitch1 has no uplinks.

UNHEALTHY, Host host-29 mount point /vmfs/volumes/58e2c6d5-83319ac1-7830-005056014d86
volume "local_esx-06a" free percentage less than 20%: 0%.
Status: UNHEALTHY

nsxmgr> show host host-28 health-status
CRITICAL, VXLAN VDS vds-site-a VNI 200000 multicast addr is not synchronized with VSM: 0.0.0.0.
CRITICAL, VXLAN VDS vds-site-a VNI 200003 multicast addr is not synchronized with VSM: 0.0.0.0.
CRITICAL, VXLAN VDS vds-site-a VNI 5000 multicast addr is not synchronized with VSM: 0.0.0.0.
Status: CRITICAL

CRITICAL, Control plane agent on host is not in established state with controller. Please
investigate host connection issue with esxcli network ip connection
command.
CRITICAL, VXLAN VDS RegionA01-vDS-MGMT VNI 5007 connection to controller is down: 0.0.0.0
(down).

您可以使用 show host hostID health-status detail 命令针对每个参数查看指定主机的详细运行状况。

运行状况检查分为以下不同的类别:

  1. 部署运行状况检查,包括 NSX Data Center for vSphere VIB 和主机代理的部署。请注意,可以在运行时使用其中的某些检查,例如,netcpa 和防火墙代理检查。
    The host associated cluster check [DONE]
    The NSX component installation check [DONE]
    The NSX module VXLAN installation check [DONE]
    The NSX module DLR installation check [DONE]
    The NSX VIB installation check [DONE]
    The control plane agent check [DONE]
    The firewall agent check [DONE]
    The NSX configuration file check [DONE]
    The host connection to controller check [DONE]
  2. 网络连接运行状况。
    The Distributed Switch uplink check [DONE]
    The Standard Switch uplink check [DONE]
  3. CPU、内存和存储利用率运行状况。
    The storage volume check [DONE]
    UNHEALTHY, Host host-34 mount point /vmfs/volumes/58e2c69b-38d4fa25-b33a-00505601dfe2
    volume "local_esx-05a" free percentage less than 20%: 0%.
    The Memory overcommit check [DONE]
    The physical CPU load check [DONE]
    The free memory usage check [DONE]
  4. 逻辑交换机运行状况。

    The VXLAN VDS existence check [DONE]
    The VXLAN VDS MTU check [DONE]
    The VXLAN gateway IP/MAC check [DONE]
    The VXLAN vmknic check [DONE]
    The VXLAN uplinks check [DONE]
    The VXLAN configuration check [DONE]
    The VXLAN vmknic IP address check [DONE]
    The VXLAN Overlay settings check [DONE]
    The VXLAN vni settings check [DONE]
    The host vmnic check [DONE]
    The VXLAN vmknic check [DONE]

下面是有关输出的几个示例:

nsx-mgr> show host host-33 health-status detail
The host associated cluster check [DONE]
The NSX component installation check [DONE]
The NSX module VXLAN installation check [DONE]
The NSX module DLR installation check [DONE]
The NSX VIB installation check [DONE]
The control plane agent check [DONE]
The firewall agent check [DONE]
The NSX configuration file check [DONE]
The host connection to controller check [DONE]
The Distributed Switch uplink check [DONE]
The Standard Switch uplink check [DONE]
The storage volume check [DONE]
UNHEALTHY, Host host-34 mount point /vmfs/volumes/58e2c69b-38d4fa25-b33a-00505601dfe2
volume "local_esx-05a" free percentage less than 20%: 0%.
The Memory overcommit check [DONE]
The physical CPU load check [DONE]
The free memory usage check [DONE]
The VXLAN VDS existence check [DONE]
The VXLAN VDS MTU check [DONE]
The VXLAN gateway IP/MAC check [DONE]
The VXLAN vmknic check [DONE]
The VXLAN uplinks check [DONE]
The VXLAN configuration check [DONE]
The VXLAN vmknic IP address check [DONE]
The VXLAN Overlay settings check [DONE]
The VXLAN vni settings check [DONE]
The host vmnic check [DONE]
The VXLAN vmknic check [DONE]
Status: UNHEALTHY

nsx-mgr> show host host-33 health-status detail
The host associated cluster check [DONE]
The NSX component installation check [DONE]
The NSX module VXLAN installation check [DONE]
The NSX module DLR installation check [DONE]
The NSX VIB installation check [DONE]
The control plane agent check [DONE]
The firewall agent check [DONE]
The NSX configuration file check [DONE]
The host connection to controller check [DONE]
CRITICAL, Control plane agent on host is not in established state with controller. Please
investigate host connection issue with esxcli network ip connection
command.
The Distributed Switch uplink check [DONE]
The Standard Switch uplink check [DONE]
The storage volume check [DONE]
The Memory overcommit check [DONE]
The physical CPU load check [DONE]
The free memory usage check [DONE]
The VXLAN VDS existence check [DONE]
The VXLAN VDS MTU check [DONE]
The VXLAN gateway IP/MAC check [DONE]
The VXLAN vmknic check [DONE]
The VXLAN uplinks check [DONE]
The VXLAN configuration check [DONE]
The VXLAN vmknic IP address check [DONE]
The VXLAN Overlay settings check [DONE]
The VXLAN vni settings check [DONE]
CRITICAL, VXLAN VDS RegionA01-vDS-MGMT VNI 5007 connection to controller is down: 0.0.0.0
(down).
The host vmnic check [DONE]
The VXLAN vmknic check [DONE]
Status: CRITICAL
表 1.
运行状况检查 消息 说明
与主机关联的群集检查

Host {hostId} doesn't belong to a cluster.

此运行状况检查可发现特定的主机(主机 ID)是否与群集正确关联。

NSX 组件安装检查

Host {hostId} has no NSX components installed.

此运行状况检查可发现是否已为主机(主机 ID)正确安装所有 NSX Data Center for vSphere 组件。

NSX 模块 VXLAN 安装检查 The NSX module VXLAN is not installed/loaded/enabled on host {hostId}.

此运行状况检查可发现是否已在主机(主机 ID)上正确配置 VXLAN。

NSX 模块 DLR 安装检查 The NSX module for distributed routing is not installed, loaded, or enabled on the host.

此运行状况检查可发现是否已为主机(主机 ID)正确安装分布式逻辑路由器 (DLR)。

NSX VIB 安装检查 The NSX VIB installation check has failed on host.

此运行状况检查可发现已安装的 NSX Data Center for vSphere VIB 是否适用于新的 ESXi 版本。

NSX Data Center for vSphere 数据层面与 ESXi 主机版本不兼容。可通过确认主机上安装的 esx-vsip VIB 与 NSX Data Center for vSphere 版本相匹配,来进一步执行调查。

控制层面代理检查

Service netcpad is not running on host {hostId}.

此运行状况检查可发现控制层面代理是否正在与 NSX Manager 通信。

有关故障排除信息,请参阅控制层面代理 (netcpa) 问题

防火墙代理检查 The NSX firewall agent is running or not on the host.

此运行状况检查可发现防火墙代理是否正在主机(主机 ID)上运行。

有关故障排除信息,请参阅对分布式防火墙进行故障排除

NSX 配置文件检查 The NSX configuration file /etc/vmware/netcpa/config-by-vsm.xml is not synchronized on host {hostId}.

此运行状况检查可发现配置文件 /etc/vmware/netcpa/config-by-vsm.xml 是否与主机(主机 ID)同步。

主机与控制器的连接检查 Control plane agent on host is not in established state with controller.

此运行状况检查可发现控制层面代理是否正在与主机(主机 ID)通信。

有关故障排除信息,请参阅控制层面代理 (netcpa) 问题

分布式交换机上行链路检查 Distributed Switch {dvsName} has no uplinks. 有关故障排除信息,请参阅《vSphere 网络连接》指南。
标准交换机上行链路检查 Standard Switch {vswitchName} has no uplinks. 有关故障排除信息,请参阅《vSphere 网络连接》指南。
存储卷检查 Host "+hostId+" mount point "+mountPoint+" volume \""+volumeName+"\" free percentage less than 20%: "+freePercent+"%.\n.
内存过量分配检查

Memory overcommit info from esxtop can't be parsed.Host {hostId} memory is over commited.

物理 CPU 负载检查

Host {hostId} Physical CPU Load (5 Minute Avg) info from esxtop can't be parsed.Host {hostId} Physical CPU Load (5 Minute Avg) is too high.

可用内存使用情况检查

Host {hostId} mount point {mountPoint} volume {volumeName} free percentage less than 20%.

VXLAN VDS 存在性检查

No VXLAN VDS is configured on host {hostId}.

有关故障排除信息,请参阅《vSphere 网络连接》指南。
VXLAN VDS MTU 检查

VXLAN VDS {vxlanVds} MTU is less than 1550: {vxlanVdsMtu}.Please check MTU value of connected VMs is not greater than { vxlanVdsMtu-50}.

VXLAN 网关 IP/MAC 检查 Gateway IP/mac address of VXLAN VDS {vxlanVds} is not resolved.
VXLAN vmknic 检查 VXLAN VDS {vxlanVds} has no vmknic. 收集 vmkernel.log,然后联系 VMware 技术支持部门。
VXLAN 上行链路检查 VXLAN VDS {vxlanVds} has no uplinks.
VXLAN 配置检查 Configuration of VXLAN VDS {+vxlanVds} of host {hostId} is not synchronized.Configuration of VXLAN VDS {vxlanVds} is not defined on NSX Manager.VXLAN VDS {vxlanVds} vmknic and uplink counts are not equal for teaming policy {vdsContext.getTeamingPolicy }. 将 VXLAN 端口组绑定策略重新设置为原始值。
VXLAN vmknic IP 地址检查 VXLAN VDS {vxlanVds} {vmknic}+{vmknic}+{ip address} is not resolved. VXLAN VDS {vxlanVds vmknic} +{vmknic}{ ip address} is not resolved. 此检查指示 VTEP 不具备有效的 IP 地址,或者丢失了有效的 IP 地址。
VXLAN 覆盖设置检查 VXLAN VDS {vxlanVds} overlay settings are not VXLAN: {overlay}{ VXLAN VDS }{vxlanVds} overlay settings are not found.
VXLAN vni 设置检查 VXLAN VDS {vxlanVds+ " VNI "+vni+}connection to controller is down VXLAN VDS {vxlanVds+ " VNI "+vni+"} segment info is not configured in NSX Manager. VXLAN VDS {"+vxlanVds+" VNI "+vni+" multicast addr "+multicastIp+"} is not synchronized with NSX Manager.