To upgrade Tanzu Kubernetes Grid (TKG), you download and install the new version of the Tanzu CLI on the machine that you use as the bootstrap machine. You must also download and install base image templates and VMs, depending on whether you are upgrading clusters that you previously deployed to vSphere, Amazon Web Services (AWS), or Azure.
In the TKG upgrade path, v2.4 immediately follows v2.3.
After you have installed the new versions of the components, you use the tanzu mc upgrade and tanzu cluster upgrade CLI commands to upgrade management clusters and workload clusters.
The following sections describe the overall steps required to upgrade Tanzu Kubernetes Grid. This procedure assumes that you are upgrading to Tanzu Kubernetes Grid v2.4.1.
Some steps are only required if you are performing a minor upgrade from Tanzu Kubernetes Grid v2.3.x to v2.4.x and are not required if you are performing a patch upgrade from Tanzu Kubernetes Grid v2.4.x to v2.4.y.
Important: Tanzu Kubernetes Grid v2.4.x is the last version of TKG that supports upgrading existing standalone TKG management clusters and TKG workload clusters on AWS and Azure. The ability to upgrade standalone TKG management clusters and TKG workload clusters on AWS and Azure will be removed in the Tanzu Kubernetes Grid v2.5 release.
Going forward, VMware recommends that you use Tanzu Mission Control to create native AWS EKS and Azure AKS clusters. However, upgrading existing standalone TKG management clusters and TKG workload clusters on AWS and Azure remains fully supported for all TKG releases up to and including TKG v2.4.x.
For more information, see Deprecation of TKG Management and Workload Clusters on AWS and Azure in the VMware Tanzu Kubernetes Grid v2.4 Release Notes.
Before you upgrade to TKG v2.4.x, ensure that your current deployment is TKG v2.3.x or an earlier v2.4 version. To upgrade to TKG v2.4.x from versions earlier than v2.3, you must first upgrade to v2.3.x by using a v2.3.x version of the Tanzu CLI.
If you are not sure which version of TKG is running in your management cluster, you can extract the version from the management cluster object.
Run the tanzu context use command to see an interactive list of management clusters available for upgrade.
tanzu context use
Select the management cluster that you want to check.
Get the admin credentials of the cluster. The Tanzu CLI alias mc is short for management-cluster.
tanzu mc kubeconfig get --admin
Connect kubectl
to the management cluster.
kubectl config use-context MGMT-CLUSTER-NAME-admin@MGMT-CLUSTER-NAME
You can copy the exact command, including the name of the management cluster context, from the output of the tanzu mc kubeconfig get --admin command that you ran in the preceding step.
Run the kubectl get cluster command to extract the TKG version.
kubectl get cluster MGMT-CLUSTER-NAME -n tkg-system -o yaml | grep TKGVERSION
The output shows the version of TKG running in your management cluster, for example:
TKGVERSION: v2.4.1
If you have used environment variables for cluster creation, you must unset some of them before starting the upgrade process. If these variables are set in the environment where you run the upgrade, the upgrade process will treat them as reconfiguration options and the upgrade might fail.
Run the following commands to unset the relevant variables:
unset VSPHERE_DATACENTER
unset VSPHERE_DATASTORE
unset VSPHERE_FOLDER
unset VSPHERE_NETWORK
unset VSPHERE_RESOURCE_POOL
This step is required for both major v2.3.x to v2.4.x and patch v2.4.x to v2.4.y upgrades.
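As an optional check that is not part of the required procedure, you can confirm that no vSphere configuration variables remain set in your shell; the following command prints any that are still set:
env | grep VSPHERE_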
To download and install the new version of the Tanzu CLI and plugins, perform the following steps.
Delete the ~/.config/tanzu/tkg/compatibility/tkg-compatibility.yaml file.
If you do not delete this file, the new version of the Tanzu CLI will continue to use the Bill of Materials (BOM) for the previous release. Deleting this file causes the Tanzu CLI to pull the updated BOM. You must perform this step both when upgrading from v2.3.x to v2.4.x and when upgrading from v2.4.x to v2.4.y.
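For example, on Linux or macOS you can delete the file by running:
rm ~/.config/tanzu/tkg/compatibility/tkg-compatibility.yaml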
Follow the instructions in Install the Tanzu CLI and Kubernetes CLI for Use with Standalone Management Clusters to download and install the Tanzu CLI, its plugins, and kubectl on the machine where you currently run your tanzu commands. If any of your standalone management clusters are configured to use an LDAP identity provider, perform the steps in (LDAP Only) Update LDAP Settings after installing the Tanzu CLI and before updating your CLI plugins to Tanzu Kubernetes Grid v2.4.
Run tanzu plugin install to install the current vmware-tkg plugin group.
Run tanzu version to check that the correct version of the Tanzu CLI is properly installed. For a list of CLI versions compatible with Tanzu Kubernetes Grid v2.4, see Product Interoperability Matrix.
After you install kubectl, run kubectl version to check that the correct version of kubectl is properly installed.
For information about Tanzu CLI commands and options that are available, see the Tanzu CLI Command Reference.
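A minimal sketch of these steps, assuming the vmware-tkg/default:v2.4.1 plugin group version string; verify the exact group version with tanzu plugin group search before running it:
tanzu plugin install --group vmware-tkg/default:v2.4.1
tanzu version
kubectl version --client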
Before you can upgrade management and workload clusters, you must perform preparatory steps depending on whether you deployed clusters on vSphere, AWS, or Azure. This step is required for both major v2.3.x to v2.4.x and patch v2.4.x to v2.4.y upgrades.
This procedure assumes that you are upgrading to Tanzu Kubernetes Grid v2.4.x.
Download the latest Tanzu Kubernetes Grid OVAs for the OS and Kubernetes version lines that your management and workload clusters are running.
For example, for Photon v3 images:
For Ubuntu 20.04 images:
Important: Make sure that you download the most recent OVA base image templates, in case security patches have been released. You can find updated base image templates that include security patches on the Tanzu Kubernetes Grid product download page.
Follow the installer prompts to deploy a VM from the OVA.
When the OVA deployment finishes, right-click the VM and select Template > Convert to Template.
In the VMs and Templates view, right-click the new template, select Add Permission, and assign your Tanzu Kubernetes Grid user, for example tkg-user, to the template with the Tanzu Kubernetes Grid role, for example TKG. You created this user and role in Prepare to Deploy Management Clusters to vSphere.
Repeat the procedure for each of the Kubernetes versions for which you have downloaded the OVA file.
VMware Cloud on AWS SDDC Compatibility
If you are upgrading workload clusters that are deployed on VMware Cloud on AWS, verify that the underlying Software-Defined Datacenter (SDDC) version used by your existing deployment is compatible with the version of Tanzu Kubernetes Grid you are upgrading to.
To view the version of an SDDC, select View Details on the SDDC tile in VMware Cloud Console and click on the Support pane.
To validate compatibility with Tanzu Kubernetes Grid, refer to the VMware Product Interoperability Matrix.
To ensure that your AWS account has the permissions required by the new version of Tanzu Kubernetes Grid, update the account's CloudFormation stack by running the tanzu mc permissions aws set command.
tanzu mc permissions aws set
This step is required for both major v2.3.x to v2.4.x and patch v2.4.x to v2.4.y upgrades. For more information about the AWS permissions that the command sets, see Required AWS Permissions.
Amazon Linux 2 Amazon Machine Images (AMI) that include the supported Kubernetes versions are publicly available to all AWS users, in all supported AWS regions. Tanzu Kubernetes Grid automatically uses the appropriate AMI for the Kubernetes version that you specify during upgrade.
On Azure, before you upgrade, accept the license terms for the base VM image that your upgraded cluster VMs will run. To accept the terms:
List all available VM images for Tanzu Kubernetes Grid in the Azure Marketplace:
az vm image list --publisher vmware-inc --offer tkg-capi --all
Accept the terms for the new default VM image:
az vm image terms accept --urn publisher:offer:sku:version
For example, to accept the terms for the default VM image in Tanzu Kubernetes Grid v2.4.1, k8s-1dot27dot5-ubuntu-2004, run:
az vm image terms accept --urn vmware-inc:tkg-capi:k8s-1dot27dot5-ubuntu-2004:2021.05.17
If you plan to upgrade any of your workload clusters to a non-default Kubernetes version, such as v1.26.8 or v1.25.13, accept the terms for each non-default version that you want to use for your cluster VMs.
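A hypothetical example for the non-default Kubernetes v1.26.8 image, assuming it follows the same URN pattern as the default image; confirm the exact URN values with the az vm image list command above before accepting:
az vm image terms accept --urn vmware-inc:tkg-capi:k8s-1dot26dot8-ubuntu-2004:2021.05.17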
This step is only required for TKG with a standalone management cluster. If you are running TKG with a vSphere with Tanzu Supervisor, you upgrade the Supervisor as part of vSphere and update the Supervisor’s Kubernetes version by upgrading its TKrs.
This step is required for both major v2.3.x to v2.4.x and patch v2.4.x to v2.4.y upgrades.
To upgrade Tanzu Kubernetes Grid, you must upgrade all management clusters in your deployment. You cannot upgrade workload clusters until you have upgraded the management clusters that manage them.
Follow the procedure in Upgrade Standalone Management Clusters to upgrade your management clusters.
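For example, a minimal upgrade run looks like the following; tanzu mc upgrade prompts for confirmation, which you can skip with the --yes flag:
tanzu mc upgrade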
This step is required for both major v2.3.x to v2.4.x and patch v2.4.x to v2.4.y upgrades.
Follow the procedure in Upgrade Workload Clusters to upgrade the workload clusters that are running your workloads.
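For example, a minimal sketch that upgrades a hypothetical workload cluster named my-cluster in the default namespace:
tanzu cluster upgrade my-cluster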
After you have upgraded your clusters, there are additional steps to perform to complete the upgrade process.
Some packages that are installed by default in the management cluster, for example cert-manager, can also be installed as CLI-managed packages in workload clusters and the shared services cluster. When the management cluster is upgraded to the latest Tanzu Kubernetes Grid release, its default packages are automatically updated.
You can run different versions of the CLI-managed packages in different workload clusters. In a workload cluster, you can run either the latest supported version of a CLI-managed package or the versions of the package that shipped with your last two previously installed versions of Tanzu Kubernetes Grid. For example, if the latest packaged version of cert-manager is v1.11.1 and your previous two Tanzu Kubernetes Grid installations ran cert-manager v1.10.1 and v1.7.2, then you can run cert-manager versions v1.11.1, v1.10.1, and v1.7.2 in workload clusters.
For any workload clusters that are running package versions from more than n-2 installed Tanzu Kubernetes Grid versions behind the package versions in the management cluster, you must update the package repository (see Update a Package Repository) and then upgrade the package in the workload clusters (see Update a Package). If you do not upgrade the package version, you will not be able to update the package configuration, because the package repository might not include versions of the package that are more than n-2 versions old.
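A hypothetical sketch of this sequence, assuming the tanzu-standard repository in the tkg-system namespace and the cert-manager versions from the example above; substitute the repository URL, package version, and namespaces from your environment:
tanzu package repository update tanzu-standard --url projects.registry.vmware.com/tkg/packages/standard/repo:v2.4.1 --namespace tkg-system
tanzu package installed update cert-manager --version 1.11.1+vmware.1-tkg.1 --namespace cert-manager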
Important: If you have installed Prometheus on a workload cluster and you upgrade the workload cluster to Kubernetes v1.25, you must upgrade Prometheus to at least version 2.37.0+vmware.3-tkg.1. Earlier versions of the Prometheus package, for example version 2.37.0+vmware.1-tkg.1, are not compatible with Kubernetes 1.25.
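A hypothetical example of the Prometheus upgrade, assuming the package is installed under the name prometheus in the my-packages namespace; adjust the installed package name and namespace for your cluster:
tanzu package installed update prometheus --version 2.37.0+vmware.3-tkg.1 --namespace my-packages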
Depending on whether your clusters are running on vSphere, AWS, or Azure, there are operations that you must perform after you have upgraded the clusters.
If NSX ALB was not enabled in your TKG v2.3 installation, see Install and Configure NSX Advanced Load Balancer for information on how to install NSX ALB.
If NSX ALB was enabled in your TKG v2.3 installation, see the Tanzu Kubernetes Grid v2.4 Release Notes for which Avi Controller versions are supported in this release, and if needed upgrade the Avi Controller to a compatible version. For how to upgrade the Avi Controller, see Flexible Upgrades for Avi Vantage.
TKG v2.2 and later automatically install the AWS EBS CSI driver on newly-created workload clusters, but to run AWS EBS CSI on clusters upgraded from v2.1, the driver must be installed manually. Follow this procedure to install the AWS EBS CSI driver manually on a cluster that was created in TKG v2.1 or earlier and has never had the AWS EBS CSI Driver installed.
Grant permissions for the AWS EBS CSI driver:
export AWS_REGION={YOUR_AWS_REGION}
tanzu mc permissions aws set
For each workload cluster that uses CSI storage:
Export the following environment variables and set the feature flag:
export _TKG_CLUSTER_FORCE_ROLE="management"
export FILTER_BY_ADDON_TYPE="csi/aws-ebs-csi-driver"
export NAMESPACE="tkg-system"
export DRY_RUN_MODE="legacy"
tanzu config set features.cluster.allow-legacy-cluster true
Set NAMESPACE to the cluster’s namespace, tkg-system in the example above.
Generate the CSI driver manifest:
tanzu cluster create ${TARGET_CLUSTER_NAME} --dry-run -f ~/MANAGEMENT_CLUSTER_CREATE_CONFIG.yaml > csi-driver-addon-manifest.yaml
Where TARGET_CLUSTER_NAME is the name of the cluster on which you are installing the CSI driver.
Update the namespace of the secret in the metadata in csi-driver-addon-manifest.yaml with the namespace of the workload cluster. Use the command kubectl get cluster -A to view the namespace of the cluster.
Apply the changes in the management cluster’s context:
kubectl apply -f csi-driver-addon-manifest.yaml
Unset the following environment variables and feature flag:
unset _TKG_CLUSTER_FORCE_ROLE
unset FILTER_BY_ADDON_TYPE
unset NAMESPACE
unset DRY_RUN_MODE
tanzu config set features.cluster.allow-legacy-cluster false
For a management cluster that uses CSI storage:
Export the following environment variables and set the feature flag:
export _TKG_CLUSTER_FORCE_ROLE="management"
export FILTER_BY_ADDON_TYPE="csi/aws-ebs-csi-driver"
export NAMESPACE="tkg-system"
export DRY_RUN_MODE="legacy"
tanzu config set features.cluster.allow-legacy-cluster true
Set NAMESPACE to the cluster’s namespace, tkg-system in the example above.
Generate the CSI driver manifest:
tanzu mc create ${MANAGEMENT_CLUSTER_NAME} --dry-run -f ~/MANAGEMENT_CLUSTER_CREATE_CONFIG.yaml > csi-driver-addon-manifest.yaml
Where MANAGEMENT_CLUSTER_NAME is the name of the management cluster.
Update the namespace of the secret in the metadata in csi-driver-addon-manifest.yaml with the namespace of the management cluster. Use the command kubectl get cluster -A to view the namespace of the cluster.
Apply the changes in the management cluster’s context:
kubectl apply -f csi-driver-addon-manifest.yaml
Unset the following environment variables and feature flag:
unset _TKG_CLUSTER_FORCE_ROLE
unset FILTER_BY_ADDON_TYPE
unset NAMESPACE
unset DRY_RUN_MODE
tanzu config set features.cluster.allow-legacy-cluster false
TKG v2.1 and later automatically install the Azure Disk CSI driver on newly-created workload clusters, but to run Azure Disk CSI on clusters upgraded from v1.6, the driver must be installed manually. Follow this procedure to install the Azure Disk CSI driver manually on a cluster that was created in TKG v1.6 or earlier and has never had the Azure Disk CSI Driver installed.
Export the following environment variables and set the feature flag:
export _TKG_CLUSTER_FORCE_ROLE="management"
export FILTER_BY_ADDON_TYPE="csi/azuredisk-csi-driver"
export NAMESPACE="tkg-system"
export DRY_RUN_MODE="legacy"
tanzu config set features.cluster.allow-legacy-cluster true
Set NAMESPACE to the cluster’s namespace, tkg-system in the example above.
For each workload cluster that uses CSI storage:
Generate the CSI driver manifest:
tanzu cluster create ${TARGET_CLUSTER_NAME} --dry-run -f ~/MANAGEMENT_CLUSTER_CREATE_CONFIG.yaml > csi-driver-addon-manifest.yaml
Where TARGET_CLUSTER_NAME is the name of the cluster on which you are installing the CSI driver.
Update the namespace of the secret in the metadata in csi-driver-addon-manifest.yaml with the namespace of the workload cluster. Use the command kubectl get cluster -A to view the namespace of the cluster.
Apply the changes in the management cluster’s context:
kubectl apply -f csi-driver-addon-manifest.yaml
Unset the following environment variables and feature flag:
unset _TKG_CLUSTER_FORCE_ROLE
unset FILTER_BY_ADDON_TYPE
unset NAMESPACE
unset DRY_RUN_MODE
tanzu config set features.cluster.allow-legacy-cluster false
For a management cluster that uses CSI storage:
Export the following environment variables and set the feature flag:
export _TKG_CLUSTER_FORCE_ROLE="management"
export FILTER_BY_ADDON_TYPE="csi/azuredisk-csi-driver"
export NAMESPACE="tkg-system"
export DRY_RUN_MODE="legacy"
tanzu config set features.cluster.allow-legacy-cluster true
Set NAMESPACE to the cluster’s namespace, tkg-system in the example above.
Generate the CSI driver manifest:
tanzu mc create ${MANAGEMENT_CLUSTER_NAME} --dry-run -f ~/MANAGEMENT_CLUSTER_CREATE_CONFIG.yaml > csi-driver-addon-manifest.yaml
Where MANAGEMENT_CLUSTER_NAME is the name of the management cluster.
Update the namespace of the secret in the metadata in csi-driver-addon-manifest.yaml with the namespace of the management cluster. Use the command kubectl get cluster -A to view the namespace of the cluster.
Apply the changes in the management cluster’s context:
kubectl apply -f csi-driver-addon-manifest.yaml
Unset the following environment variables and feature flag:
unset _TKG_CLUSTER_FORCE_ROLE
unset FILTER_BY_ADDON_TYPE
unset NAMESPACE
unset DRY_RUN_MODE
tanzu config set features.cluster.allow-legacy-cluster false
Install Azure File CSI Driver after Tanzu Kubernetes Grid Upgrade
If the Azure File CSI driver has never been installed on the cluster, follow this procedure to install it after upgrading your Tanzu Kubernetes Grid installation to v2.4 or later.
Export the following environment variables and set the feature flag:
export _TKG_CLUSTER_FORCE_ROLE="management"
export FILTER_BY_ADDON_TYPE="csi/azurefile-csi-driver"
export NAMESPACE="tkg-system"
export DRY_RUN_MODE="legacy"
tanzu config set features.cluster.allow-legacy-cluster true
Set NAMESPACE to the cluster’s namespace, tkg-system in the example above.
For each workload cluster that uses CSI storage:
Generate the CSI driver manifest:
tanzu cluster create ${TARGET_CLUSTER_NAME} --dry-run -f ~/MANAGEMENT_CLUSTER_CREATE_CONFIG.yaml > csi-driver-addon-manifest.yaml
Where TARGET_CLUSTER_NAME is the name of the cluster on which you are installing the CSI driver.
Update the namespace of the secret in the metadata in csi-driver-addon-manifest.yaml with the namespace of the workload cluster. Use the command kubectl get cluster -A to view the namespace of the cluster.
Apply the changes in the management cluster’s context:
kubectl apply -f csi-driver-addon-manifest.yaml
Unset the following environment variables and feature flag:
unset _TKG_CLUSTER_FORCE_ROLE
unset FILTER_BY_ADDON_TYPE
unset NAMESPACE
unset DRY_RUN_MODE
tanzu config set features.cluster.allow-legacy-cluster false
For a management cluster that uses CSI storage:
Export the following environment variables and set the feature flag:
export _TKG_CLUSTER_FORCE_ROLE="management"
export FILTER_BY_ADDON_TYPE="csi/azurefile-csi-driver"
export NAMESPACE="tkg-system"
export DRY_RUN_MODE="legacy"
tanzu config set features.cluster.allow-legacy-cluster true
Set NAMESPACE to the cluster’s namespace, tkg-system in the example above.
Generate the CSI driver manifest:
tanzu mc create ${MANAGEMENT_CLUSTER_NAME} --dry-run -f ~/MANAGEMENT_CLUSTER_CREATE_CONFIG.yaml > csi-driver-addon-manifest.yaml
Where MANAGEMENT_CLUSTER_NAME is the name of the management cluster.
Update the namespace of the secret in the metadata in csi-driver-addon-manifest.yaml with the namespace of the management cluster. Use the command kubectl get cluster -A to view the namespace of the cluster.
Apply the changes in the management cluster’s context:
kubectl apply -f csi-driver-addon-manifest.yaml
Unset the following environment variables and feature flag:
unset _TKG_CLUSTER_FORCE_ROLE
unset FILTER_BY_ADDON_TYPE
unset NAMESPACE
unset DRY_RUN_MODE
tanzu config set features.cluster.allow-legacy-cluster false
This step is required for both major v2.3.x to v2.4.x and patch v2.4.x to v2.4.y upgrades.
For information about how to upgrade Crash Recovery and Diagnostics, see Install or Upgrade the Crash Recovery and Diagnostics Binary.
Examine your upgraded management clusters or register them in Tanzu Mission Control. See Examine and Register a Newly-Deployed Standalone Management Cluster.