You can use Harbor as a Supervisor Service as a local registry for container images from the NVIDIA NGC catalog or for storing validated ML models.
If you want to use the Harbor registry integration with Supervisor, you can follow these setup approaches:
- Disconnected environment
-
Note: The installation of the Harbor service in the Supervisor requires an Internet connection.
- Use a Harbor registry only in the Supervisor in the GPU-enabled workload domain. Perform the following tasks:
- Enable Harbor as a Supervisor Service.
- For VMware Cloud Foundation 5.2.1, log in to the vCenter Server instance for the management domain at https://<vcenter_server_fqdn>/ui.
- In the vSphere Client side panel, click Private AI Foundation.
- In the Private AI Foundation workflow, select Cloud disconnected at the top and click the Set Up a Workload Domain section.
- Install the Supervisor Services for Contour and Harbor. See Enable Harbor as a Supervisor Service. The wizards for installing the Contour and Harbor Supervisor Services in the guided deployment workflow are the same as in the Workload Management area of the vSphere Client.
- Upload AI Container Images to a Private Harbor Registry in VMware Private AI Foundation with NVIDIA
In a disconnected environment, you can disconnect the environment from the Internet and start using the Harbor service as a local container registry after you install the service or after you install it and download the initial set of required container images. In this approach, you must manually download container images from the NVIDIA NGC catalog to a machine in the environment and then upload them to the registry.
- Enable Harbor as a Supervisor Service.
- Use a Harbor registry that is as a replica of an Internet-connected Harbor registry.
One Harbor registry, running outside the VMware Private AI Foundation with NVIDIA environment, is always connected to the Internet. The Harbor registry in the Supervisor for the GPU-enabled workload domain receives container images from the connected one using a proxy mechanism. In this way, the main components of the VMware Cloud Foundation instance remain isolated.
In this approach, additional resources are required for the connected registry.
Note: Allocate enough storage space for hosting the NVIDIA NGC containers you plan to deploy on a deep learning VM or on a TKG cluster. Accommodate at least three versions of each container in the storage space.If connecting to the Internet while installing the Harbor service or setting up a connected Harbor registry is not an option for your organization, use a container registry by another vendor.
- Use a Harbor registry only in the Supervisor in the GPU-enabled workload domain. Perform the following tasks:
- Storing ML models
- Use a Harbor registry in the Supervisor in the GPU-enabled workload domain. Perform the following tasks:
Upload AI Container Images to a Private Harbor Registry in VMware Private AI Foundation with NVIDIA
In a disconnected environment, where you use a Harbor registry only on the AI-ready Supervisor, you must manually upload the AI container images that you intend to deploy on a deep learning VM or a TKG cluster from the NVIDIA NGC catalog to Harbor.
Procedure
Create a Harbor Registry in VMware Private AI Foundation with NVIDIA as a Replica of a Connected Registry
To be able to update easily to the latest images in the NVIDIA NGC catalog, you can use a Harbor registry in a Supervisor that is in another VI workload domain or VMware Cloud Foundation instance and can be connected to Internet. You then replicate this connected registry on the Supervisor where you plan to run AI workloads.
You pull the latest container images from NVIDIA NGC to the connected Harbor registry and transfer them to the disconnected one by using a proxy-cached connection. In this way, you do not have to download container images and then upload them manually on a frequent basis.
You set up the network between the two registries in the following way:
- The connected registry is routable to the replica registry.
- The connected registry is placed in a DMZ where only docker push and
docker pull
communication is allowed between the two registries.
Prerequisites
Procedure
What to do next
- Pull the container images that are required by your organization from NVIDIA NGC to the connected registry by running docker pull on the Docker client machine.
- If the replication rule has manual trigger mode, run manually replications as needed.