Setting Up a Private Harbor Registry in VMware Private AI Foundation with NVIDIA

You can use Harbor as a local registry for container images from the NVIDIA NGC catalog.

Note: The installation of the Harbor service in the Supervisor requires an Internet connection.

If you want to use the Harbor registry integration with Supervisor, you can follow these setup approaches:

Use a Harbor registry only in the Supervisor in the GPU-enabled workload domain. Perform the following tasks:
1. Enable Harbor as a Supervisor Service.
2. Upload AI Container Images to a Private Harbor Registry in VMware Private AI Foundation with NVIDIA
You can disconnect your environment from the Internet and start using the Harbor service as a local container registry after you install the service or after you install it and download the initial set of required container images.
In this approach, you must manually download container images from the NVIDIA NGC catalog to a machine in the environment and then upload them to the registry.
Use a Harbor registry that is as a replica of an Internet-connected Harbor registry.
One Harbor registry, running outside the VMware Private AI Foundation with NVIDIA environment, is always connected to the Internet. The Harbor registry in the Supervisor for the GPU-enabled workload domain receives container images from the connected one using a proxy mechanism. In this way, the main components of the VMware Cloud Foundation instance remain isolated.
In this approach, additional resources are required for the connected registry.

Note: Allocate enough storage space for hosting the NVIDIA NGC containers you plan to deploy on a deep learning VM or on a TKG cluster. Accommodate at least three versions of each container in the storage space.

If connecting to the Internet while installing the Harbor service or setting up a connected Harbor registry is not an option for your organization, use a container registry by another vendor.

Upload AI Container Images to a Private Harbor Registry in VMware Private AI Foundation with NVIDIA

In a disconnected environment where you use a Harbor registry only on the AI-ready Supervisor, you must manually upload the AI container images that you intend to deploy on a deep learning VM or a TKG cluster from the NVIDIA NGC catalog to Harbor.

Procedure

On the machines for access to NVIDIA NGC and to the disconnected VMware Cloud Foundation instance, configure the Docker client with the certificate of the Harbor Registry.
See Configure a Docker Client with a Registry Certificate.
Log in to NVIDA NGC.
Use the reserved user name of $oauthtoken and paste the API key in the password field.
```
docker login nvcr.io
```
Pull the required container images to the machine with access to NVIDIA NGC catalog and save them to an archive.
For example, to download the CUDA Sample container, run the following commands.
```
docker pull nvcr.io/nvidia/k8s/cuda-sample:vectoradd-cuda11.7.1-ubi8
docker save > cuda-sample.tar nvcr.io/nvidia/k8s/cuda-sample:vectoradd-cuda11.7.1-ubi8
```
Copy the archive to the machine with access to the local container registry.
On the machine with access to the local container registry, load the container image.
```
docker load < cuda-sample.tar
```
Push the container images to the Harbor Registry.
See Push Container Images to a Registry.

Create a Harbor Registry in VMware Private AI Foundation with NVIDIA as a Replica of a Connected Registry

To be able to update easily to the latest images in the NVIDIA NGC catalog, you can use a Harbor registry in a Supervisor that is in another VI workload domain or VMware Cloud Foundation instance and can be connected to Internet. You then replicate this connected registry on the Supervisor where you plan to run AI workloads.

You pull the latest container images from NVIDIA NGC to the connected Harbor registry and transfer them to the disconnected one by using a proxy-cached connection. In this way, you do not have to download container images and then upload them manually on a frequent basis.

Note: You can also use a connected container registry by another vendor.

You set up the network between the two registries in the following way:

The connected registry is routable to the replica registry.
The connected registry is placed in a DMZ where only docker push and docker pull communication is allowed between the two registries.

Prerequisites

Enable Harbor as a Supervisor Service in the Supervisor in the GPU-enabled workload domain.

Procedure

Log in to the connected Harbor Registry UI as a Harbor system administrator.
Go to the Administration > Registries page and create an endpoint for the NVIDIA NGC catalog nvcr.io/nvaie selecting the Docker Registry provider and with your NVIDIA NGC API key.
Go to the Administration > Projects page and create a proxy-cache project, connected to the endpoint for nvcr.io/nvaie.
Back on the Registries page, create a replication endpoint for the disconnected registry, selecting the Harbor provider.
Go to the Administration > Replications page and create a replication rule.
- Use push-based replication mode.
- In the Destination registry property, enter the URL of the disconnected registry on the AI-ready Supervisor.
- Set filters, target namespace and trigger mode according to the requirements of your organization.

What to do next

Pull the container images that are required by your organization from NVIDIA NGC to the connected registry by running docker pull on the Docker client machine.
If the replication rule has manual trigger mode, run manually replications as needed.