You can manually install the NVIDIA software for your vSphere Bitfusion deployment. Follow this procedure if you chose not to download and install the NVIDIA driver, CUDA library, and NVIDIA Fabric Manager during the initial boot of the vSphere Bitfusion server virtual machine (VM), and the server VM has access to the Internet.

You can skip this procedure if you chose to download and install the NVIDIA software during the initial boot of the vSphere Bitfusion server VM.

Prerequisites

  • The use of the NVIDIA driver implies acceptance of the NVIDIA Software License Agreement. See License For Customer Use of NVIDIA Software.
  • The NVIDIA driver certified for use with vSphere Bitfusion is NVIDIA-Linux-x86_64-460.32.03.run.
  • The CUDA library that is necessary for NCCL operations and certified for use with vSphere Bitfusion is cuda_11.2.0_460.27.04_linux.run.
  • The NVIDIA Fabric Manager package certified for use with vSphere Bitfusion is nvidia-fabricmanager-460-460.32.03-1.x86_64.rpm.

Procedure

  1. Log in to the appliance shell of the vSphere Bitfusion server VM.
    ssh customer@bitfusion_server_IP_address
  2. To install the NVIDIA driver, CUDA library, and NVIDIA Fabric Manager, run the following command.
    sudo install-nvidia-packages --defaults --yes
  3. Restart the VM.
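
The steps above can also be driven from a workstation. The following is a minimal sketch, not a supported tool: the BITFUSION_SERVER and DRY_RUN variables and the run_remote helper are illustrative names, and restarting with sudo reboot is an assumption, because the procedure does not say how to restart the VM. By default the script only prints the commands it would run.

```shell
#!/bin/sh
# Sketch of the procedure, run from a workstation with SSH access.
# BITFUSION_SERVER and DRY_RUN are illustrative names, not Bitfusion settings.
BITFUSION_SERVER="${BITFUSION_SERVER:-bitfusion_server_IP_address}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands, 0 = run them

# Run one command on the server over SSH as the "customer" user.
# -t allocates a terminal so that sudo can prompt for a password.
run_remote() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "ssh -t customer@$BITFUSION_SERVER '$1'"
    else
        ssh -t "customer@$BITFUSION_SERVER" "$1"
    fi
}

# Step 2: install the NVIDIA driver, CUDA library, and Fabric Manager.
# Step 3: restart the VM, but only if the installation succeeded.
# "sudo reboot" is an assumption; restarting from vCenter Server also works.
run_remote "sudo install-nvidia-packages --defaults --yes" &&
    run_remote "sudo reboot"
```

To run the commands for real, set DRY_RUN=0 and BITFUSION_SERVER to the address of the server VM. The SSH session is expected to drop when the VM restarts.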

Results

After the vSphere Bitfusion server VM powers on, let it run for at least 10 minutes before you perform any further configuration tasks or operations. During this time, the vSphere Bitfusion server registers with vCenter Server.
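
One way to keep track of the waiting period is to check the uptime of the server VM. The sketch below assumes that it runs in the appliance shell of a Linux-based server VM where /proc/uptime is readable. MIN_UPTIME_SECONDS and seconds_remaining are illustrative names. Uptime is only a proxy: 10 minutes is a minimum, and reaching it does not by itself confirm that registration with vCenter Server has finished.

```shell
#!/bin/sh
# The 10-minute waiting period described above, in seconds.
MIN_UPTIME_SECONDS=600

# Print how many seconds of the waiting period remain,
# given the current uptime in whole seconds (never below 0).
seconds_remaining() {
    remaining=$((MIN_UPTIME_SECONDS - $1))
    if [ "$remaining" -lt 0 ]; then
        remaining=0
    fi
    echo "$remaining"
}

# /proc/uptime holds "<uptime> <idle>" in seconds; keep the integer part.
if [ -r /proc/uptime ]; then
    uptime_seconds=$(cut -d. -f1 /proc/uptime)
    echo "Wait at least $(seconds_remaining "$uptime_seconds") more seconds before configuring."
fi
```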

What to do next

Verify That the vSphere Bitfusion Plug-In Registers with vCenter Server