If you plan to use Retrieval-Augmented Generation (RAG) with VMware Private AI Foundation with NVIDIA, set up a PostgreSQL database with pgvector by using VMware Data Services Manager.
You can create the database manually or create a self-service catalog in VMware Aria Automation that can be used by DevOps engineers and developers.
Prerequisites
- Verify that VMware Private AI Foundation with NVIDIA is available for the VI workload domain. See Deploying VMware Private AI Foundation with NVIDIA.
- Install the psql command line utility from the PostgreSQL Web site.
Procedure
Deploying a Vector Database by Using a Self-Service Catalog Item in VMware Aria Automation
Data scientists and DevOps engineers can use VMware Aria Automation to provision a PostgreSQL database with pgvector extension in the VI workload domain dedicated for your AI workloads.
Create a Vector Database Catalog Item in VMware Aria Automation
As a cloud administrator, add a catalog item for provisioning databases in VMware Data Services Manager to Service Broker in VMware Aria Automation.
Prerequisites
- Verify that you have VMware Data Services Manager 2.0.2 deployed.
- Provide a machine that has Python 3.10 installed and has access to the VMware Data Services Manager and VMware Aria Automation instances.
Procedure
Results
Deploy a Vector Database by Using a Self-Service Catalog Item in VMware Aria Automation
In VMware Private AI Foundation with NVIDIA, as data scientist or a DevOps engineer, you can deploy a vector database from VMware Aria Automation by using a self-service catalog item in Automation Service Broker.
Prerequisites
Verify with your cloud administrator that the prerequisites for creating a PostgreSQL database are in place. See Creating Databases.