pgvector

A machine language-generated embedding is a complex object transformed into a list of numbers (vector) that reflects both the semantic and syntactic relationships of the data. The pgvector module provides vector similarity search capabilities for Greenplum Database that enable you to search, store, and query embeddings at large scale.

The Greenplum Database pgvector module is equivalent to version 0.7.0 of the pgvector module used with PostgreSQL. The limitations of the Greenplum version of the module are described in the Greenplum Database Limitations topic.

Installing and Registering the Module

The pgvector module is installed when you install Greenplum Database. Before you can use the data type and index access method defined in the module, you must register the vector extension in each database in which you want to use these:

CREATE EXTENSION vector;

Refer to Installing Additional Supplied Modules for more information.

Upgrading the Module

The pgvector module is installed when you install or upgrade Greenplum Database. A previous version of the extension will continue to work in existing databases after you upgrade Greenplum. To upgrade to the most recent version of the extension, you must:

ALTER EXTENSION vector UPDATE TO '0.7.0';

in every database in which you registered/use the extension.

About the vector Types, Operators, and Functions

pgvector provides a vector data type and the index access methods ivfflat and hnsw. The type, methods, and the supporting functions and operators provided by the module enable you to perform exact and approximate neighbor search on, and determine L2, inner product, and cosine distance between, embeddings. You can also use the module to store and query embeddings.

vector Data Type

The vector data type represents an n-dimensional coordinate. Each vector takes 4 * dimensions + 8 bytes of storage. Each element is a single precision floating-point number (similar to the real type in Greenplum Database), and all of the elements must be finite (no NaN, Infinity, or -Infinity). Vectors can have up to 16,000 dimensions.

vector Operators

pgvector provides the following operators for the vector data type:

Operator	Description
+	Element-wise addition
-	Element-wise subtraction
*	Element-wise multiplication
\|\|	concatenate
<–>	Euclidean distance
<#>¹	Negative inner product
<=>	Cosine distance
<+>	taxicab distance

¹ Because Greenplum Database supports only ASC order index scans on operators, <#> returns the negative inner product.

vector Functions

pgvector provides the following functions for the vector data type:

Function Name	Description
binary_quantize(vector) → bit	binary quantize
cosine_distance(vector, vector) → double precision	Computes the cosine distance
inner_product(vector, vector) → double precision	Computes the inner product
l2_distance(vector, vector) → double precision	Computes the Euclidean distance
l1_distance(vector, vector) → double precision	Computes the taxicab distance
l2_normalize(vector) → vector	Normalize with Euclidean norm
subvector(vector, integer, integer) → vector	subvector
vector_dims(vector) → integer	Returns the number of dimensions
vector_norm(vector) → double precision	Computes the Euclidean norm

vector Aggregate Functions

pgvector provides the following aggregate functions for the vector data type:

Function	Description
avg(vector) → vector	Computes the arithmetic mean
sum(vector) → vector	Computes the sum of the vector elements

About the half vector Types, Operators, and Functions

pgvector provides a halfvec data type and the index access methods ivfflat and hnsw. The type, methods, and the supporting functions and operators provided by the module enable you to perform exact and approximate neighbor search on, and determine L2, inner product, and cosine distance between, embeddings. You can also use the module to store and query embeddings.

half vector Data Type

The halfvec data type represents an n-dimensional coordinate. Each half vector takes 2 * dimensions + 8 bytes of storage. Each element is a half-precision floating-point number, and all elements must be finite (no NaN, Infinity or -Infinity). Half vectors can have up to 16,000 dimensions.

half vector Operators

pgvector provides the following operators for the halfvec data type:

Operator	Description
+	element-wise addition
-	element-wise subtraction
*	element-wise multiplication
\|\|	concatenate
↔	Euclidean distance
<#>	negative inner product
⇔	cosine distance
<+>	taxicab distance

¹ Because Greenplum Database supports only ASC order index scans on operators, <#> returns the negative inner product.

half vector Functions

pgvector provides the following functions for the halfvec data type:

Function Name	Description
binary_quantize(halfvec) → bit	binary quantize
cosine_distance(halfvec, halfvec) → double precision	cosine distance
inner_product(halfvec, halfvec) → double precision	inner product
l1_distance(halfvec, halfvec) → double precision	taxicab distance
l2_distance(halfvec, halfvec) → double precision	Euclidean distance
l2_norm(halfvec) → double precision	Euclidean norm
l2_normalize(halfvec) → halfvec	Normalize with Euclidean norm
subvector(halfvec, integer, integer) → halfvec	subvector
vector_dims(halfvec) → integer	number of dimensions

half vector Aggregate Functions

pgvector provides the following aggregate functions for the halfvec data type:

Function	Description
avg(halfvec) → halfvec	average
sum(halfvec) → halfvec	sum

About the binary vector Types, Operators, and Functions

pgvector provides a binary vector data type and the index access methods ivfflat and hnsw. The type, methods, and the supporting functions and operators provided by the module enable you to perform exact and approximate neighbor search on, and determine Hamming and Jaccard distance between, embeddings. You can also use the module to store and query embeddings.

binary vector Data Type

The binary vector only stores 0 or 1 in each dimension. You can use binary_quantize convert a vector to the binary vector. Each binary vector takes dimensions / 8 + 8 bytes of storage. See the Postgres docs for more info.

binary vector Operators

pgvector provides the following operators for the binary vector data type:

Operator	Description
<~>	Hamming distance
<%>	Jaccard distance

binary vector Functions

pgvector provides the following functions for the binary vector data type:

Function	Description
hamming_distance(bit, bit) → double precision	Hamming distance
jaccard_distance(bit, bit) → double precision	Jaccard distance

Using the pgvector Module

You can use pgvector to search, store, and query embeddings in Greenplum Database.

Examples: Storing Embeddings in Greenplum Database

In the following examples, you manipulate a vector column of a table.

Create a new table with a vector column with 3 dimensions:

CREATE TABLE items (id bigserial PRIMARY KEY, embedding vector(3));

Or add a vector column to an existing table:

ALTER TABLE items ADD COLUMN embedding vector(3);

Insert vectors into the table:

INSERT INTO items (embedding) VALUES ('[1,2,3]'), ('[4,5,6]');

Upsert vectors:

INSERT INTO items (id, embedding) VALUES (1, '[1,2,3]'), (2, '[4,5,6]')
    ON CONFLICT (id) DO UPDATE SET embedding = EXCLUDED.embedding;

Update vectors:

UPDATE items SET embedding = '[1,2,3]' WHERE id = 1;

Delete vectors:

DELETE FROM items WHERE id = 1;

Examples: Storing Embeddings with halfvec in Greenplum Database

In the following examples, you manipulate a halfvec column of a table.

Create a new table with a halfvec column with 3 dimensions:

CREATE TABLE items (id bigserial PRIMARY KEY, embedding halfvec(3));

Or add a halfvec column to an existing table:

ALTER TABLE items ADD COLUMN embedding halfvec(3);

Insert halfvecs into the table:

INSERT INTO items (embedding) VALUES ('[1,2,3]'), ('[4,5,6]');

Upsert halfvecs:

INSERT INTO items (id, embedding) VALUES (1, '[1,2,3]'), (2, '[4,5,6]')
    ON CONFLICT (id) DO UPDATE SET embedding = EXCLUDED.embedding;

Update halfvecs:

UPDATE items SET embedding = '[1,2,3]' WHERE id = 1;

Delete halfvecs:

DELETE FROM items WHERE id = 1;

Examples: Storing Embeddings with binary vector in Greenplum Database

Use the bit type to store binary vectors (example)

CREATE TABLE items (id bigserial PRIMARY KEY, embedding bit(3));
INSERT INTO items (embedding) VALUES ('000'), ('111');

Get the nearest neighbors by Hamming distance:

SELECT * FROM items ORDER BY embedding <~> '101' LIMIT 5;

Also supports Jaccard distance (<%>)

SELECT * FROM items ORDER BY embedding <%> '101' LIMIT 5;

Examples: Querying Embeddings in Greenplum Database

You can query embeddings as follows.

Get the nearest neighbors to a vector or halfvec by L2 distance:

SELECT * FROM items ORDER BY embedding <-> '[3,1,2]' LIMIT 5;

Get the nearest neighbors to a row:

SELECT * FROM items WHERE id != 1 
  ORDER BY embedding <-> (SELECT embedding FROM items WHERE id = 1) LIMIT 5;

Get rows within a certain distance:

SELECT * FROM items WHERE embedding <-> '[3,1,2]' < 5;

Combine with the ORDER BY and LIMIT clauses to use an index.

Evaluating Embedding Distance

The following examples use the available vector distance operators.

Get the distance:

SELECT embedding <-> '[3,1,2]' AS distance FROM items;

When you request the inner product, remember to multiply by -1:

SELECT (embedding <#> '[3,1,2]') * -1 AS inner_product FROM items;

For cosine similarity, use 1 - <cosine_distance>:

SELECT 1 - (embedding <=> '[3,1,2]') AS cosine_similarity FROM items;

Aggregating Embeddings

The following examples display various forms of aggregating embeddings.

Average the vectors in a table:

SELECT AVG(embedding) FROM items;

Average a group of vectors in a table:

SELECT category_id, AVG(embedding) FROM items GROUP BY category_id;

About Indexing Embeddings

By default, pgvector performs exact nearest neighbor search, which provides perfect recall. You can add an index to use approximate nearest neighbor search, trading some recall for performance.

Note
Unlike a typical index, a query returns different results after adding an approximate index.

When you create an index for an embedding, you use the lists parameter to specify the number of clusters created during index creation. Each cluster is a partition of the data set.

To achieve good recall, keep the following in mind:

Create the index after the table has some data.
Choose an appropriate number of lists. A reasonable initial value is rows / 1000 for up to 1M rows and sqrt(rows) for over 1M rows.
When querying, specify an appropriate number of probes (higher is better for recall, lower is better for speed). A reasonable initial value is sqrt(lists).

Dimension limitations for index types:

vector: supports up to 2,000 dimensions.
halfvec: supports up to 4,000 dimensions.
binary vector: supports up to 64,000 dimensions.

The following examples show how to add an index for various distance methods.

IVFFlat Examples

Create an index on the L2 distance:

CREATE INDEX ON items USING ivfflat (embedding vector_l2_ops) WITH (lists = 100);

Create an index on the inner product:

CREATE INDEX ON items USING ivfflat (embedding vector_ip_ops) WITH (lists = 100);

Create an index on the cosine distance:

CREATE INDEX ON items USING ivfflat (embedding vector_cosine_ops) WITH (lists = 100);

Create an index on the L2 distance with halfvec:

CREATE INDEX ON items USING ivfflat ((embedding::halfvec(3)) halfvec_l2_ops);

Create an index on the inner product with halfvec:

CREATE INDEX ON items USING ivfflat ((embedding::halfvec(3)) halfvec_ip_ops) WITH (lists = 100);

Create an index on the cosine distance with halfvec:

CREATE INDEX ON items USING ivfflat ((embedding::halfvec(3)) halfvec_cosine_ops) WITH (lists = 100);

Create an index on the Hamming distance with binary vector:

CREATE INDEX ON items USING ivfflat ((binary_quantize(embedding)::bit(3)) bit_hamming_ops) WITH (lists = 100);

Query Options

pgvector provides a probes parameter that you can set at query time to specify the number of regions to search during a query.

Specify the number of probes (1 by default):

SET ivfflat.probes = 10;

A higher probes value provides better recall at the cost of speed. You can set it to the number of lists for exact nearest neighbor search (at which point the planner will not use the index).

Use SET LOCAL inside a transaction block to set probes for a single query:

BEGIN;
SET LOCAL ivfflat.probes = 10;
SELECT ...
COMMIT;

HNSW Examples

Create an index on the L2 distance:

CREATE INDEX ON items USING hnsw (embedding vector_l2_ops);

Create an index on the inner product:

CREATE INDEX ON items USING hnsw (embedding vector_ip_ops);

Create an index on the cosine distance:

CREATE INDEX ON items USING hnsw (embedding vector_cosine_ops);

Create an index on the L1 distance:

CREATE INDEX ON items USING hnsw (embedding vector_l1_ops);

Create an index on the L2 distance with halfvec:

CREATE INDEX ON items USING hnsw ((embedding::halfvec(3)) halfvec_l2_ops);

Create an index on the inner product with halfvec:

CREATE INDEX ON items USING hnsw ((embedding::halfvec(3)) halfvec_ip_ops);

Create an index on the cosine distance with halfvec:

CREATE INDEX ON items USING hnsw ((embedding::halfvec(3)) halfvec_cosine_ops);

Create an index on the L1 distance with halfvec:

CREATE INDEX ON items USING hnsw ((embedding::halfvec(3)) halfvec_l1_ops);

Create an index on the Hamming distance with binary vector:

CREATE INDEX ON items USING hnsw ((binary_quantize(embedding)::bit(3)) bit_hamming_ops);

Create an index on the Jaccard distance with binary vector:

CREATE INDEX ON items USING hnsw ((binary_quantize(embedding)::bit(3)) bit_jaccard_ops);

HNSW indexes support the following parameters:

m specifies the maximum number of connections per layer (16 by default)
ef_construction specifies the size of the dynamic candidate list for constructing the graph (64 by default)

For example:

CREATE INDEX ON items USING hnsw (embedding vector_l2_ops) WITH (m = 16, ef_construction = 64);

Query Options

Specify a csutom size for the dynamic candidate list for a search:

SET hnsw.ef_search = 100;

Higher value provides better recall at the cost of speed. The default size of the candidate list is 40.

This example sets the candidate size in a transaction for a single query:

BEGIN;
SET LOCAL hnsw.ef_search = 100;
SELECT ...
COMMIT;

Indexing Progress

You can check index creation progress in Greenplum Database as described in CREATE INDEX Progress Reporting.

SELECT phase, tuples_done, tuples_total FROM gp_stat_progress_create_index;

Filtering

There are multiple ways to index nearest neighbor queries with a WHERE clause:

SELECT * FROM items WHERE category_id = 123 ORDER BY embedding <-> '[3,1,2]' LIMIT 5;

For exact search, create an index on one or more of the WHERE columns:

CREATE INDEX ON items (category_id);

For approximate search, create a partial index on the vector column:

CREATE INDEX ON items USING ivfflat (embedding vector_l2_ops) WITH (lists = 100)
    WHERE (category_id = 123);

Use partitioning for approximate search on many different values of the WHERE columns:

CREATE TABLE items (embedding vector(3), category_id int) PARTITION BY LIST(category_id);

About Hybrid Search

You can use pgvector together with full-text search for a hybrid search:

SELECT id, content FROM items, to_tsquery('hello & search') query
    WHERE textsearch @@ query ORDER BY ts_rank_cd(textsearch, query) DESC LIMIT 5;

About Debugging and Maximizing Performance

Use EXPLAIN ANALYZE to debug performance:

EXPLAIN ANALYZE SELECT * FROM items ORDER BY embedding <-> '[3,1,2]' LIMIT 5;

Exact Search

To speed up queries without an index, increase the max_parallel_workers_per_gather server configuration parameter:

SET max_parallel_workers_per_gather = 4;

If vectors are normalized to length 1, use inner product for the best performance:

SELECT * FROM items ORDER BY embedding <#> '[3,1,2]' LIMIT 5;

Approximate Search

To speed up queries with an index, increase the number of inverted lists (at the expense of recall):

CREATE INDEX ON items USING ivfflat (embedding vector_l2_ops) WITH (lists = 1000);

Greenplum Database Limitations

pgvector for Greenplum Database has the following limitations:

The Greenplum query optimizer (GPORCA) does not support ivfflat and hnsw vector indexes. Queries on tables that utilize these index types fall back to the Postgres-based planner.
Append-optimized tables cannot use vector indexes.
The size of (a vector) index can be larger than the table size.
Parallel index creation is temporarily unsupported on Greenplum.

Additional References

The following examples use pgvector and the VMware Greenplum documentation to build an AI assistant for the product documentation: