Vectorize (Vector Database)

Learn what vector databases are on Cloudflare's documentation

Vectorize is only available in local development when using remote storage.

Getting Started

Vectorize indexes are managed in your NuxtHub project within the hub.vectorize object in your nuxt.config.ts file. Multiple indexes can be created using separate keys.

nuxt.config.ts

export default defineNuxtConfig({
  hub: {
    vectorize: {
      <index-name>: {
        dimensions: <number>,
        metric: "dot-product" | "cosine" | "euclidean",
        metadataIndexes: {
          <property>: "string" | "number" | "boolean"
        }
      }
    }
  }
})

Then, make sure to deploy your project to create the indexes.

Once deployed, you can use the hubVectorize() server composable to interact with your indexes. In local development, this requires remote storage, which is enabled with the npx nuxt dev --remote command.

Cloudflare Vectorize index configurations are immutable after creation. Modifying an index's dimension size or distance metric will disconnect the modified existing index from your deployed project and create a new index. Existing data will not be migrated. Consider your index configuration carefully before creation to avoid data loss and reconnection issues.

Create an index

Creating an index requires three inputs:

A name, for example prod-search-index or recommendations-idx-dev.
The (fixed) dimension size of each vector, for example 384 or 1536.
The (fixed) distance metric to use for calculating vector similarity.

Index names can contain only lowercase characters and hyphens (-), and are limited to 51 characters.
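
The naming rule can be checked before deploying. The helper below is a hypothetical sketch and not part of NuxtHub. It assumes that "lowercase characters" covers the letters a-z and the digits 0-9, which the rule above does not state explicitly.

```typescript
// Hypothetical helper (not a NuxtHub API): checks an index name against
// the naming rule stated above.
// Assumption: "lowercase characters" means a-z plus the digits 0-9.
const MAX_INDEX_NAME_LENGTH = 51;

function isValidIndexName(name: string): boolean {
  return name.length <= MAX_INDEX_NAME_LENGTH && /^[a-z0-9-]+$/.test(name);
}

console.log(isValidIndexName("prod-search-index")); // true
console.log(isValidIndexName("Prod_Search")); // false
```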

Cloudflare Vectorize indexes will be created for your project when you deploy it. Created Vectorize indexes contain a unique 4 character suffix.

Use existing indexes

On the Cloudflare dashboard → Workers & Pages → Your Pages project
Go to Settings → Bindings → Add
Select Vectorize database
- Set the variable name to VECTORIZE_<NAME>. The entire variable name should be capitalised.
- Select the existing Vectorize index
Add the index configuration to hub.vectorize in nuxt.config.ts. The name should match <name> used in the variable name, and be lowercase.

Creating metadata indexes

Vectorize allows you to add up to 10KiB of metadata per vector into your index, and also provides the ability to filter on that metadata while querying vectors. To do so, specify a metadata field as a "metadata index" for your Vectorize index.

Learn more about creating metadata indexes.

`hubVectorize()`

Server composable that returns a Vectorize index.

const index = hubVectorize("<index>")

IntelliSense will suggest <index> based on the indexes configured in hub.vectorize.

`insert()`

Inserts vectors into the index.

const vectorsToInsert = [
  { id: "123", values: [32.4, 6.5, 11.2, 10.3, 87.9] },
  { id: "456", values: [2.5, 7.8, 9.1, 76.9, 8.5] },
];
const inserted = await index.insert(vectorsToInsert);

See all available properties on the Vector object.

If vectors with the same vector ID already exist in the index, only the vectors with new IDs will be inserted.

Vectorize inserts are asynchronous and the insert operation returns a mutation identifier unique for that operation. It typically takes a few seconds for inserted vectors to be available for querying in an index.

If you need to update existing vectors, use the upsert operation.

Insert vs Upsert

If the same vector id is inserted twice in a Vectorize index, the index would reflect the vector that was added first.
If the same vector id is upserted twice in a Vectorize index, the index would reflect the vector that was added second.
Use the upsert operation if you want to overwrite the vector value for a vector id that already exists in an index.
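
The difference between the two operations can be modelled in memory. The sketch below is only an illustration of the semantics described above. It does not call Vectorize, and the SimpleVector type and both function names are hypothetical.

```typescript
// In-memory model of insert vs upsert semantics. Illustration only:
// nothing here talks to a real Vectorize index.
type SimpleVector = { id: string; values: number[] };

function insertInto(store: Map<string, SimpleVector>, vectors: SimpleVector[]): void {
  for (const vector of vectors) {
    // insert: a vector whose ID already exists is left untouched
    if (!store.has(vector.id)) store.set(vector.id, vector);
  }
}

function upsertInto(store: Map<string, SimpleVector>, vectors: SimpleVector[]): void {
  for (const vector of vectors) {
    // upsert: a vector whose ID already exists is replaced in full
    store.set(vector.id, vector);
  }
}

const insertStore = new Map<string, SimpleVector>();
insertInto(insertStore, [{ id: "123", values: [1, 2] }]);
insertInto(insertStore, [{ id: "123", values: [9, 9] }]);
console.log(insertStore.get("123")?.values); // the first vector: 1, 2

const upsertStore = new Map<string, SimpleVector>();
upsertInto(upsertStore, [{ id: "123", values: [1, 2] }]);
upsertInto(upsertStore, [{ id: "123", values: [9, 9] }]);
console.log(upsertStore.get("123")?.values); // the second vector: 9, 9
```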

Return

Returns VectorizeAsyncMutation.

`upsert()`

Upserts vectors into an index.

const vectorsToUpsert = [
  { id: "123", values: [32.4, 6.5, 11.2, 10.3, 87.9] },
  { id: "456", values: [2.5, 7.8, 9.1, 76.9, 8.5] },
  { id: "768", values: [29.1, 5.7, 12.9, 15.4, 1.1] },
];
const upserted = await index.upsert(vectorsToUpsert);

An upsert operation will insert vectors into the index if vectors with the same ID do not exist, and overwrite vectors with the same ID.

See all available properties on the Vector object.

Upserting does not merge or combine the values or metadata of an existing vector with the upserted vector: the upserted vector replaces the existing vector in full.

Vectorize upserts are asynchronous and the upsert operation returns a mutation identifier unique for that operation. It typically takes a few seconds for upserted vectors to be available for querying in an index.

Return

Returns VectorizeAsyncMutation.

`query()`

Query an index with the provided vector, returning the score(s) of the closest vectors based on the configured distance metric.

const queryVector = [32.4, 6.55, 11.2, 10.3, 87.9];
const matches = await index.query(queryVector);

console.log(matches)
/*
{
    "count": 5,
    "matches": [
        { "score": 0.999909486, "id": "5" },
        { "score": 0.789848214, "id": "4" },
        { "score": 0.720476967, "id": "1234" },
        { "score": 0.463884663, "id": "6" },
        { "score": 0.378282232, "id": "1" }
    ]
}
*/

Querying an index, or vector search, enables you to search an index by providing an input vector and returning the nearest vectors based on the configured distance metric.

Optionally, you can apply metadata filters or a namespace to narrow the vector search space.

Params

vector (required)

array

Input vector that will be used to drive the similarity search.

options

object

Query options.

const matches = await index.query(queryVector, {
  topK: 5,
  returnValues: true,
  returnMetadata: "all",
});

Return

Returns VectorizeMatches.

Control over scoring precision and query accuracy

When querying vectors, you can specify to either use high-precision scoring, thereby increasing the precision of the query matches scores as well as the accuracy of the query results, or use approximate scoring for faster response times. Using approximate scoring, returned scores will be an approximation of the real distance/similarity between your query and the returned vectors.

High-precision scoring is enabled by setting returnValues: true on your query; this tells Vectorize to fetch and use the original vector values for your matches, which enables the computation of exact scores of matches, increasing the accuracy of the results.

`getByIds()`

Retrieves the specified vectors by their ID, including values and metadata.

const ids = ["11", "22", "33", "44"];
const vectors = await index.getByIds(ids);

Params

ids (required)

string[]

List of vector ids that should be returned.

Return

Returns an array of VectorizeVector objects.

`deleteByIds()`

Deletes the vectors with the provided IDs from the current index.

const idsToDelete = ["11", "22", "33", "44"];
const deleted = await index.deleteByIds(idsToDelete);

Vectorize deletes are asynchronous and the delete operation returns a mutation identifier unique for that operation. It typically takes a few seconds for vectors to be removed from the Vectorize index.

Params

ids (required)

string[]

List of vector ids that should be deleted.

Return

Returns VectorizeAsyncMutation.

`describe()`

Retrieves the configuration of a given index directly, including its configured dimensions and distance metric.

const details = await index.describe();

Return

Returns VectorizeIndexDetails.

Vectors

Vector Object

A vector represents the vector embedding output from a machine learning model.

id (required)

string

A unique string identifying the vector in the index. This should map back to the ID of the document, object or database identifier that the vector values were generated from.

namespace

string

An optional partition key within an index. Operations are performed per-namespace, so this can be used to create isolated segments within a larger index.

values (required)

number[] | Float32Array | Float64Array

The vector embedding itself, as a number[], Float32Array, or Float64Array. This must be a dense array, and its length must match the dimensions configured on the index.

metadata

Record<string, string | number | boolean>

An optional set of key-value pairs that can be used to store additional metadata alongside a vector.

const vectorExample = {
    id: "12345",
    values: [32.4, 6.55, 11.2, 10.3, 87.9],
    metadata: {
        key: "value",
        hello: "world",
        url: "r2://bucket/some/object.json",
    },
};

Dimensions

Dimensions are determined from the output size of the machine learning (ML) model used to generate them, and are a function of how the model encodes and describes features into a vector embedding.

The number of output dimensions can determine vector search accuracy, search performance (latency), and the overall size of the index. Smaller output dimensions can be faster to search across, which can be useful for user-facing applications. Larger output dimensions can provide more accurate search, especially over larger datasets and/or datasets with substantially similar inputs.

The number of dimensions an index is created for cannot change. Indexes expect to receive dense vectors with the same number of dimensions.
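
Because a mismatched vector cannot be stored, it can help to check the length before calling insert() or upsert(). The helper below is a hypothetical sketch, not a NuxtHub or Vectorize API. Rejecting NaN and Infinity is an extra assumption on top of the "dense array" requirement.

```typescript
// Hypothetical pre-flight check (not part of NuxtHub or Vectorize).
type VectorValues = number[] | Float32Array | Float64Array;

function fitsIndex(values: VectorValues, dimensions: number): boolean {
  if (values.length !== dimensions) return false;
  // Array.from turns holes in a sparse array into undefined, which fails
  // the finite-number check. Rejecting NaN/Infinity is an assumption.
  return Array.from(values).every((value) => Number.isFinite(value));
}

console.log(fitsIndex([32.4, 6.5, 11.2, 10.3, 87.9], 5)); // true
console.log(fitsIndex(new Float32Array(4), 5)); // false
```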

The following table highlights some example embeddings models and their output dimensions:

Model / Embeddings API	Output dimensions	Use-case
Workers AI - `@cf/baai/bge-base-en-v1.5`	768	Text
OpenAI - `ada-002`	1536	Text
Cohere - `embed-multilingual-v2.0`	768	Text
Google Cloud - `multimodalembedding`	1408	Multi-modal (text, images)

Refer to the Workers AI documentation to learn about its built-in embedding models.

Distance metrics

Distance metrics are functions that determine how close vectors are to each other. Vectorize indexes support the following distance metrics:

Metric	Details
`cosine`	Distance is measured from `-1` (most dissimilar) to `1` (identical). `0` denotes an orthogonal vector.
`euclidean`	Euclidean (L2) distance. `0` denotes identical vectors. The larger the positive number, the further the vectors are apart.
`dot-product`	Negative dot product. Larger negative values or smaller positive values denote more similar vectors. A score of `-1000` is more similar than `-500`, and a score of `15` more similar than `50`.

Determining the similarity between vectors can be subjective, depending on how the machine-learning model represents features in the resulting vector embeddings. For example, a score of 0.8511 when using a cosine metric means that two vectors are close in distance, but whether the data they represent is similar is a function of how well the model is able to represent the original content.
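
To make the three metrics concrete, the sketch below computes them with their textbook definitions. The function names are hypothetical, and the scores Vectorize returns (particularly with approximate scoring) may not match these values exactly.

```typescript
// Textbook definitions of the three metrics. Illustration only: this is
// not how Vectorize computes scores internally.
function dot(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("Vectors must have the same dimensions");
  return a.reduce((sum, value, i) => sum + value * b[i], 0);
}

// 1 for identical direction, 0 for orthogonal, -1 for opposite direction
function cosineSimilarity(a: number[], b: number[]): number {
  return dot(a, b) / (Math.sqrt(dot(a, a)) * Math.sqrt(dot(b, b)));
}

// 0 for identical vectors, growing as the vectors move apart
function euclideanDistance(a: number[], b: number[]): number {
  const diff = a.map((value, i) => value - b[i]);
  return Math.sqrt(dot(diff, diff));
}

// Negated dot product, so lower values denote more similar vectors
function negativeDotProduct(a: number[], b: number[]): number {
  return -dot(a, b);
}

console.log(cosineSimilarity([3, 4], [3, 4])); // 1
console.log(euclideanDistance([0, 0], [3, 4])); // 5
console.log(negativeDotProduct([1, 2], [3, 4])); // -11
```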

When querying vectors, you can tell Vectorize to use either:

High-precision scoring, which increases the precision of the query matches scores as well as the accuracy of the query results.
Approximate scoring for faster response times. Using approximate scoring, returned scores will be an approximation of the real distance/similarity between your query and the returned vectors. Refer to Control over scoring precision and query accuracy.

Distance metrics cannot be changed after index creation, and each metric has a different scoring function.

Supported vector formats

Vectorize supports vectors in three formats:

An array of floating point numbers (converted into a JavaScript number[] array).
A Float32Array
A Float64Array

In most cases, a number[] array is the easiest when dealing with other APIs, and is the return type of most machine-learning APIs.
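
If an embedding arrives as a typed array, it can be converted with Array.from. The short sketch below also shows that a Float32Array rounds each value to 32-bit precision, so the converted numbers do not always equal the original literals. The variable names are illustrative.

```typescript
// Converting a typed array into a plain number[].
const embedding = new Float32Array([58.8, 6.7, 3.4]);
const asNumbers: number[] = Array.from(embedding);

// Float32Array stores 32-bit floats, so 58.8 is rounded on the way in.
console.log(asNumbers[0] === 58.8); // false
console.log(asNumbers[0] === Math.fround(58.8)); // true
```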

Metadata & filtering

Metadata is an optional set of key-value pairs that can be attached to a vector on insert or upsert, and allows you to embed or co-locate data about the vector itself.

Metadata keys cannot be empty, contain the dot character (.), contain the double-quote character ("), or start with the dollar character ($).
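
The key rules translate directly into a small check. The function below is a hypothetical helper, not a NuxtHub API, and encodes only the four rules listed above.

```typescript
// Hypothetical helper: applies the four metadata key rules stated above.
function isValidMetadataKey(key: string): boolean {
  return (
    key.length > 0 &&
    !key.includes(".") &&
    !key.includes('"') &&
    !key.startsWith("$")
  );
}

console.log(isValidMetadataKey("streaming_platform")); // true
console.log(isValidMetadataKey("$internal")); // false
```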

Metadata can be used to include:

The object storage key, database UUID or other identifier needed to look up the content the vector embedding represents.
The raw content (up to the metadata limits), which can allow you to skip additional lookups for smaller content.
Dates, timestamps, or other metadata that describes when or how the vector embedding was generated.

For example, a vector embedding representing an image could include the path to the blob it was generated from, the format, and a category lookup:

{ id: '1', values: [32.4, 74.1, 3.2, ...], metadata: { path: 'r2://bucket-name/path/to/image.png', format: 'png', category: 'profile_image' } }

Metadata filtering

In addition to providing an input vector to your query, you can also filter by the metadata associated with each vector. Query results only include vectors that match the filter criteria: the filter is applied first, and the topK results are taken from the filtered set.
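
The ordering matters: filtering happens before the topK cut, not after it. The sketch below models that order in memory, using scores copied from the query examples on this page. The ScoredVector type and filterThenTopK function are hypothetical, and sorting by descending score assumes a metric where a higher score means a closer match.

```typescript
// In-memory model of "filter first, then take topK". Illustration only.
type ScoredVector = {
  id: string;
  score: number;
  metadata: Record<string, string | number | boolean>;
};

function filterThenTopK(
  candidates: ScoredVector[],
  predicate: (vector: ScoredVector) => boolean,
  topK: number,
): ScoredVector[] {
  return candidates
    .filter(predicate) // 1. narrow the search space
    .sort((a, b) => b.score - a.score) // 2. rank (assumes higher = closer)
    .slice(0, topK); // 3. keep the best topK
}

const candidates: ScoredVector[] = [
  { id: "5", score: 0.999909486, metadata: { streaming_platform: "hbo" } },
  { id: "4", score: 0.789848214, metadata: { streaming_platform: "netflix" } },
  { id: "2", score: 0.611976262, metadata: { streaming_platform: "hbo" } },
  { id: "1", score: 0.491185264, metadata: { streaming_platform: "netflix" } },
];

const netflixOnly = filterThenTopK(
  candidates,
  (vector) => vector.metadata.streaming_platform === "netflix",
  3,
);
console.log(netflixOnly.map((vector) => vector.id)); // ids "4" and "1"
```

Had the topK cut been applied first, vector "1" would have been dropped before the filter ran and only vector "4" would remain.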

By using metadata filtering to limit the scope of a query, you can filter by specific customer IDs, tenant, product category or any other metadata you associate with your vectors.

Vectorize requires metadata indexes to be specified before vectors are inserted to support metadata filtering. string, number and boolean metadata indexes are supported. Please refer to Create metadata indexes for details.

Vectorize supports namespace filtering by default.

Create metadata indexes

Metadata indexes are all managed within the metadataIndexes object within your index configuration in nuxt.config.ts.

nuxt.config.ts

export default defineNuxtConfig({
  hub: {
    vectorize: {
      tutorial: {
        dimensions: 32,
        metric: "cosine",
        metadataIndexes: {
          // <property>: "string" | "number" | "boolean"
          url: "string",
          "nested.property": "boolean"
        }
      }
    }
  }
})

Supported metadata index property types are string, number and boolean types.

Define nested properties using . (dot).

Vectorize currently supports a maximum of 10 metadata indexes per Vectorize index. Learn more at https://developers.cloudflare.com/vectorize/platform/limits/.

For metadata indexes of type number, the indexed number precision is that of float64.

For metadata indexes of type string, each vector indexes the first 64B of the string data truncated on UTF-8 character boundaries to the longest well-formed UTF-8 substring within that limit, so vectors are filterable on the first 64B of their value for each indexed property.
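
The truncation rule can be approximated locally to see which part of a string would be indexed. The sketch below is a hypothetical helper that follows the description above (64 bytes, cut on a UTF-8 character boundary). It is not taken from Vectorize itself.

```typescript
// Hypothetical helper: returns the longest well-formed UTF-8 prefix of
// `value` that fits within `limit` bytes, as described above.
const INDEXED_STRING_BYTES = 64;

function indexedPrefix(value: string, limit: number = INDEXED_STRING_BYTES): string {
  const bytes = new TextEncoder().encode(value);
  if (bytes.length <= limit) return value;
  let end = limit;
  // If the byte at the cut position is a continuation byte (10xxxxxx),
  // the cut would split a character, so back up to its first byte.
  while (end > 0 && (bytes[end] & 0xc0) === 0x80) end--;
  return new TextDecoder().decode(bytes.subarray(0, end));
}

console.log(indexedPrefix("a".repeat(70)).length); // 64
console.log(indexedPrefix("a".repeat(63) + "é").length); // 63
```

In the second call, "é" takes two bytes and would straddle the 64-byte limit, so it is dropped entirely.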

As of today, the metadata fields on which vectors can be filtered need to be specified before the vectors are inserted, and it is recommended that these metadata fields are specified right after the creation of a Vectorize index.

Supported operations

The optional filter property of the query() method specifies a metadata filter. The following operators are supported:

Operator	Description
`$eq`	Equals
`$ne`	Not equals

filter must be a non-empty object whose compact JSON representation is less than 2048 bytes.
filter object keys cannot be empty, contain a double quote (") or a dot (. is reserved for nesting), start with $, or be longer than 512 characters.
filter object non-nested values can be string, number, boolean, or null.
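
These constraints can be checked before a query is sent. The sketch below is a hypothetical validator, not part of NuxtHub or Vectorize. It leaves dots in keys alone because they express nesting, and all type and function names are assumptions made for the example.

```typescript
// Hypothetical validator for the filter rules listed above.
type FilterValue = string | number | boolean | null;
type FilterCondition = FilterValue | { $eq: FilterValue } | { $ne: FilterValue };
type MetadataFilter = Record<string, FilterCondition>;

function filterErrors(filter: MetadataFilter): string[] {
  const errors: string[] = [];
  const keys = Object.keys(filter);
  if (keys.length === 0) errors.push("filter must be a non-empty object");

  // JSON.stringify without spacing arguments yields the compact form
  const jsonBytes = new TextEncoder().encode(JSON.stringify(filter)).length;
  if (jsonBytes >= 2048) errors.push("compact JSON must be less than 2048 bytes");

  for (const key of keys) {
    if (key.length === 0) errors.push("keys cannot be empty");
    if (key.includes('"')) errors.push(`key ${key} cannot contain a double quote`);
    if (key.startsWith("$")) errors.push(`key ${key} cannot start with $`);
    if (key.length > 512) errors.push("keys cannot be longer than 512 characters");
  }
  return errors;
}

console.log(filterErrors({ streaming_platform: "netflix" })); // no errors
console.log(filterErrors({})); // one error: the filter is empty
```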

Valid `filter` examples

Implicit `$eq` operator

{ "streaming_platform": "netflix" }

Explicit operator

{ "someKey": { "$ne": true } }

Implicit logical `AND` with multiple keys

{ "pandas.nice": 42, "someKey": { "$ne": true } }

Keys define nesting with `.` (dot)

{ "pandas.nice": 42 }

// looks for { "pandas": { "nice": 42 } }
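
The four filter shapes above can be emulated with a small in-memory matcher, which makes the implicit $eq, the implicit AND, and the dot nesting explicit. This is a sketch with hypothetical names, not Vectorize's implementation. In particular, treating a missing property as "not equal" under $ne is an assumption.

```typescript
// In-memory emulation of the filter examples above. Illustration only.
type Primitive = string | number | boolean | null;
type Condition = Primitive | { $eq: Primitive } | { $ne: Primitive };
type Metadata = { [key: string]: Primitive | Metadata };

// Resolves "pandas.nice" against { pandas: { nice: 42 } }
function getByPath(metadata: Metadata, path: string): unknown {
  let current: unknown = metadata;
  for (const segment of path.split(".")) {
    if (typeof current !== "object" || current === null) return undefined;
    current = (current as Record<string, unknown>)[segment];
  }
  return current;
}

function matchesFilter(metadata: Metadata, filter: Record<string, Condition>): boolean {
  // every() gives the implicit logical AND across keys
  return Object.entries(filter).every(([path, condition]) => {
    const actual = getByPath(metadata, path);
    if (typeof condition === "object" && condition !== null) {
      if ("$eq" in condition) return actual === condition.$eq;
      // Assumption: a missing property counts as "not equal"
      return actual !== condition.$ne;
    }
    return actual === condition; // implicit $eq
  });
}

const sample: Metadata = { pandas: { nice: 42 }, someKey: false };
console.log(matchesFilter(sample, { "pandas.nice": 42, someKey: { $ne: true } })); // true
console.log(matchesFilter(sample, { "pandas.nice": 7 })); // false
```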

Limits

You can store up to 10KiB of metadata per vector, and create up to 10 metadata indexes per Vectorize index.

See Vectorize Limits for a complete list of limits.

Examples

Add metadata

With the following index definition:

nuxt.config.ts

export default defineNuxtConfig({
  hub: {
    vectorize: {
      tutorial: {
        dimensions: 32,
        metric: "cosine",
        metadataIndexes: {
          streaming_platform: "string",
          "property.nested": "boolean"
        }
      }
    }
  }
})

Metadata can be added when inserting or upserting vectors.

const index = hubVectorize("tutorial")

const newMetadataVectors: Array<VectorizeVector> = [
    {
        id: "1",
        values: [32.4, 74.1, 3.2, ...],
        metadata: { url: "/products/sku/13913913", streaming_platform: "netflix" },
    },
    {
        id: "2",
        values: [15.1, 19.2, 15.8, ...],
        metadata: { url: "/products/sku/10148191", streaming_platform: "hbo" },
    },
    {
        id: "3",
        values: [0.16, 1.2, 3.8, ...],
        metadata: { url: "/products/sku/97913813", streaming_platform: "amazon" },
    },
    {
        id: "4",
        values: [75.1, 67.1, 29.9, ...],
        metadata: { url: "/products/sku/418313", streaming_platform: "netflix" },
    },
    {
        id: "5",
        values: [58.8, 6.7, 3.4, ...],
        metadata: { url: "/products/sku/55519183", streaming_platform: "hbo" },
    },
];

// Upsert vectors with added metadata; the call returns a mutation identifier for the asynchronous operation
const upserted = await index.upsert(newMetadataVectors);

Query examples

Use the query() method:

const queryVector: Array<number> = [54.8, 5.5, 3.1, ...];
const originalMatches = await index.query(queryVector, {
    topK: 3,
    returnValues: true,
    returnMetadata: 'all',
});

Results without metadata filtering:

{
    "matches": [
        {
            "id": "5",
            "score": 0.999909486,
            "values": [58.79999923706055, 6.699999809265137, 3.4000000953674316],
            "metadata": {
                "url": "/products/sku/55519183",
                "streaming_platform": "hbo"
            }
        },
        {
            "id": "4",
            "score": 0.789848214,
            "values": [75.0999984741211, 67.0999984741211, 29.899999618530273],
            "metadata": {
                "url": "/products/sku/418313",
                "streaming_platform": "netflix"
            }
        },
        {
            "id": "2",
            "score": 0.611976262,
            "values": [15.100000381469727, 19.200000762939453, 15.800000190734863],
            "metadata": {
                "url": "/products/sku/10148191",
                "streaming_platform": "hbo"
            }
        }
    ]
}

The same query() method with a filter property supports metadata filtering.

const queryVector: Array<number> = [54.8, 5.5, 3.1, ...];
const metadataMatches = await index.query(queryVector, {
    topK: 3,
    filter: { streaming_platform: "netflix" },
    returnValues: true,
    returnMetadata: 'all',
});

Results with metadata filtering:

{
    "matches": [
        {
            "id": "4",
            "score": 0.789848214,
            "values": [75.0999984741211, 67.0999984741211, 29.899999618530273],
            "metadata": {
                "url": "/products/sku/418313",
                "streaming_platform": "netflix"
            }
        },
        {
            "id": "1",
            "score": 0.491185264,
            "values": [32.400001525878906, 74.0999984741211, 3.200000047683716],
            "metadata": {
                "url": "/products/sku/13913913",
                "streaming_platform": "netflix"
            }
        }
    ]
}

Namespaces

Namespaces provide a way to segment the vectors within your index. For example, by customer, merchant or store ID.

To associate vectors with a namespace, you can optionally provide a namespace: string value when performing an insert or upsert operation. When querying, you can pass the namespace to search within as an optional parameter to your query.

A namespace can be up to 64 characters (bytes) in length and you can have up to 1,000 namespaces per index. Refer to the Limits documentation for more details.

When a namespace is specified in a query operation, only vectors within that namespace are used for the search. Namespace filtering is applied before vector search, not after.

Insert vectors with a namespace

// Mock vectors
// Vectors from a machine-learning model are typically ~100 to 1536 dimensions
// wide (or wider still).
const sampleVectors: Array<VectorizeVector> = [
    {
        id: "1",
        values: [32.4, 74.1, 3.2, ...],
        namespace: "text",
    },
    {
        id: "2",
        values: [15.1, 19.2, 15.8, ...],
        namespace: "images",
    },
    {
        id: "3",
        values: [0.16, 1.2, 3.8, ...],
        namespace: "pdfs",
    },
];

// Insert your vectors; the call returns a mutation identifier for the asynchronous operation.
const inserted = await index.insert(sampleVectors);

Query vectors within a namespace

// Your queryVector will be searched against vectors within the namespace (only)
const matches = await index.query(queryVector, {
    namespace: "images",
});

Namespace versus metadata filtering

Both namespaces and metadata filtering narrow the vector search space for a query. Consider the following when evaluating both filter types:

A namespace filter is applied before metadata filter(s).
A vector can belong to only one namespace, subject to the documented limits. Vector metadata can contain multiple key-value pairs up to metadata per vector limits. Metadata values support different types (string, boolean, and others), therefore offering more flexibility.

Limits

Feature	Current Limit
Indexes per account	100 indexes
Maximum dimensions per vector	1536 dimensions
Maximum vector ID length	51 bytes
Metadata per vector	10KiB
Maximum returned results (`topK`) with values or metadata	20
Maximum returned results (`topK`) without values and metadata	100
Maximum upsert batch size (per batch)	1000
Maximum index name length	64 bytes
Maximum vectors per index	5,000,000
Maximum namespaces per index	1000 namespaces
Maximum namespace name length	64 bytes
Maximum vectors upload size	100 MB
Maximum metadata indexes per Vectorize index	10
Maximum indexed data per metadata index per vector	64 bytes

Learn more about Cloudflare Vectorize limits.
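
One practical consequence of the batch size limit is that larger datasets need to be split before they are written. The helper below is a hypothetical sketch, not a NuxtHub API. The constant mirrors the "Maximum upsert batch size" row in the table above.

```typescript
// Hypothetical helper: splits items into batches that respect the
// upsert batch size limit from the table above.
const MAX_UPSERT_BATCH_SIZE = 1000;

function toBatches<T>(items: T[], batchSize: number = MAX_UPSERT_BATCH_SIZE): T[][] {
  if (!Number.isInteger(batchSize) || batchSize <= 0) {
    throw new Error("batchSize must be a positive integer");
  }
  const batches: T[][] = [];
  for (let start = 0; start < items.length; start += batchSize) {
    batches.push(items.slice(start, start + batchSize));
  }
  return batches;
}

const batchSizes = toBatches(Array.from({ length: 2500 }, (_, i) => i)).map(
  (batch) => batch.length,
);
console.log(batchSizes); // 1000, 1000, 500
```

Each batch could then be passed to index.upsert() in turn, awaiting one call before starting the next.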

Types