Vector Index API Reference
Index Management Operations
These operations are performed at the client level and don't require connecting to a specific index.
vectorstackai.PreciseSearch.list_indexes
Lists information about all available indexes.
Retrieves metadata for all indexes associated with the current API key.
Returns:
Name | Type | Description |
---|---|---|
list_info |
List[Dict[str, Any]]
|
A list of dictionaries, where each dictionary contains information about an index with the following keys:
|
Source code in src/vectorstackai/client.py
vectorstackai.PreciseSearch.create_index
create_index(index_name, embedding_model_name='none', dimension=None, metric='dotproduct', features_type='dense')
Creates a new vector index with the specified parameters.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
index_name
|
str
|
Name of the index to create. |
required |
embedding_model_name
|
str
|
Name of the embedding model to use. There are two kinds of embedding models: - Integrated models: These are pre-trained models hosted on the vector2search platform. - Non-integrated models: These are custom models hosted on your platform/application. - Set "embedding_model_name" to "none" for using your own embedding model (i.e. non-integrated model). |
'none'
|
dimension
|
Optional[int]
|
Vector dimension (required for non-integrated models). |
None
|
metric
|
Optional[str]
|
Distance metric for comparing dense and sparse vectors. Must be one of "cosine" or "dotproduct". |
'dotproduct'
|
features_type
|
Optional[str]
|
Type of features used in the index. Must be one of "dense" or "hybrid" (sparse + dense). |
'dense'
|
Source code in src/vectorstackai/client.py
vectorstackai.PreciseSearch.index_info
Retrieves information about a specific vector index.
This method searches for the index specified by index_name
within the list of available indexes.
If the index exists, it returns a dictionary containing information about the index. This method
is useful to get information about the index without having to connect to it.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
index_name
|
str
|
Name of the index to retrieve information for. |
required |
Returns:
Name | Type | Description |
---|---|---|
index_info |
Dict[str, Any]
|
A dictionary containing information about the index with the following keys:
|
Source code in src/vectorstackai/client.py
vectorstackai.PreciseSearch.index_status
Retrieves the status of a specific vector index.
This method retrieves the status of the index specified by index_name
.
Here are the possible statuses:
- "initializing": The index is being initialized.
- "ready": The index is ready for use.
- "failed": The index failed to initialize.
- "deleting": The index is being deleted.
- "undergoing_optimization_for_latency": The index is undergoing optimization for better latency and throughput.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
index_name
|
str
|
Name of the index to retrieve status for. |
required |
Returns:
Name | Type | Description |
---|---|---|
index_status |
str
|
The current status of the index. |
Source code in src/vectorstackai/client.py
vectorstackai.PreciseSearch.connect_to_index
Connects to an existing vector index and returns an IndexObject for further operations.
This method searches for the index specified by index_name
within the list of available indexes.
If the index exists, it returns an IndexObject
configured with the current connection parameters,
which can be used to perform operations such as upsert, search, and more on the index.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
index_name
|
str
|
The name of the index to connect to. |
required |
Returns:
Name | Type | Description |
---|---|---|
IndexObject |
IndexObject
|
An object that provides methods to interact with the specified vector index. |
Source code in src/vectorstackai/client.py
vectorstackai.PreciseSearch.delete_index
Deletes a vector index by its name.
Permanently deletes the specified index and all its contents. The deletion is asynchronous, and the deleted index cannot be recovered. Note, this method is useful for deleting an index without having to connect to it.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
index_name
|
str
|
Name of the index to delete. |
required |
ask_for_confirmation
|
bool
|
Whether to ask for confirmation before deleting the index. |
True
|
Source code in src/vectorstackai/client.py
Index Operations
These operations are performed on a specific index after connecting to it.
vectorstackai.objects.index.IndexObject.set_similarity_scale
Set the scale values for dense and sparse similarity scores in hybrid search.
The similarity in a hybrid index is computed as a weighted sum of the dense and sparse similarity scores:
similarity = dense_similarity * dense_scale +
sparse_similarity * sparse_scale
This method allows you to set the scale values for the dense and sparse similarity scores. The scale values must be between 0 and 1.
Note
In a dense index, the scale values are ignored. The similarity is computed as:
similarity = dense_similarity.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
dense_scale
|
float
|
The scale value for the dense similarity score. Defaults to 1.0. |
1.0
|
sparse_scale
|
float
|
The scale value for the sparse similarity score. Defaults to 1.0. |
1.0
|
Source code in src/vectorstackai/objects/index.py
vectorstackai.objects.index.IndexObject.upsert
upsert(batch_ids, batch_metadata=None, batch_vectors=None, batch_sparse_values=None, batch_sparse_indices=None)
Upsert a batch of vectors and associated metadata to the index.
This method upserts a batch of dense or sparse vectors, along with their associated metadata in the index. Note, if a datapoint with the same ID already exists, its metadata, vector, and sparse vector will be updated with the new values.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
batch_ids
|
List[str]
|
List of unique identifiers for each vector. |
required |
batch_metadata
|
Optional[List[Dict[str, Any]]]
|
List of dictionaries containing metadata for each vector. For indexes configured with an integrated embedding model, each dictionary should include a 'text' key whose value is used to compute the embeddings. |
None
|
batch_vectors
|
Optional[List[List[float]]]
|
List of dense vectors (each represented as a list of floats) corresponding to each ID. This field is required for indexes configured with a non-integrated embedding model and should be omitted for indexes configured with an integrated embedding model. |
None
|
batch_sparse_values
|
Optional[List[List[float]]]
|
List of values in the sparse vector (each as a list of floats) corresponding to each ID. Required for upserting in a hybrid index. |
None
|
batch_sparse_indices
|
Optional[List[List[int]]]
|
List of indices in the sparse vector (each as a list of ints) corresponding to each ID. Required for upserting in a hybrid index. |
None
|
Source code in src/vectorstackai/objects/index.py
vectorstackai.objects.index.IndexObject.search
search(top_k=10, query_text=None, query_vector=None, query_sparse_values=None, query_sparse_indices=None, return_metadata=True)
Search the index for entries similar to the query.
Finds entries in the index that are most similar to the provided query. Query can be a text (if using an integrated embedding model) or a dense vector (if using a non-integrated embedding model), along with a sparse vector (if using a hybrid index).
Parameters:
Name | Type | Description | Default |
---|---|---|---|
top_k
|
int
|
Number of top-k results to return (should be >= 1). |
10
|
query_text
|
str
|
Query text (required for integrated embedding models). |
None
|
query_vector
|
List[float]
|
Query vector (required for non-integrated embedding models). |
None
|
query_sparse_values
|
List[float]
|
Query sparse values (required for hybrid indexes). |
None
|
query_sparse_indices
|
List[int]
|
Query sparse indices (required for hybrid indexes). |
None
|
return_metadata
|
bool
|
Whether to return metadata for each result (optional, defaults to True). |
True
|
Returns:
Name | Type | Description |
---|---|---|
search_results |
Dict[str, Any]
|
List of dictionaries containing search results, sorted in descending order of similarity scores. Each dictionary contains:
|
Source code in src/vectorstackai/objects/index.py
vectorstackai.objects.index.IndexObject.info
Get information about the index.
If the index is still being created (i.e., not yet ready), the returned dictionary
includes only "index_name"
and a "status"
(which is "initializing"
). Once the index is configured, the status is set to"ready"
, the returned dictionary includes additional information.
Returns:
Name | Type | Description |
---|---|---|
index_info |
Dict[str, Any]
|
A dictionary containing information about the index with the following keys:
|
Source code in src/vectorstackai/objects/index.py
vectorstackai.objects.index.IndexObject.delete
Deletes the vector index.
Permanently deletes the index and all its contents. The deletion is asynchronous, and the deleted index cannot be recovered.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
ask_for_confirmation
|
bool
|
Whether to ask for confirmation before deleting the index. Defaults to True. When True, the user will be prompted to type 'yes' to confirm deletion. |
True
|
Returns:
Type | Description |
---|---|
None
|
None |
Raises:
Type | Description |
---|---|
ValueError
|
If the index doesn't exist or cannot be deleted. |
Source code in src/vectorstackai/objects/index.py
vectorstackai.objects.index.IndexObject.delete_vectors
Deletes vectors from the index by their IDs.
Permanently removes the specified vectors from the index based on their unique identifiers. The deletion operation cannot be undone. This operation is performed synchronously. Even if one of the IDs does not exist, the operation will raise an error, and the index state will remain unchanged (i.e., no vectors will be deleted).
Parameters:
Name | Type | Description | Default |
---|---|---|---|
ids
|
List[str]
|
A list of string IDs identifying the vectors to delete from the index. Each ID must correspond to a vector previously added to the index. |
required |
Source code in src/vectorstackai/objects/index.py
vectorstackai.objects.index.IndexObject.optimize_for_latency
Optimizes the index for better latency and throughput.
This method triggers an optimization process in the background to improve the latency and throughput of the index for search operations.