Index Management
This section covers various operations for managing your indexes, including getting information about specific indexes, listing all indexes, optimizing indexes for performance, and deleting indexes.
Getting Index Info
There are two ways to get detailed information about a specific index:
-
Using the client's
index_info()
method: -
Using the index object's
info()
method:
Both methods return a dictionary containing metadata including:
index_name
: Name of the indexstatus
: Current status ("initializing" or "ready")num_records
: Number of vectors stored in the indexdimension
: Vector dimensionmetric
: Distance metric used ("cosine" or "dotproduct")features_type
: Type of features stored ("dense" or "hybrid")embedding_model_name
: Name of the integrated embedding model (if any)optimized_for_latency
: Whether the index is optimized for low-latency queries
Index information during creation
If the index is still being created (status is "initializing"
), only basic information will be available.
Once the index is ready, you will have access to all metadata fields.
Checking Index Status
The index_status()
method returns the current status of an index:
Checking index status | |
---|---|
Possible statuses
The possible statuses are:
- "initializing": The index is being initialized.
- "ready": The index is ready for use.
- "failed": The index failed to initialize.
- "deleting": The index is being deleted.
- "undergoing_optimization_for_latency": The index is undergoing optimization for better latency and throughput.
Listing Indexes
The list_indexes()
method returns information about all indexes associated with your API key:
Each index in the returned list contains the same metadata fields as described in the Getting Index Info section.
Optimizing For Latency
Introduction
When your dataset grows large, searching for the nearest neighbors in high-dimensional vector spaces becomes increasingly resource-intensive. To address this challenge, most modern vector databases use approximate nearest neighbor (ANN) search algorithms. These algorithms are typically built on specialized data structures—such as Inverted File Index (IVF), Hierarchical Navigable Small World (HNSW), or Vamana graphs—that enable fast lookups while preserving a high level of accuracy.
Latency Optimization with No Manual Tuning
Most vector databases require manual configuration of various ANN parameters, such as nprobe for IVF or the number of edges per node in HNSW. By contrast, PreciseSearch is designed to automatically select the best data structure and optimize its parameters, eliminating the need for manual tuning and simplifying setup.
To optimize for latency in PreciseSearch, you can use the optimize_for_latency()
method. This method runs an optimization process on the backend to reduce search latency.
- When to Call: You usually need to call this method only once, ideally after you have loaded most of your data. If the size of your index is less than 500,000 vectors, you do not need to call this method.
- Effect on Indexing: You can still insert additional data afterward, but you will not need to run the optimization again.
- Asynchronous Execution: Because this operation runs in the background, the method returns immediately while optimization continues behind the scenes.
Optimizing an index for latency | |
---|---|
Deleting Indexes
Important considerations when deleting indexes
- Deletion is permanent and cannot be undone
- By default, both methods will ask for confirmation before deletion
- You can bypass the confirmation by setting
ask_for_confirmation=False
. - Be extremely careful when disabling confirmation, as deleted indexes cannot be recovered.
There are two ways to delete an index:
-
Using the client's
delete_index()
method:Deleting using the client -
Using the index object's
delete()
method: