概述
向量存储存储 嵌入 数据并执行相似度搜索。接口
LangChain 为向量存储提供了统一的接口,允许您:add_documents- 向存储中添加文档。delete- 按 ID 删除已存储的文档。similarity_search- 查询语义相似的文档。
初始化
要初始化向量存储,请为其提供嵌入模型:添加文档
像这样添加 [Document] 对象(包含 page_content 和可选元数据):
删除文档
通过指定 ID 进行删除:相似度搜索
使用similarity_search 发出语义查询,它将返回最接近的嵌入文档:
k— 返回的结果数量filter— 基于元数据的条件过滤
相似度指标与索引
嵌入相似度可使用以下方式计算:- 余弦相似度
- 欧几里得距离
- 点积
元数据过滤
按元数据(例如来源、日期)过滤可以优化搜索结果:主要集成
选择嵌入模型:OpenAI
OpenAI
Azure
Azure
Google Gemini
Google Gemini
Google Vertex
Google Vertex
AWS
AWS
HuggingFace
HuggingFace
Ollama
Ollama
Cohere
Cohere
Mistral AI
Mistral AI
Nomic
Nomic
NVIDIA
NVIDIA
Voyage AI
Voyage AI
IBM watsonx
IBM watsonx
Fake
Fake
xAI
xAI
Perplexity
Perplexity
DeepSeek
DeepSeek
In-memory
In-memory
Amazon OpenSearch
Amazon OpenSearch
pip
Astra DB
Astra DB
Azure Cosmos DB NoSQL
Azure Cosmos DB NoSQL
Azure Cosmos DB Mongo vCore
Azure Cosmos DB Mongo vCore
Chroma
Chroma
CockroachDB
CockroachDB
Elasticsearch
Elasticsearch
Install the package and start Elasticsearch locally using the start-local script:This creates an Elasticsearch will be available at
elastic-start-local folder. To start Elasticsearch:http://localhost:9200. The password for the elastic user and API key are stored in the .env file in the elastic-start-local folder.FAISS
FAISS
Milvus
Milvus
MongoDB
MongoDB
PGVector
PGVector
PGVectorStore
PGVectorStore
Pinecone
Pinecone
Qdrant
Qdrant
Oracle AI Database
Oracle AI Database
turbopuffer
turbopuffer
Valkey
Valkey
| 向量存储 | 按 ID 删除 | 过滤 | 按向量搜索 | 带分数搜索 | 异步 | 通过标准测试 | 多租户 | 添加文档中的 ID |
|---|---|---|---|---|---|---|---|---|
AstraDBVectorStore | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
AzureCosmosDBNoSqlVectorStore | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ✅ | ✅ |
AzureCosmosDBMongoVCoreVectorStore | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ✅ | ✅ |
Chroma | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
Clickhouse | ✅ | ✅ | ❌ | ✅ | ❌ | ❌ | ❌ | ✅ |
AsyncCockroachDBVectorStore | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
CouchbaseSearchVectorStore | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ✅ |
DatabricksVectorSearch | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ | ✅ |
ElasticsearchStore | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ | ✅ |
FAISS | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ | ✅ |
InMemoryVectorStore | ✅ | ✅ | ❌ | ✅ | ✅ | ❌ | ❌ | ✅ |
LambdaDB | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ |
Milvus | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
Moorcheh | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
MongoDBAtlasVectorSearch | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
openGauss | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ❌ | ✅ |
PGVector | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ | ✅ |
PGVectorStore | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ |
PineconeVectorStore | ✅ | ✅ | ✅ | ❌ | ✅ | ❌ | ❌ | ✅ |
QdrantVectorStore | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ✅ |
Weaviate | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ✅ |
SQLServer | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ | ✅ |
TurbopufferVectorStore | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ | ✅ | ✅ |
ValkeyVectorStore | ✅ | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ | ✅ |
ZeusDB | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ |
Oracle AI Database | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | ✅ |
所有向量存储
Activeloop Deep Lake
Alibaba Cloud MySQL
Alibaba Cloud OpenSearch
AnalyticDB
Annoy
Apache Doris
ApertureDB
Astra DB Vector Store
Atlas
AwaDB
Azure Cosmos DB Mongo vCore
Azure Cosmos DB No SQL
Azure Database for PostgreSQL - Flexible Server
Azure AI Search
Bagel
BagelDB
Baidu Cloud ElasticSearch VectorSearch
Baidu VectorDB
Apache Cassandra
Chroma
Clarifai
ClickHouse
CockroachDB
Couchbase
DashVector
Databricks
IBM Db2
DingoDB
DocArray HnswSearch
DocArray InMemorySearch
Amazon Document DB
DuckDB
China Mobile ECloud ElasticSearch
Elasticsearch
Epsilla
Faiss
Faiss (Async)
FalkorDB
Gel
Google AlloyDB
Google BigQuery Vector Search
Google Cloud SQL for MySQL
Google Cloud SQL for PostgreSQL
Firestore
Google Memorystore for Redis
Google Spanner
Google Bigtable
Google Vertex AI Feature Store
Google Vertex AI Vector Search
Hippo
Hologres
Jaguar Vector Database
Kinetica
LambdaDB
LanceDB
Lantern
Lindorm
LLMRails
ManticoreSearch
MariaDB
Marqo
Meilisearch
Amazon MemoryDB
Milvus
Momento Vector Index
Moorcheh
MongoDB Atlas
MyScale
Neo4j Vector Index
NucliaDB
Oceanbase
openGauss
OpenSearch
Oracle AI Database
Pathway
Postgres Embedding
PGVecto.rs
PGVector
PGVectorStore
Pinecone
Pinecone (sparse)
Qdrant
Relyt
Rockset
SAP HANA Cloud Vector Engine
ScaNN
SemaDB
SingleStore
scikit-learn
SQLiteVec
SQLite-VSS
SQLServer
StarRocks
Supabase
SurrealDB
Tablestore
Tair
Tencent Cloud VectorDB
Teradata VectorStore
ThirdAI NeuralDB
TiDB Vector
Tigris
TileDB
Timescale Vector
Typesense
turbopuffer
Upstash Vector
USearch
Vald
Valkey
VDMS
veDB for MySQL
Vearch
Vectara
Vespa
viking DB
vlite
Volcengine RDS for MySQL
Weaviate
Xata
YDB
Yellowbrick
Zep
Zep Cloud
ZeusDB
Zilliz
Zvec
Connect these docs to Claude, VSCode, and more via MCP for real-time answers.

