Vector databases solve a specific problem: traditional relational databases treat similarity search as an afterthought, forcing you to load embeddings into memory or use expensive approximate nearest neighbor libraries. A purpose-built vector database indexes embeddings for fast similarity queries, making it practical to build RAG pipelines, semantic search features, and agent memory systems that scale from prototype to production.
For AI agents, vector databases are the retrieval layer. An agent needs to quickly find relevant context from a knowledge base—whether that's documentation, past interactions, or domain-specific data—without scanning every embedding in the store. Vector databases handle this with optimized indexing algorithms and hardware acceleration, trading some precision for speed when needed.
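To make the tradeoff concrete, here is a minimal sketch of what a vector database's index replaces: a brute-force scan that scores every stored embedding against the query. The document IDs and three-dimensional "embeddings" below are toy values for illustration; real embeddings have hundreds or thousands of dimensions, which is exactly why the O(n·d) scan stops being practical and ANN indexes (HNSW, IVF, and similar) take over.

```python
import math

def cosine_similarity(a, b):
    # Dot product divided by the product of vector magnitudes.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def brute_force_search(query, store, k=2):
    # Score every stored vector against the query: O(n * d) per query.
    # A vector database replaces this full scan with an approximate index.
    scored = sorted(store.items(),
                    key=lambda item: cosine_similarity(query, item[1]),
                    reverse=True)
    return [doc_id for doc_id, _ in scored[:k]]

# Toy 3-dimensional embeddings (illustrative values only).
store = {
    "doc_api":     [0.9, 0.1, 0.0],
    "doc_pricing": [0.1, 0.9, 0.1],
    "doc_faq":     [0.8, 0.2, 0.1],
}

print(brute_force_search([1.0, 0.0, 0.0], store))  # ['doc_api', 'doc_faq']
```

An ANN index gives up the guarantee of exact top-k results in exchange for sublinear query time, which is the precision-for-speed trade mentioned above.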
The category splits along two practical axes: self-hosted vs. managed (infrastructure overhead vs. operational simplicity) and scale requirements (prototype tooling vs. billions of vectors). A third axis matters if you already run PostgreSQL: embedding vectors directly in your relational database avoids moving data between systems.
Start with deployment preference. If you're prototyping and want zero infrastructure, choose between Chroma (simple, batteries-included) and LanceDB (multimodal data support). If you already run Postgres, pgvector eliminates a dependency. For production, evaluate whether a managed service (Pinecone, Zilliz, Turbopuffer) reduces operational burden enough to justify the vendor dependency.
Scale and cost alignment matter most. pgvector and self-hosted Milvus/Qdrant have no licensing costs but require you to run and manage the infrastructure. Managed solutions charge by usage or compute; Turbopuffer's object storage backend is cost-efficient at very large scales. Pinecone and Zilliz offer compliance features (SOC 2, HIPAA, GDPR) that self-hosted deployments require you to implement yourself.
Feature set matters last. Every tool here does similarity search; the differentiation is in what surrounds it. Weaviate includes vectorizers (models that convert text to embeddings), reducing external dependencies. LanceDB emphasizes multimodal data. Qdrant, Milvus, and Zilliz offer deployment flexibility. pgvector keeps vector queries inside SQL, where they combine with relational filters, which is either a limitation or an advantage depending on your data model.
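The filter-then-rank pattern that pgvector enables can be sketched in plain Python; the rows, metadata fields, and predicate below are hypothetical, and in real pgvector this whole function is a single SQL query (a `WHERE` clause plus an `ORDER BY` on vector distance with a `LIMIT`).

```python
import math

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) *
                  math.sqrt(sum(x * x for x in b)))

# Hypothetical rows: (id, metadata, embedding). In pgvector these are
# ordinary table columns, one of them with the `vector` type.
rows = [
    ("a1", {"lang": "en", "year": 2023}, [0.9, 0.1]),
    ("a2", {"lang": "de", "year": 2024}, [0.8, 0.3]),
    ("a3", {"lang": "en", "year": 2024}, [0.2, 0.9]),
]

def filtered_search(query_vec, predicate, k=1):
    # Apply the relational filter first (the SQL WHERE clause), then rank
    # the surviving rows by similarity to the query vector.
    candidates = [(rid, emb) for rid, meta, emb in rows if predicate(meta)]
    candidates.sort(key=lambda r: cosine_similarity(query_vec, r[1]),
                    reverse=True)
    return [rid for rid, _ in candidates[:k]]

print(filtered_search([1.0, 0.0], lambda m: m["lang"] == "en"))  # ['a1']
```

Standalone vector databases support metadata filtering too, but keeping both halves of the query in one SQL statement is the specific convenience pgvector trades against dedicated indexing features.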
| Name | Best For | Pricing | Key Differentiator |
|---|---|---|---|
| Chroma | Rapid RAG prototyping | Free tier on cloud; serverless paid plans | Minimal setup—start local, migrate to managed |
| LanceDB | Multimodal RAG (video, audio, text) | Free open-source; cloud pricing not published | Embeddable + native multimodal search |
| Milvus | Enterprise-scale self-hosted RAG | Free open-source; Zilliz Cloud for managed | Billions of vectors; works from dev to datacenter |
| pgvector | Combining vectors with relational queries | Free (PostgreSQL License) | Use your existing Postgres; no new infrastructure |
| Pinecone | Production SaaS with compliance | Free tier; usage-based paid | Purpose-built for AI; namespace isolation for multi-tenant |
| Qdrant | Flexible deployment (self-hosted + cloud) | Free tier on cloud; self-hosted free | Rust-based; balances performance and operational flexibility |
| Turbopuffer | Cost-efficient large-scale RAG | $64/month minimum; usage-based scaling | Object storage backend; lowest cost per vector at scale |
| Weaviate | Semantic search with built-in vectorization | See website | Includes ML models for embedding; reduces external dependencies |
| Zilliz | Enterprise Milvus alternative (managed) | Free tier; serverless + dedicated cluster options | Milvus under the hood; handles compliance and BYOC |