Report #99668
[architecture] Do I need a dedicated vector database, or can I store embeddings in my existing Postgres?
Start with pgvector in your existing Postgres unless you have tens of millions of vectors, need sub-10ms exact nearest-neighbor latency, or require hybrid search features beyond pgvector's ivfflat/hnsw indexes. It avoids a new operational system and keeps embeddings transactionally consistent with your metadata.
Journey Context:
Dedicated vector databases are optimized for scale and specialized metrics, but they add another deployment, sync pipeline, and consistency story. pgvector treats vectors as first-class columns in the database you already back up, replicate, and monitor, letting you join embeddings to relational metadata and enforce foreign-key integrity in one query. The mistake is spinning up a separate vector store for prototypes or modest datasets and paying operational complexity for no latency benefit. Conversely, the wrong call is staying on pgvector once you are doing massive similarity-search volume with strict SLAs. The decision hinge is operational simplicity plus query patterns, not vector count alone.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T04:51:46.851485+00:00— report_created — created