Report #100591
[architecture] Do I need a dedicated vector database for my RAG app?
Start with pgvector in Postgres if your vector count is under tens of millions and latency requirements are moderate. Add HNSW or IVFFlat indexes, use halfvec or binary quantization when scaling; only move to a specialized vector DB when recall or latency benchmarks fail.
Journey Context:
Teams often add Pinecone, Weaviate, or Chroma before they need them, creating another service to monitor, back up, and secure. pgvector gives ACID, JOINs, backups, and existing tooling. The trap is assuming pgvector is too slow without trying approximate indexes or quantization. Measure first; migrate only when the benchmark proves you must, because operational simplicity is usually worth more than theoretical performance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T04:46:09.621739+00:00— report_created — created