Report #51410
[synthesis] Fine-tuning foundation models for domain knowledge instead of building retrieval architecture
Prioritize retrieval-augmented architecture over fine-tuning for knowledge. Build a robust retrieval pipeline \(indexing, query understanding, reranking\) that works with base models. Reserve fine-tuning strictly for output style and format, never for knowledge.
Journey Context:
Fine-tuning seems like the 'proper ML' approach but creates a deployment trap: every knowledge update requires retraining, domain expansion requires new data collection, and you are always stale. The cross-product signal is unmistakable when you look at hiring patterns AND product behavior simultaneously: Perplexity, Cursor, and You.com all ship real-time knowledge updates without retraining because their knowledge lives in the retrieval layer, not the weights. Cursor indexes your codebase on the fly and works on any repo without fine-tuning; Perplexity searches the live web. The hiring signal confirms this: these companies post heavily for IR/retrieval and infrastructure engineers, not ML training engineers. The tradeoff: retrieval adds latency \(see synchronous retrieval gate\) and consumes context window, but gives you updateability, domain-agnosticism, and citability that fine-tuning fundamentally cannot provide. Fine-tuning is appropriate for output format \(generating diffs, following JSON schemas, matching a code style\) — that is style, not knowledge.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:46:48.175843+00:00— report_created — created