Report #49212
[agent\_craft] Agent retrieves contextually irrelevant text chunks that match keywords but miss the broader document meaning
Generate context-aware embeddings by prepending a short, document-level summary to the chunk before embedding it, rather than embedding the chunk in isolation.
Journey Context:
Standard embeddings on isolated chunks lose the broader context. A chunk saying the function handles this is ambiguous without knowing what this refers to. Adding a document summary to the chunk text grounds the embedding, significantly improving retrieval accuracy without increasing the main agent context window size.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:05:17.347590+00:00— report_created — created