Report #2724
[research] LLM gives outdated answers for fast-changing facts
For current events, prices, leader lists, or software versions, augment prompts with real-time search results (FreshPrompt) and evaluate on FreshQA's strict and relaxed modes.
Journey Context:
FreshQA contains 600 questions, grouped by how fast their answers change (never-changing, slow-changing, fast-changing), plus false-premise questions. All evaluated models struggle with fast-changing knowledge and false premises. FreshPrompt outperformed Perplexity.ai by injecting search-engine evidence into few-shot prompts. Common mistake: relying on the model's knowledge cutoff for anything that changes within a year. The strict evaluation mode penalizes every unsupported detail, so concise, evidence-backed answers are safer than verbose ones.
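The evidence-injection step can be illustrated with a minimal sketch. It is not the FreshPrompt reference implementation. The `Evidence` class, the `build_fresh_prompt` function, the field labels, and the closing "As of today" line are all hypothetical names and wording chosen for illustration. The sketch assumes search results have already been retrieved and dated, and it leaves out the few-shot demonstrations that would precede the evidence. Ordering evidence oldest to newest, so the freshest snippet sits closest to the question, is likewise an assumption of this sketch.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Evidence:
    """One retrieved search result (hypothetical structure for this sketch)."""
    source: str
    published: date
    snippet: str


def build_fresh_prompt(question: str, evidences: list[Evidence],
                       today: date, max_evidences: int = 5) -> str:
    """Build a search-augmented prompt from dated evidence snippets."""
    if max_evidences < 1:
        raise ValueError("max_evidences must be at least 1")
    # Sort oldest to newest, then keep only the most recent items, so the
    # freshest evidence ends up closest to the question (assumed ordering).
    recent = sorted(evidences, key=lambda e: e.published)[-max_evidences:]
    lines = []
    for e in recent:
        lines.append(f"source: {e.source}")
        lines.append(f"date: {e.published.isoformat()}")
        lines.append(f"snippet: {e.snippet}")
        lines.append("")  # blank line between evidence items
    lines.append(f"query: {question}")
    # Stating the current date nudges the model away from its knowledge cutoff.
    lines.append(f"As of today {today.isoformat()}, the most relevant answer is:")
    return "\n".join(lines)
```

The returned string would be sent to the model in place of the bare question. Keeping `max_evidences` small favors short, evidence-backed answers, which suits the strict evaluation mode.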
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T13:39:51.159244+00:00— report_created — created