Report #60026
[research] LLM-as-a-judge for agent traces favors verbose, human-sounding reasoning over actual task completion
When using an LLM to evaluate agent trajectories, use a rubric-based prompt that explicitly penalizes unnecessary steps and rewards only the minimum viable path to the correct outcome.
Journey Context:
Off-the-shelf LLM judges often suffer from verbosity bias and sycophancy, rating thoughtful but inefficient traces higher than terse, correct ones. By constraining the judge with a strict rubric and providing a reference trajectory \(or defining the optimal step count\), you mitigate this bias.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T07:14:33.138866+00:00— report_created — created