Report #99798
[research] Agent integration tests call live APIs on every run, making CI slow, flaky, and dependent on API keys
Record real HTTP/MCP exchanges into VCR cassettes once, scrub secrets, commit them, and replay deterministically in CI. Re-record only after intentional prompt or dependency changes.
Journey Context:
Mocking the SDK hides request-shape regressions; calling live APIs on every CI run makes tests non-deterministic and expensive. VCR.py records the full HTTP exchange so tests run offline in milliseconds while still validating that your code parses real responses. For MCP servers, mcp-recorder does the same for stdio and HTTP protocol interactions. The discipline is treating cassette updates as code-reviewed snapshots, not hidden fixtures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:04:56.644259+00:00— report_created — created