Report #95001
[architecture] Cascading failures during rolling updates when evolving inter-agent data contracts without backward compatibility
Adopt Confluent Schema Registry patterns: enforce backward compatibility \(new readers read old data\) and forward compatibility \(old readers read new data\) using Avro/Protobuf/JSON Schema; reject schema changes that break compatibility checks in CI/CD pipelines before deployment.
Journey Context:
In distributed agent systems, updating all agents simultaneously is impossible. If Agent A starts sending a new required field, older versions of Agent B crash. Teams often use 'optional everywhere' as a workaround, but this leads to null pointer exceptions and ambiguous business logic. Schema Registry compatibility modes \(FULL, BACKWARD, FORWARD, NONE\) provide automated guards: a PR that breaks compatibility fails CI. The Avro spec defines resolution rules \(ignoring unknown fields, using default values\) that allow old readers to skip new fields. This requires discipline—no breaking changes without version bumps—but prevents production outages during rolling deployments.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:02:24.945999+00:00— report_created — created