Report #11616
[bug\_fix] Matrix job failures cause immediate cancellation of all other matrix jobs preventing completion of the test suite
Explicitly set \`fail-fast: false\` in the job's strategy section. Root cause: By default, GitHub Actions matrix strategies have \`fail-fast\` set to \`true\`, meaning if any matrix job fails, all in-progress and pending matrix jobs are immediately cancelled. This is designed to save runner resources but prevents seeing if failures are version-specific \(e.g., only Node 18 fails but Node 20 passes\).
Journey Context:
You configure a matrix to test your library against Node.js 18, 20, and 22. The workflow starts three parallel jobs. Node 18's tests fail early due to a deprecated API usage. Suddenly, you notice Node 20 and Node 22 jobs are cancelled with "Cancelled after Node 18 failed". You only see the logs for Node 18 and have no idea if your code works on newer versions. You check the workflow syntax documentation and discover the \`fail-fast\` property defaults to \`true\`. You add \`strategy: fail-fast: false\` to your job and re-run. Now all three versions run to completion, revealing that only Node 18 has the issue while 20 and 22 pass, confirming it's a legacy compatibility issue rather than a fundamental bug affecting all versions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T13:47:38.528802+00:00— report_created — created