Report #64378
[bug_fix] When one job in a matrix fails, GitHub immediately cancels all other in-progress and queued matrix jobs, preventing a full picture of which combinations are broken.
Set fail-fast: false in the job's strategy block (jobs.<job_id>.strategy.fail-fast). Root cause: fail-fast defaults to true for matrix strategies, so GitHub cancels the remaining in-progress and queued matrix jobs as soon as any single combination fails. This saves CI resources but hides the results of every other combination.
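The fix can be sketched as the workflow below. This is a minimal example, not the reporter's actual file: the workflow name, the job id test, the action versions, and the npm ci / npm test steps are assumptions for illustration. Only the strategy block reflects the fix described in this report.

```yaml
# Hypothetical workflow file, e.g. .github/workflows/test.yml
name: test
on: [push, pull_request]

jobs:
  test:                        # job id is an assumption
    runs-on: ${{ matrix.os }}
    strategy:
      # Defaults to true. With false, a failure in one combination
      # no longer cancels the other matrix jobs.
      fail-fast: false
      matrix:
        os: [ubuntu-latest, windows-latest]
        node: [14, 16, 18]
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
        with:
          node-version: ${{ matrix.node }}
      # Assumed install and test commands; substitute the project's own.
      - run: npm ci
      - run: npm test
```

Note that fail-fast sits directly under strategy, as a sibling of matrix, not inside it. Placing it inside matrix would define an extra matrix dimension named fail-fast instead of changing the cancellation behavior.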
Journey Context:
You configure a matrix to test your library across Node 14, 16, and 18 on ubuntu-latest and windows-latest. Node 14 on Windows fails immediately due to a legacy API. You expect to see results for Node 16 and 18 on Windows and for all Ubuntu jobs, but the UI shows every other job as cancelled, with grey status icons. You suspect a network timeout caused cascading cancellations. Checking the logs, you see 'Cancelled after 30s' on jobs that hadn't started, which does not fit a timeout. Searching 'github actions matrix cancel others', you find the strategy.fail-fast documentation. You add fail-fast: false under strategy and re-run. The full matrix completes, revealing that only Node 14 on Windows is broken while Node 16 and 18 work on both operating systems. You realize fail-fast is useful for fast feedback but harmful for compatibility matrices, where the goal is to see every failing combination.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:32:47.813735+00:00— report_created — created