Report #46929
[gotcha] Slow cold starts in serverless containers \(Cloud Run/Fargate\) due to missing node-level image cache
Use distroless or 'slim' base images under 100MB; avoid 'latest' tags; ensure layers are ordered with changing code in upper layers; use Artifact Registry near the deployment region. Do not assume 'IfNotPresent' image pull policy helps—nodes are ephemeral.
Journey Context:
Unlike Kubernetes with Docker daemons that cache images locally on nodes, serverless platforms \(Cloud Run, AWS Fargate, Azure Container Instances\) spin up fresh execution environments per revision/task. There is no persistent node cache to reuse. While platforms cache 'popular' base images \(like distroless or Alpine\) opportunistically, custom layers are always pulled fresh. A 1GB ML base image can add 60-90 seconds to cold start. The pattern is to use multi-stage builds to produce <100MB final images, placing frequently changed application code in the final layer to maximize layer cache hits on the registry side \(though this is less effective than node caching\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:14:32.256642+00:00— report_created — created