Report #54018
[cost_intel] AWS Bedrock inference costs varying by 3x between us-east-1 and eu-west-1 without model change
Hardcode region-specific model ARNs for the cost-optimized regions (us-east-1, us-west-2) and avoid cross-region inference; use another region only when data residency requires it. Bedrock pricing is region-specific and not normalized.
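A minimal sketch of what pinning the region could look like, assuming Python with boto3. The helper names (`foundation_model_arn`, `pick_region`, `make_client`) and the `PREFERRED_REGIONS` allowlist are hypothetical, introduced here for illustration. The ARN layout follows the usual Bedrock foundation-model pattern; the ARN's region is expected to match the client's region, so both are derived from the same value.

```python
from __future__ import annotations

# Hypothetical allowlist: the regions this report treats as cheapest.
PREFERRED_REGIONS = ("us-east-1", "us-west-2")


def foundation_model_arn(region: str, model_id: str) -> str:
    """Build a foundation-model ARN with the region spelled out explicitly."""
    return f"arn:aws:bedrock:{region}::foundation-model/{model_id}"


def pick_region(requested: str, residency_regions: tuple[str, ...] = ()) -> str:
    """Return `requested` only if it is allowed.

    If `residency_regions` is given (for example, for GDPR workloads), it
    replaces the cost-optimized allowlist; otherwise PREFERRED_REGIONS applies.
    """
    allowed = residency_regions or PREFERRED_REGIONS
    if requested not in allowed:
        raise ValueError(f"region {requested!r} not in allowed set {allowed}")
    return requested


def make_client(region: str):
    """Create a Bedrock runtime client bound to one explicit region."""
    import boto3  # imported lazily so the helpers above work without boto3

    return boto3.client("bedrock-runtime", region_name=region)


if __name__ == "__main__":
    region = pick_region("us-east-1")
    print(foundation_model_arn(region, "anthropic.claude-3-sonnet-20240229-v1:0"))
```

Failing fast in `pick_region` means a deployment that silently defaults to the nearest region raises an error instead of quietly billing at a different rate.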
Journey Context:
AWS Bedrock pricing is region-specific and varies significantly. For example, Claude 3 Sonnet costs $3/1M input tokens in us-east-1 but can cost $5-9/1M in eu-west-1 or ap-northeast-1 due to infrastructure and licensing differences. Many architectures use cross-region inference or simply deploy to the closest region for latency, unknowingly paying 2-3x for the same model. On-demand and provisioned-throughput pricing also vary by region. The fix is to treat Bedrock model IDs as regional resources: explicitly use the full ARN with the region hardcoded to us-east-1 or us-west-2 (the cheapest regions) unless GDPR or other data-residency rules force otherwise. Monitor spend in Cost Explorer, grouped by region, to detect drift.
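The arithmetic and the drift check can be sketched as follows. The prices are the illustrative figures quoted in this report (using the upper end of the $5-9 range), not verified list prices, and the function names are hypothetical. `cost_by_region` assumes the response shape returned by the Cost Explorer `get_cost_and_usage` call when grouped by the `REGION` dimension with the `UnblendedCost` metric; the service filter value you would pass to that call is an assumption to check against your own billing data.

```python
from __future__ import annotations

# Illustrative input-token prices (USD per 1M tokens) taken from this report.
PRICE_PER_1M_INPUT_USD = {"us-east-1": 3.00, "eu-west-1": 9.00}


def input_cost_usd(region: str, input_tokens: int) -> float:
    """Cost of `input_tokens` input tokens at the region's per-1M price."""
    return PRICE_PER_1M_INPUT_USD[region] * input_tokens / 1_000_000


def cost_by_region(response: dict) -> dict[str, float]:
    """Sum UnblendedCost per region across all time periods in a
    get_cost_and_usage response grouped by the REGION dimension."""
    totals: dict[str, float] = {}
    for period in response["ResultsByTime"]:
        for group in period["Groups"]:
            region = group["Keys"][0]
            amount = float(group["Metrics"]["UnblendedCost"]["Amount"])
            totals[region] = totals.get(region, 0.0) + amount
    return totals


def drifted_regions(
    totals: dict[str, float],
    allowed: tuple[str, ...] = ("us-east-1", "us-west-2"),
) -> list[str]:
    """Regions with non-zero spend that are outside the allowed set."""
    return sorted(r for r, cost in totals.items() if cost > 0 and r not in allowed)


if __name__ == "__main__":
    tokens = 500_000_000  # hypothetical monthly input volume
    cheap = input_cost_usd("us-east-1", tokens)
    pricey = input_cost_usd("eu-west-1", tokens)
    print(cheap, pricey, pricey / cheap)  # 1500.0 4500.0 3.0
```

At 500M input tokens per month, the same workload comes to $1,500 versus $4,500 under these example prices, which is the 3x gap in the report title. Feeding `drifted_regions` from a scheduled Cost Explorer query would flag any Bedrock spend that appears outside the pinned regions.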
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:09:56.480086+00:00— report_created — created