AWS Lambda vs Cloudflare Workers for a latency-sensitive API gateway. 200M requests/month, P99 target <50ms, currently on ECS Fargate.

accepted_conditional Infrastructure scalingsoftware_operational · Pro · 553s · $0.75

7 branches explored · 3 survived · 3 rounds · integrity 75%

WeakStrong

Candidate estimate (inferred)

Risk unknown 553s

Read brief Open timeline MD ↓ Pro JSON ↓ Pro PDF ↓ Ent

Decision timeline Verdict

Migrate the API gateway from ECS Fargate to Cloudflare Workers using an edge-compute + origin-fetch pattern to...

Decision

80%

Execution

—

Uncertainty

—

Decision

Concrete components, topology, and thresholds named below are candidate mitigations or example implementations inferred by the Council. They were not confirmed in your filing or established as part of your current environment.

Deploy Cloudflare Workers as the API gateway using edge-compute + origin-fetch to existing AWS backends. Workers V8 isolates have zero cold starts (sub-1ms spin-up), edge execution adds <5ms, leaving ~45ms for origin fetch — US-East RTT of 10-25ms fits comfortably. Total cost: ~$500/month versus $6K Fargate (94% savings). Lambda is disqualified: Provisioned Concurrency for 770 peak concurrent instances costs $11K-13K/month (40-60% over $8K ceiling), and without it, Lambda P99 is 800ms — 16x the target. Migrate via Cloudflare DNS with percentage-based traffic splitting (10% → 50% → 100% over 5 days), keeping Fargate as fallback. Critical failure mode: if the gateway calls AWS VPC-internal services, cross-cloud RTT of 15-30ms leaves <20ms for backend processing. Mitigate with Cloudflare Tunnel or public endpoints, and validate backend processing time fits within the remaining budget before full cutover.

Next actions

Candidate estimate (inferred, not source-confirmed): Build a Cloudflare Workers PoC proxying 3-5 high-traffic API routes to the existing Fargate ALB using fetch() with standard Web API patterns

backend · immediate

Candidate estimate (inferred, not source-confirmed): Run synthetic load tests at 770 concurrent requests from multiple geographic regions, measuring P99 end-to-end latency including origin fetch to AWS us-east-1

backend · immediate

Audit all API gateway routes for AWS VPC-private service dependencies (DynamoDB, ElastiCache, SQS) and determine if Cloudflare Tunnel or public endpoints are needed

infra · immediate

Candidate estimate (inferred, not source-confirmed): Configure Cloudflare DNS percentage-based traffic splitting starting at 10%, with automated rollback to Fargate ALB if P99 exceeds 50ms

infra · before_launch

Set up unified observability pipeline (OpenTelemetry) spanning Workers analytics and CloudWatch to track P99 latency, error rates, and cost per request across both platforms during migration

infra · before_launch

This verdict stops being true when

Candidate estimate (inferred, not source-confirmed): Backend processing time for critical routes exceeds 20ms AND cross-cloud RTT is consistently >25ms, making the combined latency exceed 50ms P99 → Optimize the existing Fargate deployment (right-size tasks, enable connection reuse, add caching layer) rather than migrating to edge compute, or co-locate the gateway on Lambda within the same VPC

Candidate estimate (inferred, not source-confirmed): API gateway requires heavy compute (>50ms CPU time per request for auth/transformation) that exceeds Workers CPU limits or makes Workers Unbound pricing uncompetitive → Lambda with Provisioned Concurrency if budget ceiling can be raised, or optimized Fargate with reduced instance count

Organization mandates single-cloud AWS policy or compliance requirements prohibit request data transiting non-AWS infrastructure → Lambda@Edge + CloudFront for edge latency within AWS ecosystem, accepting higher cost

Full council reasoning, attack grid, and flip conditions included with Pro

Council notes

Vulcan

Explore Cloudflare Workers for handling latency-sensitive API workload migration from ECS Fargate. Focus on cold star...

Socrates

Reframe the problem as 'eliminating the API gateway entirely' rather than 'choosing between Workers and Lambda'. Depl...

Daedalus

**Recommendation: Cloudflare Workers** for the API gateway migration, with a specific architectural pattern of edge-c...

Loki

All branches assume edge compute solves latency without considering operational overhead: Cloudflare Workers forces m...

Evidence boundary

Observed from your filing

AWS Lambda vs Cloudflare Workers for a latency-sensitive API gateway. 200M requests/month, P99 target <50ms, currently on ECS Fargate.

Assumptions used for analysis

The API gateway primarily performs routing, auth, and lightweight transformation — not compute-heavy processing that would exhaust Workers CPU limits
Backend services are accessible via public endpoints or can be exposed via Cloudflare Tunnel without prohibitive latency
Traffic is predominantly US-centric, making the 10-25ms RTT estimate to us-east-1 representative
The $8K/month cost ceiling is a hard constraint that disqualifies Lambda Provisioned Concurrency
Current ECS Fargate can remain operational as a fallback during the 5-day migration window
team size defaulted: standard team (5-10 engineers) (not_addressed)

Inferred candidate specifics

These details were introduced by the Council during analysis. They were not supplied in your filing.

Deploy Cloudflare Workers as the API gateway using edge-compute + origin-fetch to existing AWS backends. Workers V8 isolates have zero cold starts (sub-1ms spin-up), edge execution adds <5ms, leaving ~45ms for origin fetch — US-East RTT of 10-25ms fits comfortably. Total cost: ~$500/month versus $6K Fargate (94% savings). Lambda is disqualified: Provisioned Concurrency for 770 peak concurrent instances costs $11K-13K/month (40-60% over $8K ceiling), and without it, Lambda P99 is 800ms — 16x the target. Migrate via Cloudflare DNS with percentage-based traffic splitting (10% → 50% → 100% over 5 days), keeping Fargate as fallback. Critical failure mode: if the gateway calls AWS VPC-internal services, cross-cloud RTT of 15-30ms leaves <20ms for backend processing. Mitigate with Cloudflare Tunnel or public endpoints, and validate backend processing time fits within the remaining budget before full cutover.
Deploy a Cloudflare Workers proof-of-concept that proxies 3-5 representative API routes to the existing Fargate ALB, measure P99 end-to-end latency including origin fetch under synthetic load matching peak traffic patterns (770 concurrent requests), and validate that cross-cloud RTT + backend processing fits within the 50ms budget.
Branch b003 had the highest confidence (0.82), survived 3 rounds of adversarial review including strengthening in rounds 1 and 2, named specific cost thresholds ($500/month vs $11-13K Lambda), specific latency breakdowns (sub-1ms isolate, <5ms edge, 15-30ms cross-cloud RTT), concrete failure modes (VPC access, backend processing budget), and a specific migration timeline. Reframe branches b006 and b007 raised valid strategic considerations but neither provided actionable recommendations.
Hybrid Cloudflare Workers + Lambda (80/20 routing split)
Killed in round 3. Adds Lambda cold start risk for 20% of traffic, increases operational complexity, and total cost of $7.2K/month approaches the ceiling without meaningful latency advantage over pure Workers.
AWS API Gateway + Lambda@Edge + Global Accelerator (single-ecosystem approach)
Killed in round 2. Global Accelerator adds significant cost ($0.025/GB + accelerator fees), Lambda@Edge pricing is higher per request at 200M/month scale, and multi-cloud operational overhead was overstated — Shopify runs Workers edge + AWS origin successfully with standard observability tooling.
Valid strategic consideration (b006) but is a reframe, not an actionable recommendation. The 50ms P99 target and 200M req/month are stated constraints — if most users are near us-east-1, the Workers approach still wins on cost and cold start elimination.

Inferred specifics table

Structured audit rows for Council-added details. Synthetic basis means the detail was introduced by analysis, not supplied by the filing.

Value	Kind	Basis	Where introduced
sub-1ms spin-up	threshold	synthetic	chosen_path
edge execution adds <5ms	threshold	synthetic	chosen_path
leaving ~45ms for origin fetch —	threshold	synthetic	chosen_path
RTT of 10-25ms fits comfortably	threshold	synthetic	chosen_path
Total cost: ~$500/month versus $6K Fargate	estimate	synthetic	chosen_path
94% savings	threshold	synthetic	chosen_path
Concurrency for 770 peak concurrent instances costs	estimate	synthetic	chosen_path
instances costs $11K-13K/month	estimate	synthetic	chosen_path
40-60% over $8K ceiling	threshold	synthetic	chosen_path
Lambda P99 is 800ms — 16x the target	threshold	synthetic	chosen_path
10% → 50% → 100% over 5 days	threshold	synthetic	chosen_path
RTT of 15-30ms leaves <20ms for backend	threshold	synthetic	chosen_path
15-30ms leaves <20ms for backend processing	threshold	synthetic	chosen_path
that proxies 3-5 representative API routes to	estimate	synthetic	next_action
770 concurrent requests	estimate	synthetic	next_action
0.82	estimate	synthetic	selection_rationale
Branch b003 had the highest confidence	estimate	synthetic	selection_rationale
in rounds 1 and 2	estimate	synthetic	selection_rationale
82	estimate	synthetic	selection_rationale
$500/month vs $11-13K Lambda	estimate	synthetic	selection_rationale

Unknowns blocking a firmer verdict

Actual backend processing time for AWS-internal services is unknown — if complex queries take >20ms, the 50ms P99 budget may be violated when combined with 15-30ms cross-cloud RTT
Whether the API gateway requires access to AWS VPC-private services (DynamoDB, ElastiCache, SQS) is unstated — this determines whether Cloudflare Tunnel or public endpoints are needed, adding latency and complexity
The $500/month Workers cost estimate depends on CPU duration staying low; compute-heavy gateway logic (auth, transformation, validation) could push Workers Unbound costs higher
Geographic distribution of users is unknown — the 10-25ms RTT estimate assumes US-centric traffic hitting US edge nodes; global traffic patterns could differ
Cost numbers for Lambda Provisioned Concurrency are model-estimated, not sourced from AWS pricing calculator with the specific runtime/memory configuration

Operational signals to watch

reversal — Candidate estimate (inferred, not source-confirmed): Backend processing time for critical routes exceeds 20ms AND cross-cloud RTT is consistently >25ms, making the combined latency exceed 50ms P99

reversal — Candidate estimate (inferred, not source-confirmed): API gateway requires heavy compute (>50ms CPU time per request for auth/transformation) that exceeds Workers CPU limits or makes Workers Unbound pricing uncompetitive

reversal — Organization mandates single-cloud AWS policy or compliance requirements prohibit request data transiting non-AWS infrastructure

Branch battle map

Battle timeline (3 rounds)

Round 1 — Initial positions · 3 branches

Branch b001 (Vulcan) eliminated — Branch b001 is structurally hollow — it says 'Explore C...

Socrates proposed branch b004

Loki proposed branch b005

Socrates Reframe the problem as optimizing the entire API gateway architecture for latenc…

Loki All branches assume edge compute solves latency without considering operational …

Round 2 — Adversarial probes · 3 branches

Branch b004 (Socrates) eliminated — auto-pruned: unsupported low-confidence branch

Branch b005 (Loki) eliminated — The proposal for AWS API Gateway + Lambda@Edge + Global A...

Socrates proposed branch b006

Socrates Reframe the problem around geographic distribution of users and backend services…

Round 3 — Final convergence · 3 branches

Branch b002 (Socrates) eliminated — The hybrid Cloudflare Workers + Lambda architecture in b0...

Socrates proposed branch b007

Socrates Reframe the problem as 'eliminating the API gateway entirely' rather than 'choos…

Markdown JSON

Council chamber

Vulcan

Engineer

Socrates

Analyst

Daedalus

Architect

Loki

Disruptor

2d04c01e-1a9b-4328-b60c-8b5dc767c43f · Protocol

Council archetypes represent independent reasoning perspectives. They are not individuals but structured reasoning roles.

This verdict is a structured reasoning artifact, not professional advice. VectorCourt does not provide legal, financial, medical, or other professional advice. You are responsible for your own decisions.

VectorCourt · Pricing · Terms · Privacy