โ† All Reports
Tomales Bay Capital - Research Memo

The Compute Arms Race

Capacity, Projections, and the Recursive Horizon - Mapping the $725B infrastructure buildout reshaping the global energy grid

๐Ÿ“… May 20, 2026 (Updated May 21) ยท ๐Ÿ”ญ Galileo Research
Hyperscaler Capex 2026
~$725B
Big 4 only; +77% YoY
US DC Power Demand
2ร— by 2027
Goldman Sachs (May 2026)
Global DC Capacity
103 โ†’ 200 GW
By 2030 (JLL); AI-specific: ~30 GW (Epoch AI)
Nvidia DC Revenue
~$73B/qtr
Q1 FY27 consensus est.
Transformer Lead Time
160+ weeks
3+ years; rising

Executive Summary

The world's largest technology companies are collectively committing over $725 billion in capital expenditure in 2026 alone - nearly doubling 2025 levels and exceeding global oil & gas upstream investment for the first time.[1] This report maps the compute buildout across hyperscalers and AI labs, projects US and global datacenter power demand through 2030, identifies the physical bottlenecks constraining deployment, and analyzes the "recursive horizon" - the phenomenon where AI efficiency gains are overwhelmed by exploding inference demand from reasoning and agentic workloads.

Bottom line: We are witnessing the largest infrastructure buildout since rural electrification, compressed into a decade. The binding constraint is not capital - it's watts. Every major player is now effectively in the power business. For seed investors, the opportunity set lies in the picks-and-shovels of power delivery: grid-enhancing technologies, onsite generation, battery storage, cooling, and the software layer that optimizes utilization of constrained compute.

Section 1

The Capex Supercycle: Hyperscaler & Lab Compute Inventory

The scale of capital deployment into AI infrastructure in 2026 has no precedent in the history of enterprise technology. Combined capital expenditure of the five largest US cloud and AI infrastructure providers - Amazon, Microsoft, Alphabet, Meta, and Oracle - is projected between $725 billion and $775 billion for 2026, nearly doubling 2025 levels and exceeding global investment in oil and natural gas production.[1][7] In Q1 2026 alone, Alphabet, Amazon, Microsoft, and Meta reported combined quarterly capex exceeding $130 billion.[8]

2026 Capex Guidance by Hyperscaler
Sources: Q1 2026 earnings calls and annual guidance; all figures in $B USD

Individual Hyperscaler & Lab Profiles

Company Operational Capacity Contracted / Under Construction 2026 Capex Key Power Deals
Microsoft / Azure ~6 GW as of Q3 FY2026. Added 1 GW in a single quarter; ~5 GW at FY2025 end. 400+ DCs, 70+ regions[52] Multi-GW pipeline. Still "capacity constrained" through 2026 (CFO Amy Hood)[52] ~$190B[10] World's largest corporate renewable buyer (34 GW PPAs). TerraPower nuclear: 345 MW by late 2027. Revenue/GW: $6.2B annualized[52]
Amazon / AWS Not disclosed. Cumulus Data campus (PA): 960 MW, co-located with 2.5 GW Susquehanna nuclear[53] Massive pipeline. Project Rainier (Anthropic). 1 GW+ Trainium2/3 by end 2026[14] ~$200B[9] Talen/Susquehanna nuclear PPA: 1,920 MW, $18B, 17 years - largest nuclear-to-DC deal ever. Trainium to save "tens of billions" annually[53]
Alphabet / Google Not disclosed at GW level. Consumed 30.8 TWh in 2024 (doubled from 2020)[54] Intersect Power acq ($4.75B): "multiple GW" of energy + DC projects[54] $180-190B[10] Intersect Power ($4.75B). Kairos Power/TVA: 50 MW nuclear. NextEra partnership. Cloud backlog $462B. 2027 capex to "significantly increase"[54]
Meta 26 operational DCs globally (end 2025). Total MW not disclosed[55] Hyperion (Richland Parish, LA): 2-5 GW campus, 4M sq ft. Entergy: 6.7 GW dedicated generation (10 gas plants). $27B off-balance-sheet (Blue Owl)[55] $125-145B[10] Entergy: 6,700 MW new gas gen (~half of all Entergy Louisiana capacity). Exploring nuclear[55]
Oracle Not disclosed. 8 AI superclusters under development[11] Stargate partner (joint sites with OpenAI/SoftBank). Backlog $523B[11] ~$50B (FY2026)[11] Primary infra partner for Stargate (Abilene TX + 5 sites). Q3 FY2026 capex: $9.1B/qtr[11]
OpenAI / Stargate Under construction. First site: Abilene, TX[12] ~7 GW planned; 10 GW target. UAE: up to 5 GW (G42)[12][13] $500B (SoftBank, Oracle, OpenAI)[12] Oracle as primary developer. Crusoe Energy, JPMorgan $2.3B loan. 4.5 GW Oracle-OpenAI joint development[12]
Anthropic No owned DCs. Uses >1M Google TPUs + >1M AWS Trainium chips[14][15] >5 GW committed across partners. Google >1 GW. AWS 1 GW Trainium2/3. Broadcom "multiple GW." Leasing xAI Colossus 1[14][15][18] Via partners: Google $40B, Amazon $33B, MS/Nvidia $15B[16] Most compute-diversified frontier lab. Multi-cloud = hedge + signal no single hyperscaler can satisfy frontier demand[15]
xAI / Colossus 555K GPUs (H100/H200/GB200). ~2 GW site. Built 100K GPUs in 122 days[17] Target: 1M GPUs. Colossus 1 now leased to Anthropic[18] $18B silicon at Colossus[17] GE Vernova / Solar Turbines gas for initial power. Memphis chosen for power access[33]
Key Insight

AI margins are already better than early cloud margins were at a comparable stage, per Microsoft CFO Amy Hood.[10] Microsoft's revenue per GW of datacenter capacity reached $6.2B annualized in Q3 FY2026, up from $4.8B a year earlier - AI extracts far more economic value per watt than traditional cloud.[52] This is the single most important signal for sustainability of the capex cycle.

Who Commands the Compute: User vs. Provider View

The table above shows who owns datacenter infrastructure. But who actually uses the compute? An analysis by Aymeric Roucher, building on Epoch AI's Frontier Data Centers dataset and supplementary curated data, reveals a strikingly different picture when capacity is attributed to the tenant running the workloads rather than the facility owner:[89][90]

Compute UserTotal Capacity (GW)Owns DCs?Key Insight
OpenAI15.3 GWNo (Stargate under construction)Largest compute consumer globally. Capacity via Oracle, Azure, SoftBank JVs
Google DeepMind6.0 GWYes (Google-owned)Vertically integrated: owns DCs, designs TPUs, trains models
Anthropic5.1 GWNoMost diversified: Google TPUs + AWS Trainium + xAI Colossus + Broadcom
Meta3.8 GWYesFully self-hosted. Hyperion (2-5 GW) will make Meta the largest owner-operator
Microsoft2.0 GWYes (~6 GW owned)Owns 3ร— what it consumes for its own AI โ€” the rest serves Azure tenants
xAI1.5 GWYes (Colossus)Colossus 1 now leased to Anthropic; building Colossus 2
Who Commands vs. Who Owns the Compute
Source: Roucher (2026) via Epoch AI data. GW includes owned + rented capacity. [89]
The Demand vs. Supply Mismatch

Second-Order The user-view reveals a structural asymmetry: the largest consumers of compute (OpenAI, Anthropic) own none of the physical infrastructure, while the largest owners (Microsoft, Amazon, Google) derive much of their value from serving these tenants. OpenAI commands 15.3 GW despite owning zero operational datacenters โ€” it is the world's largest compute renter. Third-Order This creates a power dynamic where infrastructure providers hold structural leverage over AI labs, and any disruption in these partnerships (as seen with the weakening Microsoft-OpenAI relationship) has immediate compute access implications. Labs that diversify their compute sources (Anthropic's multi-cloud strategy) are hedging this concentration risk.[89]

GW Deployment Trajectory by Company (Wells Fargo / BNEF)

Provider2023202420252026E2027E2028E
AWS7.2009.20013.13918.58524.19529.295
GCP4.0005.6257.60511.97617.27721.896
Azure4.6006.3008.90012.70018.20022.500
Oracle0.5750.7751.4553.4906.20510.875
Meta4.9756.0007.50011.35315.83920.548
Total CSPs21.35027.90038.59958.05781.659104.957
xAI00.2121.3142.0004.5007.000
Neoclouds*0.1120.4652.2177.35713.39421.941
Combined Total21.46228.57742.13067.41499.553133.898
Incremental Y/Y-+7.115+13.553+25.285+32.139+34.345

*Neoclouds: CoreWeave, Nebius, Nscale, Crusoe, FluidStack, Lambda, IREN. Source: Wells Fargo Securities LLC Estimates; BloombergNEF.[84]

Global DC Capacity: Live vs. Under Construction vs. Pipeline (BNEF)
Source: BloombergNEF historical data. All values in GW.

Lead Times, Delays & The Supplier Margin Explosion

The gap between announced capacity and energized capacity is the most important structural risk in the compute buildout. Hyperscalers can commit hundreds of billions, but physical infrastructure moves on the timeline of transformers and turbines - not software.

Transformer Manufacturing Crisis

Substation transformer lead times have worsened steadily: ~140 weeks in 2023, ~150 weeks in 2025, exceeding 160 weeks (3+ years) in 2026.[4] US GSU transformer demand soared 274% between 2019 and 2025 (Wood Mackenzie).[47] Supply deficit was ~100% in 2025, projected <10% by 2030 as new capacity comes online.[46] Eaton committed $340M for a third US transformer plant; Hitachi Energy and Siemens scaling globally. But new factories take 2-3 years to build - bottleneck persists through 2028.[60]

Gas Turbine Delivery Backlogs

The IEA documented a 70% surge in global gas turbine orders in 2025.[1] GE Vernova's gas power backlog expanded from 62 GW to 83 GW (Q4 2025), targeting 110 GW by year-end 2026. Power orders surged 77% QoQ, with 41 heavy-duty gas turbines ordered in one quarter.[56] Siemens Energy: orders โ‚ฌ58.9B (+19.4% YoY), multiyear backlogs.[57]

Grid Interconnection Queue by ISO Region

ISO / RegionQueue BacklogWait TimeKey Dynamic
ERCOT (Texas) 226 GW large load (Nov 2025). 77% data centers[58] 4ร— in one year (from 63 GW). Only 23 GW new gen added 2024-2025 "Phantom load" problem: speculative requests. Texas law (Jul 2025) now requires transparency[58]
PJM (Mid-Atlantic) ~46 GW remaining. 160 GW studied total[59] New cycle: 1-2 years (vs. historic 5-6+). Transition Cycle 2 processing through end 2026 Includes NoVA (world's largest DC market). Downstream transmission upgrades still lag[59]
MISO / CAISO / Others Implementing FERC Order 2023 cluster-based reforms 3-5+ years typical. CAISO: 4+ years for generation interconnection historically Generation queue: 77% solar + storage (ERCOT). Load queues less standardized than gen queues

The Announced-vs-Energized Gap

Industry data compiled by @negligible_cap shows the full scope of the buildout across all player categories - including xAI (0.2 โ†’ 7.0 GW) and neoclouds like CoreWeave, Nebius, and Crusoe (0.1 โ†’ 21.9 GW). Combined total: 21.5 GW (2023) โ†’ 133.9 GW (2028E) - a 6.2ร— increase in five years.[84] Incremental deployment is projected at +25 GW by end of 2026, +32 GW more by end of 2027, and +34 GW more by end of 2028. But the gap between announced and energized is wide: Sightline Climate found 12 GW US capacity announced for 2026 with only 5 GW under construction (58% gap).[31] Bloomberg (Apr 2026): >50% of US DCs planned for 2026 delayed.[32] JLL: >50% of global projects delayed 3+ months in 2025; equipment lead times 33 weeks (50% above pre-2020).[6] LBNL "Queued Up" data: historically only 13% of capacity in interconnection queues reaches commercial operation; median wait 4.5 years.[45]

The Supplier Margin Story

CompanyMargin TrajectoryKey Data
GE Vernova Near-zero at spin-off (Apr 2024) โ†’ 12-14% EBITDA (2026, raised twice) Power profit +57%. Electrification profit doubled. Stock +13% on earnings. Revenue: $44.5-45.5B. Gas turbine backlog: 110 GW target[56]
Siemens Energy Negative (wind losses) โ†’ 6% profit margin, higher in grid tech Orders โ‚ฌ58.9B (+19.4%). Multiyear gas turbine backlogs[57]
Eaton Electrical segment margins consistently 20%+ $340M new US transformer plant. DC power = fastest-growing vertical[60]
Vertiv Gross margin: 28.4% (2022) โ†’ 37.8% (Q3 2025) Power management / cooling for DCs. 9.4 ppt margin expansion in 3 years[50]
The Supplier Arbitrage

GE Vernova is the clearest example: near-zero-margin spin-off (Apr 2024) to 12-14% EBITDA by 2026, driven by pricing power from DC and grid demand. Gas turbine backlog: 62 GW โ†’ 110 GW in ~18 months. Vertiv's gross margin expanded nearly 10 points in 3 years. When hyperscalers commit $725B in a single year, suppliers of physical infrastructure can name their price. This dynamic won't normalize before 2028-2029.[56][50]

AI Labs: The New Compute Consumers

The frontier AI labs have emerged as among the largest individual consumers of compute, rivaling the hyperscalers' own internal workloads:

Lab / Project Compute Footprint Power Capacity Capital Committed
OpenAI / Stargate 10 GW target; ~7 GW planned across multiple sites[12] First site: Abilene, TX (Oracle). 5 additional sites announced Sep 2025 $500B total commitment (SoftBank, Oracle, OpenAI). $100B deployed immediately. UAE expansion: up to 5 GW (G42)[13]
Anthropic Multi-cloud: 1M+ Google TPUs; 500-600K AWS Trainium chips; leasing xAI Colossus 1[14] Google: >1 GW in 2026. AWS: additional 1 GW Trainium2/3 by end of 2026. Broadcom: "multiple gigawatts"[15] Google: up to $40B investment. Amazon: up to $33B total ($25B latest round). Microsoft/Nvidia: $15B. $30B+ ARR[16]
xAI / Colossus 555,000 Nvidia GPUs (H100/H200/GB200) as of Feb 2026. Target: 1M GPUs[17] ~2 GW site capacity (Memphis, TN). Built 100K GPUs in 122 days; doubled to 200K in 92 more $18B in silicon at Colossus alone. Anthropic leasing Colossus 1 as of May 2026[18]
Anthropic as the Multi-Cloud Wildcard

Anthropic has become the most compute-diversified frontier lab in history - simultaneously running on Google TPUs, AWS Trainium, Azure GPUs, and now leasing xAI's Colossus facility. Total committed capacity across all partnerships exceeds 5 GW. This multi-cloud strategy is both a hedge against vendor lock-in and a signal that no single hyperscaler can satisfy frontier training demand alone.[14][15]

Nvidia: The Arms Dealer

Nvidia remains the central beneficiary of the compute arms race. Data center revenue reached $39.1 billion in Q1 FY2026 (calendar Q1 2025), up 73% year-over-year, and grew to $51.2 billion in Q3 FY2026 (+66% YoY).[19][20] For Q1 FY2027 (calendar Q1 2026), consensus estimates project data center revenue of approximately $73 billion - nearly doubling in 12 months.[21]

Nvidia Data Center Revenue (Quarterly)
Source: Nvidia earnings releases & consensus estimates. Fiscal year ends January.

The Blackwell NVL72 architecture is now in "full-scale production" across all major cloud providers, with the next-generation GB300 sampling and the Vera Rubin platform announced at GTC 2026 - promising 10ร— inference throughput per watt versus Blackwell.[19][22] Nvidia's annual product cadence through 2028 means each generation of datacenter is obsolescent before the next one is built - a structural advantage that forces continuous capex.

Section 2

The Prediction Gap: Forecasts vs. Reality

Before projecting forward, we need to ask: how reliable are the forecasts we're relying on? The answer - based on tracking forecast vintages from the same sources - is more nuanced than the common narrative of "everyone underestimates." Some forecasters have overshot near-term. Others have consistently revised upward. And the gap between major forecasters may matter more than the gap between any individual forecast and reality.

Goldman Sachs: The Serial Reviser

Goldman Sachs has been the most prolific publisher of datacenter power forecasts - and the most consistent at raising them. Tracking their 2030 global datacenter power demand forecast (expressed as % increase vs. 2023 baseline of ~415 TWh):

Date PublishedReport Title2030 Forecast (% increase vs. 2023)Implied 2030 TWh
May 2024"AI Is Poised to Drive 160% Increase in Data Center Power Demand"+160%~1,079 TWh[61]
February 2025"AI to drive 165% increase in data center power demand by 2030"+165%~1,100 TWh[62]
November 2025"The 6 Ps Driving Growth and Constraints"+175%~1,141 TWh[24]
April 2026Latest revision (Kobeissi Letter / Benzinga reporting)+220%~1,350 TWh[63]

Goldman raised their 2030 target by 60 percentage points in 23 months - from +160% to +220%. The April 2026 revision was the most aggressive single jump (+45 ppts from the November 2025 figure). They also increased the US share of global demand from 50% to 60%, projecting US datacenter consumption alone at ~750 TWh by 2030.[63]

The Prediction Gap: Goldman Sachs 2030 Forecast Revisions vs. IEA Base Case
Sources: Goldman Sachs Research publications (May 2024-Apr 2026), IEA Energy and AI (Apr 2025, Apr 2026). Global datacenter electricity demand in TWh.

IEA: Stable Base, But Initial Near-Term Overshoot

The IEA's forecast history tells a more surprising story. Their January 2024 Electricity 2024 report contained the headline: "Electricity consumption from data centres, AI and the cryptocurrency sector could double by 2026."[64] With the 2022 global baseline at ~460 TWh, this implied ~920 TWh by 2026. Actual 2025 consumption came in at ~485 TWh, and 2026 is projected around 550-600 TWh[1] - well below the "doubling" scenario.

The IEA overestimated near-term growth. Their high-scenario was too aggressive for the 2024-2026 window. However, their 2030 base case has been remarkably stable: 945 TWh in the April 2025 report, ticking up to 950 TWh in the April 2026 update.[1] This stability may reflect the IEA's recognition that physical bottlenecks (grid interconnection, transformer supply) create a ceiling on near-term deployment - demand exists, but infrastructure can't deliver it fast enough.

The IEA's US-specific forecast for 2026 (260 TWh, published Jan 2024) appears roughly on track - their own November 2025 data projects >250 TWh for 2026.[25] The near-term US projections have held better than the global aggregate.

McKinsey: The Most Aggressive - and Arguably Most Accurate

McKinsey published the most granular US datacenter electricity forecast in their November 2024 "AI's Power Binge" report:[65]

YearMcKinsey US DC Forecast (TWh)McKinsey (% of US total)IEA Actual/EstimateDelta
2023147 TWh3.7%154 TWh (IEA)-4.5%
2024178 TWh4.3%183 TWh (IEA)-2.7%
2025224 TWh5.2%>200 TWh (IEA proj.)TBD
2026292 TWh6.5%>250 TWh (IEA proj.)TBD
2028450 TWh9.3%~350 TWh (IEA proj.)-
2030580 TWh11.7%--

McKinsey's 2023-2024 estimates were within 3-5% of IEA's later actuals - the best near-term accuracy of any forecaster surveyed. Their 2026 projection of 292 TWh is notably higher than IEA's >250 TWh, potentially reflecting faster AI adoption than IEA models. On global capacity, McKinsey projected 171-219 GW by 2030 (Oct 2024), which they later refined to 219 GW (May 2025).[66]

Forecaster Comparison Table

ForecasterTrack RecordCurrent 2030 Global Estimate
Goldman Sachs Has consistently underestimated. Four upward revisions in 23 months (+60 ppts). Most reliable as a leading indicator: when GS raises, the market follows ~1,350 TWh global (+220% vs 2023). US: ~750 TWh[63]
IEA Overshot near-term ("could double by 2026" didn't happen) but stable on 2030 base case. Cautious, bottleneck-aware. US near-term forecasts surprisingly accurate ~950 TWh global (base case). High case not quantified publicly[1]
McKinsey Most accurate near-term (within 3% on US 2024). Most aggressive capacity projections. Has been validated by subsequent data 219 GW global capacity. US: 580 TWh. $7T cumulative investment[65][66]
The Honest Answer

The prediction gap is not a simple story of universal underestimation. Three dynamics are at work:

1. Goldman Sachs has been a serial underestimator - and they know it. Each revision admits the prior one was too low. Their trajectory suggests 220% may not be the final number.

2. The IEA is bottleneck-aware - their stable 950 TWh base case may reflect a realistic ceiling imposed by grid/transformer constraints, not conservative demand modeling. If the bottlenecks ease faster than expected, their number will look too low.

3. The gap between forecasters is as important as the gap between forecast and reality. IEA says 950 TWh; Goldman says 1,350 TWh. That's a 42% spread on a 4-year forecast. McKinsey's capacity numbers (219 GW) imply even more if utilization rates hold. The honest answer: nobody knows, and the range of plausible outcomes is wider than any single number suggests.

The Correction Band

If Goldman's average miss rate holds (~15-20% per annual revision), their current 1,350 TWh estimate for 2030 could be revised to 1,550-1,600 TWh by early 2027. But - and this is critical - physical infrastructure sets a hard ceiling on what can actually be deployed. The LBNL "Queued Up" data shows only 13% of capacity in interconnection queues reaches commercial operation.[45] The real question is not "how much demand exists?" but "how fast can supply respond?" The answer to the second question likely puts realized 2030 demand closer to IEA's 950 TWh than Goldman's 1,350 TWh - even if underlying demand would support the higher number.

Section 3

Forward Extrapolation & Cost Analysis

Where Does Cost Accrue? The $7 Trillion Breakdown

McKinsey's April 2025 analysis provides the most detailed decomposition of the $7 trillion in cumulative datacenter capital expenditure projected through 2030:[67]

Cost CategoryShareAmountKey Components
Technology (chips, GPUs, servers) 60% ~$3.1T GPU/XPU procurement (Nvidia, AMD, custom silicon), server assembly, networking equipment, HBM memory. This is where Nvidia's dominance translates to margin capture
Energizers (power + cooling) 25% ~$1.3T Power generation/transmission ($720B grid spending needed per GS), cooling infrastructure, transformers, switchgear, electrical distribution. The bottleneck layer
Builders (real estate + labor) 15% ~$1.0T Construction labor (~$0.6T), shell & site (~$0.3T), land acquisition (~$0.1T). Skilled labor shortages driving shift to prefabrication/modular

Of the total, AI-specific datacenters require $5.2 trillion while traditional IT workloads require $1.5 trillion.[67] Goldman Sachs' May 2026 "Tracking Trillions" framework projects $7.6 trillion cumulative AI CapEx between 2026 and 2031 alone, growing from $765B annually in 2026 to $1.6T in 2031 - anchored to Nvidia forward data center revenue estimates.[85] The concentration of cost in technology (60%) explains why Nvidia captures outsized economics - but the bottleneck is in the energizer layer (25%), where delivery delays strand the other 75% of invested capital.

The Physical Wall: Supply Chain Constraints as Ceiling

Rabobank's May 2026 analysis identifies four binding constraints that create a physical ceiling on buildout speed:[68]

Efficiency vs. Demand: Why Jevons Paradox Dominates

The historical pattern in computing is unambiguous: efficiency gains increase, not decrease, total consumption. Every major advance in computing efficiency has expanded the addressable market faster than it reduced per-unit cost.

The current cycle is the most extreme example yet. Dell COO Jeff Clarke (May 19, 2026): token costs fell 80% in one year, but token consumption for reasoning surged 320ร—.[5] Net effect: 64ร— increase in total compute spend (0.20 ร— 320 = 64). Nvidia CEO Jensen Huang reported inference token generation surged 10ร— in one year.[19]

The IEA confirms this at the macro level: energy per individual AI task is "improving at a rate unprecedented in energy history" - dropping by at least an order of magnitude annually. But new use cases (video generation, reasoning, agentic workflows) consume hundreds to thousands of times more energy per query.[1] Efficiency unlocks demand categories that didn't previously exist.

The Jevons Verdict

Every data point confirms Jevons paradox dominates: Nvidia's Vera Rubin promises 10ร— lower cost per token vs. Blackwell.[22] This will not reduce total compute demand. It will unlock use cases currently constrained by economics - persistent agentic systems, real-time video generation, continuous scientific simulation - driving another wave of demand. The only scenario where efficiency caps total demand is one where AI capability plateaus. No credible observer is predicting that.

The Evercore Token Consumption Model

Evercore ISI introduced a token consumption model on May 20, 2026 that provides the most granular demand-side framework published by a tier-1 sell-side firm:[82][83]

YearAnnual Tokens (Quadrillions)YoY GrowthDominant Category
2024A~100-Frontier training (59%)
2025E~200~100%Frontier training
2026E~350~75%Frontier training (59%), consumer inference (26%)
2027E~700~100%Shift to inference-dominant
2028E~1,500~114%Agentic AI rising rapidly
2029E~2,500~67%Agentic AI + enterprise inference
2030E~4,000~60%Agentic AI becomes largest consumer

Key insight: agentic AI is negligible today (7% of 2026 tokens) but becomes the largest single category by 2030. This is the demand wildcard - agents consume tokens continuously, without human rate-limiting. Evercore estimates compute capacity demand could reach 250 GW by 2030, assuming blended throughput of 500K transactions/sec/MW.[83]

For context: Goldman Sachs' latest forecast implies ~150-170 GW by 2030 (at their +220% power demand growth rate). McKinsey projects 219 GW. Evercore's 250 GW is the most aggressive mainstream projection yet - 18% above Goldman and 14% above McKinsey. If Evercore is right, the implied cumulative capex exceeds $10 trillion.

Why 4 Quintillion Tokens Matters

A 40ร— increase in annual token demand (100Q โ†’ 4,000Q) in four years implies that every efficiency gain from Vera Rubin, Blackwell Ultra, and custom silicon gets consumed by demand expansion - and then some. This is the Jevons paradox quantified at the application layer: cheaper tokens don't reduce total compute consumed. They create use cases (persistent agents, continuous reasoning, real-time video) that didn't exist at higher price points.

GW Deployment Trajectory by Year

Sell-side capacity deployment estimates show a clear acceleration curve, broadly consistent with Goldman Sachs' facility-level data from Aterio:[84]

YearIncremental GW DeployedSource
2024 (actual)6.4 GWGoldman Sachs / Aterio[3]
2025 (actual)8.5 GWGoldman Sachs / Aterio[3]
2026 (projected)~25 GWSell-side est.[84]
2027 (projected)~32 GWSell-side est.[84]
2028 (projected)~34 GWSell-side est.[84]

Note that Goldman's May 20, 2026 report projects 13.6 GW in 2026 and 36.3 GW in 2027[3] - the 2027 figures are broadly consistent (~32-36 GW range), but Goldman's 2026 figure (13.6 GW) is significantly below the sell-side aggregate (25 GW). This may reflect the gap between scheduled activations (Goldman/Aterio) and contracted/announced capacity (sell-side estimates that include projects not yet under construction). The delta - roughly 11 GW - is consistent with the "announced vs. energized" gap we documented in Section 1.

Three Scenarios to 2030

Goldman Sachs 2030 Forecast Revision History
Projected global datacenter electricity demand for 2030, by report vintage
Three-Scenario Forward Extrapolation: US DC Electricity Demand
Bull (RSI + Jevons), Base (current trajectory), Bear (supply-constrained). US only, TWh.
ScenarioAssumption2030 Global DC DemandImplied CapexKey Risk
Bear Case Supply chain constraints create a hard ceiling. Transformer/grid bottlenecks persist. Only 50% of announced capacity comes online. LBNL 13% completion rate continues ~120-140 GW operational (~950 TWh, IEA base)[1] ~$5T (constrained) Stranded capital: $725B/yr committed but physically unable to deploy. Hyperscaler ROI timelines extend. AI revenue growth decelerates before capex cycle completes
Base Case Current trajectory continues. Grid investments begin easing bottlenecks by 2028. Onsite generation fills gaps. No step-change in demand or efficiency ~170-220 GW (~1,100-1,350 TWh, GS/McKinsey range)[63][66] ~$7T (McKinsey estimate) Execution risk: actual buildout tracks 60-70% of plans. Regional concentration creates grid stress in key markets (Virginia, Texas, Dublin)
Bull Case RSI kicks in ~2028. Token demand 2-5ร— current projections. Agentic AI creates unbounded inference demand. Every enterprise deploys autonomous agents ~300-400 GW (~1,800-2,500 TWh) ~$10-15T cumulative Physically impossible on current grid trajectory. Requires massive onsite generation buildout. Water becomes binding constraint. Social backlash accelerates
The 2-5ร— Question: How Realistic?

Is 2-5ร— current demand projections by 2030 realistic? The Dell COO data suggests we've already seen 64ร— total compute spend growth in one year (80% cost reduction ร— 320ร— consumption). If agentic AI reaches mainstream enterprise deployment and RSI accelerates model capability, 2-5ร— is conservative relative to what the token consumption curve implies. The constraint isn't demand - it's whether the physical world can deliver. The honest answer: demand may support 5ร—+ but realized consumption will be limited to whatever the grid, transformers, and supply chain can actually deliver. The gap between demand and deliverable supply is where the investment opportunity lies.

Section 4

The Recursive Self-Improvement Wildcard

Every scenario model breaks if AI achieves recursive self-improvement (RSI) - the ability to autonomously improve its own capabilities, creating a feedback loop where each generation of AI is designed by a more capable predecessor. This is no longer a thought experiment. The leaders of every major AI lab are now publicly discussing RSI as a near-term possibility.

The Timeline Consensus (Last 90 Days)

WhoRoleStatementDateTimeline
Jack Clark Co-founder, Anthropic "I think there's a 60%+ chance that recursive self-improvement will occur before the end of 2028." Based on weeks reading hundreds of public data sources[71] May 4, 2026 End of 2028 (60%)
Dario Amodei CEO, Anthropic "This loop has already started, and will accelerate rapidly." Transformative AI potentially late 2020s. AI doing all software engineering tasks within "6-12 months"[72] 2026 Late 2020s
Sam Altman CEO, OpenAI "We are only a couple of years away from early versions of true superintelligence." Blog: 2026 = novel insights, 2027 = real-world robots. "By end of 2028, more intellectual capacity inside data centers than outside"[73] Feb 2026 2028 (ASI)
Demis Hassabis CEO, Google DeepMind "AGI is now on the horizon... foothills of the singularity." 50% chance by 2030. "10ร— the impact of the Industrial Revolution, at 10ร— the speed"[74] May 20, 2026 2028-2031 (50%)
Elon Musk CEO, xAI AGI surpasses any single human by end of 2026. Exceeds collective human intelligence by 2027. Grok 5: 10% probability of AGI[75] Jan 2026 (Davos) 2026-2027 (most aggressive)
Shane Legg Co-founder, Google DeepMind 50% probability of "Minimal AGI" by 2028. Has maintained this prediction since 2009 - 17 years of consistency[76] Ongoing (since 2009) 2028 (50%)
Market Signal: Karpathy Joins Anthropic (May 19, 2026)

OpenAI co-founder and former Tesla AI chief Andrej Karpathy โ€” one of the most recognized names in AI research โ€” announced he joined Anthropic's pre-training team on May 19, 2026, stating: "I think the next few years at the frontier of LLMs will be especially formative."[86] Market commentary interpreted this as an RSI signal: "This indicates that we are close to RSI and therefore an accel in model IQ increases. In that scenario the value of compute is going to explode."[87] Whether or not the inference is correct, the talent consolidation at Anthropic (Karpathy + Jack Clark's 60% RSI prediction + Colossus lease + $30B+ ARR) is the strongest single-company signal in the RSI race.

The Convergence

What's remarkable is not any individual prediction โ€” it's the convergence. The CEOs and founders of OpenAI, Anthropic, Google DeepMind, and xAI are all pointing to 2026-2030 as the window for AGI/ASI, with the center of mass around 2028. These are not neutral observers - they have incentives to hype timelines. But they also have the most information about current capability trajectories. Jared Kaplan (Anthropic) told The Guardian humanity faces "the biggest decision yet" between 2027 and 2030.[71]

What RSI Means for Compute Demand

If RSI arrives circa 2028, the compute implications are qualitatively different from current scaling:

Phase 1 AI-accelerated AI research (now - 2027): AI systems assist human researchers in model design, data curation, and hyperparameter optimization. This is already happening - Anthropic, OpenAI, and DeepMind all use AI to accelerate their research. Compute demand grows at current rates (17-22% CAGR per Goldman/McKinsey).

Phase 2 Autonomous AI research (~2027-2028): AI systems design and run their own experiments with minimal human oversight. Training runs happen continuously rather than episodically. Demand for compute becomes continuous and compound rather than periodic. This is the phase Jack Clark's 60% probability applies to.

Phase 3 Recursive loop (~2028+): Each AI generation designs a more capable successor. Training and inference merge into a single continuous process. Demand becomes theoretically unbounded - limited only by available compute, energy, and capital. This is where all current forecasts break.

The Unmodelable Scenario

If Phase 3 materializes, every 2030 projection in this report - Goldman's 1,350 TWh, IEA's 950 TWh, McKinsey's 219 GW - becomes an underestimate by definition. The demand curve goes vertical. The only binding constraint becomes physics: how fast can we build power plants, manufacture chips, and lay transmission lines? This is not a risk to the capex thesis - it's the most extreme version of it. The companies that have secured power, land, and supply chain commitments before RSI arrives will be the most valuable infrastructure assets on earth.

The Skeptic's Case

It would be intellectually dishonest to present RSI timelines without noting the skeptic's position. Hassabis himself notes his definition of AGI requires capabilities - robust creativity, scientific discovery - that current systems lack.[74] Grok's own estimate of AGI timing (2040-2050) is notably more conservative than its creator's.[75] Key objections:

The Dual Dynamic: Demand Spike vs. Self-Optimization

RSI creates two opposing forces on compute demand simultaneously:

Force 1 - Demand explosion: RSI generates continuous, compounding training runs. Each improvement cycle requires a new training run on the full model - or more precisely, on a model that is already larger and more capable than its predecessor. A research survey of 25 leading AI researchers from DeepMind, OpenAI, Anthropic, Meta, UC Berkeley, Princeton, and Stanford (arXiv:2603.03338, February 2026) found that participants converged on predictions that AI agents will transition from "assistants" to "autonomous AI developers" - but after that point, predictions diverged sharply. The frontier lab researchers were more bullish on explosive growth than academics.[77]

Force 2 - Efficiency improvement: A self-improving AI would also optimize its own architecture, training procedures, and inference efficiency. DeepSeek R1 demonstrated this in early 2025: algorithmic innovation achieved comparable performance at dramatically lower compute cost. An RSI system would discover such optimizations continuously. The paper "The Race to Efficiency" (arXiv:2501.02156) argues that classical scaling laws neglect time and efficiency, and that algorithmic improvements are compounding alongside hardware improvements.[78]

Which force dominates? Historically, efficiency gains in computing have always been overwhelmed by demand expansion (Jevons paradox). But RSI introduces a genuinely novel dynamic: for the first time, the system generating demand is the same system optimizing efficiency. Deloitte's 2026 analysis explicitly concludes that "AI's next phase will likely demand more computational power, not less" - efficiency gains at the model level are consumed by the explosion in inference volume and the shift to more compute-intensive reasoning architectures.[79] The academic evidence supports the view that demand dominates, but with meaningful efficiency offsets that prevent the curve from going truly vertical.

Second-Order: Can AI Optimize Its Own Infrastructure?

If AI can improve its own models, can it also optimize the physical infrastructure it runs on? Several mechanisms are already emerging:

Net assessment: AI will meaningfully improve infrastructure efficiency (perhaps 2-3ร— over current levels by 2030). But this improvement is multiplicative on a base that is itself growing exponentially. A 3ร— efficiency gain on a 10ร— demand increase still means a 3.3ร— net increase in physical infrastructure needed.

The Endgame: 2030-2033 If Both Trends Continue

If forecast undershooting persists at the Goldman Sachs historical rate (~15-20% per annual revision) AND RSI accelerates demand on top of that, the math produces sobering numbers:

Parameter2026 (Today)2030 (Base + RSI)2033 (Extrapolated)
Global DC electricity demand ~550-600 TWh 1,500-2,000 TWh (IEA base + GS correction + RSI uplift) 3,000-5,000 TWh (est. - data is thin, labeled as extrapolation)
US share ~250-300 TWh (~6% of US electricity) ~900-1,200 TWh (~15-20% of US electricity) ~1,500-2,500 TWh (~25-35% of US electricity, est.)
Global DC capacity ~68 GW operational 200-350 GW 400-700 GW (est.)
Cumulative capex ~$1.5T (2024-2026) $7-15T (2024-2030) $15-30T (2024-2033, est.)
Data Integrity Note

The 2033 column is extrapolation, not forecast. No credible source publishes specific projections beyond 2030 for datacenter demand under RSI scenarios. These numbers are derived from: (1) Goldman's historical miss rate applied forward, (2) the token consumption growth rate documented by Dell/Nvidia, and (3) the assumption that RSI adds a 2-3ร— multiplier on top of baseline growth. They should be treated as directional, not precise. The honest answer: if RSI arrives circa 2028 and forecasts continue undershooting, the 2030 numbers in this report will look conservative in hindsight.

Academic Grounding

The academic literature on RSI and compute scaling is thin but growing rapidly:

For investment purposes, the question isn't whether RSI arrives on schedule - it's whether the belief in near-term RSI sustains the capex cycle. As long as hyperscalers and sovereign governments believe AGI/ASI is 2-5 years away, the infrastructure buildout continues regardless of whether the timeline proves accurate. The capex is committed; the compute will be built; the only question is what runs on it.

Section 5

The Power Demand Reckoning: US-First Analysis

The United States is the epicenter of the global datacenter buildout. Accounting for approximately 50% of global datacenter capacity and ~90% of Americas regional capacity, the US power grid is absorbing the most acute demand shock in its history.[6]

Goldman Sachs US Datacenter Power Additions

Goldman's latest analysis projects US capacity additions accelerating dramatically:[3]

YearUS DC Capacity AdditionsYoY ChangeSource
2024 (actual)6.4 GW-Goldman Sachs / Aterio[3]
2025 (actual)8.5 GW+33%Goldman Sachs / Aterio[3]
2026 (projected)13.6 GW+60%Goldman Sachs / Aterio[3]
2027 (projected)36.3 GW+167%Goldman Sachs / Aterio[3]

Globally, Goldman Sachs forecasts datacenter demand growing approximately 50% to 92 GW by 2027, at a compound annual growth rate of 17% between 2025 and 2028. In a bullish scenario with stronger GPU power draw and higher customer demand, CAGR could reach 20%.[23] By 2030, data center power demand is projected to surge 165-175% versus 2023 levels - "the equivalent of adding another Top 10 power consuming country."[24]

The IEA Projection (April 2026)

The International Energy Agency's updated Key Questions on Energy and AI report (April 2026) projects global datacenter electricity consumption roughly doubling from 485 TWh in 2025 to approximately 950 TWh by 2030 - slightly more than Japan's entire electricity consumption.[1] For the United States specifically:

YearUS DC Electricity Demand (TWh)Source
2024~170IEA[25]
2025>200IEA[25]
2026>250IEA[25]
2027>300IEA[25]
2028~350IEA[25]
2029~400IEA[25]

The IEA estimates US electricity demand will add more than 420 TWh in total over the next five years, with datacenters making up approximately 50% of that demand growth.[26] The agency also notes that US datacenters will increase their share of regional power from 4% to 7.8% between 2025 and 2030.[27]

US Datacenter Electricity Demand Projection
Source: IEA Energy and AI Report (2025), Key Questions on Energy and AI (Apr 2026)
Critical Nuance - Efficiency vs. Demand

The IEA notes that energy per individual AI task is "improving at a rate unprecedented in energy history" - dropping by at least an order of magnitude annually. Simple text queries now consume less electricity than running a television over the same period. But new use cases (video generation, reasoning, agentic workflows) consume hundreds to thousands of times more energy per query than simple text. The net effect: demand growth overwhelms efficiency gains.[1]

AI vs. All Datacenter Capacity: The 30 GW Baseline

An important distinction: JLL's 103 GW figure (2025) represents all datacenter capacity globally โ€” including traditional enterprise IT, cloud storage, and legacy workloads. Epoch AI independently estimates that AI-specific datacenter power reached approximately 30 GW in Q4 2025, based on cumulative chip sales multiplied by rated power and a 2.5ร— overhead multiplier for servers, cooling, and infrastructure. This means AI represents roughly 30% of total datacenter capacity today โ€” but is driving approximately 80%+ of incremental growth. By 2030, AI's share of total DC power is projected to exceed 50%.[90][91]

Global Comparison: Where the Buildout Is Most Acute

JLL's 2026 Global Data Center Outlook provides the most comprehensive regional breakdown of installed capacity and projected growth:[6]

Global Datacenter Capacity by Region - 2025 vs. 2030 Projected
Source: JLL 2026 Global Data Center Outlook (Jan 2026). Values in GW of IT load capacity.
Region2025 Capacity2030 ProjectedGrowthKey Dynamics
Americas (US ~90%) ~50 GW ~100 GW ~100% Fastest absolute growth. 35 GW under construction in N. America. 97% occupancy[28]
Asia-Pacific 32 GW 57 GW +78% Colocation-led growth. On-premise declining 6% as enterprises migrate to cloud[6]
EMEA (Europe + ME + Africa) ~20 GW ~33 GW +65% Growth in London, Frankfurt, Paris hubs. Europe: DC share of power from 2.7% to 5%[27]
Middle East (subset of EMEA) ~1 GW ~15 GW +1,400% 2.2 GW under construction, 12 GW planned. Stargate UAE (5 GW). Sovereign AI strategies[29]
The Middle East Outlier

The Middle East represents the most extreme growth trajectory: 12 GW planned against just 1 GW of existing capacity, per JLL.[29] This is driven by sovereign AI mandates (Saudi Arabia's HUMAIN, UAE's G42/Stargate partnership) and abundant natural gas for power generation. The OpenAI Stargate UAE project alone targets up to 5 GW.[13] This is geopolitically significant: these countries are positioning themselves as compute exporters.

China's Parallel Buildout: The Multipolar Compute Race

Our analysis to this point has been US-centric โ€” reflecting where the data is strongest. But China is building a parallel AI compute infrastructure at scale. Using chip-spend-derived estimates (with ยฑ30-50% uncertainty on Chinese hyperscaler figures), Roucher's analysis provides the first comprehensive per-entity breakdown of Chinese AI compute capacity:[89]

EntityType~2024 (MW)~2026-27 (MW)~2030 Target (MW)Notes
ByteDanceHyperscaler~300~1,300~3,500Largest Chinese AI compute consumer. Seed, Doubao models
AlibabaHyperscaler~400~1,800~3,000Qwen models run on fleet. Largest cloud provider in China
HuaweiChip maker + cloud~300~480~2,000Ascend 910B/C chips (~3 kW each). Building domestic alternative to Nvidia
TencentHyperscaler~250โ€”~1,500Hunyuan models. Cloud fleet
China MobileTelco130โ†’250~600~1,500Hard MW nameplate data (more reliable). AIDC business
China TelecomTelco150โ†’500โ€”~1,500Starfire models. Hard capacity data
China UnicomTelco~200~700โ€”Smallest of the three telcos
Subtotal: Chinese Giants + Telcos~1,700-1,800~5,000-6,000~13,000-15,000

Chinese pure-play AI labs operate at dramatically smaller scale, and almost all rent compute from the hyperscalers above:

LabOwns/Rents~MWKey Detail
DeepSeekOwns~90Only Chinese lab owning hardware (Fire-Flyer cluster). ~10K A100 + 50K H800
Zhipu / Z.aiRents~110ยฅ1.14B H1-25 cloud spend. $6.5B IPO
Moonshot (Kimi)Rents~90$20B valuation. Long-context reasoning
MiniMaxRents~80$6.5B IPO. Hailuo video generation
StepFunRents~55$718M raise. Multimodal/video
The Multipolar Compute Race: AI Power by Entity
Source: Roucher (2026) via Epoch AI + curated estimates. GW of AI datacenter capacity (owned + rented). Chinese figures ยฑ30-50%. [89]
Data Quality Warning

Chinese hyperscaler MW figures are chip-spend-derived, not satellite-verified. The conversion uses ~48 MW per $1B AI chip spend and per-GPU power estimates (A100 ~0.7 kW, H100/H800 ~1.4 kW, H20 ~0.9 kW, Ascend 910B/C ~3 kW). The ยฑ30-50% uncertainty range is significant โ€” ByteDance's actual capacity could be anywhere from ~1,750 MW to ~5,250 MW. Telco figures are more reliable (hard nameplate data). The 2030 projections are trend extrapolation, not announced plans.[89]

The Multipolar Reality

Second-Order China's combined AI compute (10-15 GW by 2030) approaches US hyperscaler scale. ByteDance at ~3.5 GW is already at Meta-scale (3.8 GW). This has three implications: (1) chip export controls have not prevented China from building substantial AI compute โ€” Huawei's Ascend chips and stockpiled Nvidia hardware fill the gap; (2) global power demand projections that focus only on US/Europe are significantly underestimating total draw; (3) Chinese efficiency innovations (DeepSeek R1's cost breakthrough) emerge from a constrained compute environment where optimization is existential, not optional. Third-Order If China reaches 15 GW by 2030, that's an additional ~130 TWh of annual electricity demand โ€” roughly equal to all of Ireland's current consumption โ€” on top of US/European projections. The global compute race is not US-dominated; it's multipolar.[89]

Europe's Compute Deficit: The 170ร— Gap

Europe's position in the frontier AI compute race is quantifiably dire. Mistral, the continent's leading frontier AI lab, operates approximately 90 MW of compute capacity today โ€” compared to OpenAI's 15,300 MW. That's a 170ร— gap.[89]

European EntityCurrent (MW)PlannedStatus
Mistral~90~200 MW by end-2027; ~1 GW aspiration by 2029Rents from Azure + Scaleway. No owned DCs
Campus IA (MGX-led JV)0 (planning)1.4 GW near ParisMistral is minority partner. 3+ years from full capacity
Europe's Structural Disadvantage

The 170ร— compute gap between Mistral and OpenAI is not just a matter of funding โ€” it reflects a structural disadvantage in power availability, regulatory environment, and infrastructure investment. Campus IA (1.4 GW) is the only project that could begin to narrow this gap, but even at full capacity it would bring Europe to roughly 1.5 GW of frontier AI compute โ€” still 10ร— behind OpenAI alone. For investors evaluating European AI deals, this compute deficit is the single most important constraint on European labs' ability to compete at the frontier.[89]

Section 6

The Bottleneck Wall: What Money Can't Buy (Yet)

Capital is no longer the binding constraint on the compute buildout. The bottleneck has shifted to physical infrastructure - specifically, electrical grid capacity, transformer manufacturing, and the speed of interconnection. This is the most underappreciated risk in the entire AI capex cycle.

Ground Truth: Satellite-Verified GW-Scale Crossings in 2026

Epoch AI's Frontier Data Centers Hub provides independent, satellite-verified tracking of the largest AI datacenter builds โ€” using high-resolution imagery of cooling infrastructure (chillers, cooling towers), cross-referenced with permits and public disclosures. Their data shows five facilities on track to cross 1 GW in 2026:[90][92]

FacilityOwner / UserProjected 1 GW DateBuild Time to 1 GWPeak Compute (H100e)
Anthropic-Amazon New CarlisleAWS / AnthropicJanuary 2026~2.5 yearsโ€”
xAI Colossus 2xAIFebruary 2026~12 months (projected)1.4M H100e
Microsoft FayettevilleMicrosoftMarch 2026 (borderline)~3 yearsโ€”
Meta PrometheusMetaMay 2026~2 yearsโ€”
OpenAI Stargate AbileneOracle / OpenAIJuly 2026~2 yearsโ€”
Race to 1 GW: Facility Build Timelines
Source: Epoch AI Frontier Data Centers Hub (satellite-verified). Construction start โ†’ 1 GW facility power. [90][92]

Construction-to-1-GW timelines range from 1 to 3.6 years across all tracked facilities, with xAI's Colossus 2 projecting the fastest at just 12 months. Epoch estimates power capacity with 80% confidence within a factor of 1.4ร—, and timing within 6 months โ€” making this the most rigorous public dataset on actual datacenter build progress. Notably, their tracked facilities (13 large US campuses) account for approximately 2.5 million H100-equivalents, or ~15% of the global AI compute stock.[90]

Looking further out, Epoch projects Meta Hyperion and Microsoft Fairwater will each reach 5 million H100-equivalents when fully complete โ€” a 50ร— increase over the leading datacenter capacity of mid-2024 (100K H100e). Fairwater is forecasted to be the most expensive datacenter tracked: upwards of $100B in total capital cost upon completion.[92]

The $30B/GW Cost Anchor

Epoch's cost model provides a useful cross-check: $44B per GW of server power ($30B IT hardware + $14B other costs), or approximately $30B per GW of total facility power (since overheads reduce usable server capacity to ~70% of facility power). At JLL's 200 GW 2030 projection, this implies ~$6T in cumulative datacenter capital โ€” consistent with McKinsey's $7T estimate (which includes non-AI workloads). Goldman's $7.6T projection for 2026-2031 AI capex alone suggests either higher per-GW costs or more aggressive capacity deployment than the base case.[90][92]

The Grid Interconnection Crisis

Average grid connection lead times now exceed four years in primary datacenter markets, according to JLL.[6] The World Resources Institute found that power constraints are extending datacenter construction timelines by 24 to 72 months.[30] Sightline Climate tracked 12 GW of US datacenter capacity announced for 2026, but found only 5 GW actually under construction - the remaining 7 GW (58%) sits in the "announced" stage with no physical progress.[31]

The Transformer Crunch

Substation transformer lead times averaged roughly 140 weeks in 2023, increased to 150 weeks in 2025, and now exceed 160 weeks in 2026 - more than three years from order to delivery.[4] Bloomberg reported in April 2026 that more than half of US datacenters planned for 2026 are expected to be delayed due to transformer and electrical equipment shortages.[32]

The IEA documented a 70% surge in global gas turbine orders in 2025, highlighting chokepoints in energy technology supply chains.[1] More than half of datacenter projects globally experienced construction delays of three months or more in 2025, with average equipment lead times at 33 weeks - a 50% increase from pre-2020 levels.[6]

The "Bring Your Own Power" Response

Constrained by grid delays, datacenter developers are increasingly building their own power generation:

The Overbuilding Paradox

Even with behind-the-meter power, the IEA notes that onsite gas generation does not necessarily offer a faster route to market than grid connection at scale - and requires 30-70% overbuilding to handle variable AI loads. The transformer crunch affects onsite generation too, since these facilities still need step-down transformers, switchgear, and distribution infrastructure. There is no shortcut around the electrical equipment supply chain.[1]

High-Bandwidth Memory: The Silicon Bottleneck

Beyond power, the IEA flagged a shortage of high-bandwidth memory (HBM) - integral to AI chip production - that has developed over the past six months and is anticipated to persist through at least the end of 2027.[1] HBM is manufactured almost exclusively by Samsung, SK Hynix, and Micron, creating a three-company chokepoint for the entire AI hardware stack.

Section 7

Hardware Roadmap & Demand Drivers

Nvidia's annual product cadence drives a continuous upgrade cycle, with each generation rendering the prior generation's datacenters partially obsolescent. This is a structural feature, not a bug - it forces continuous capex and sustains demand for both chips and the physical infrastructure to house them.

The Jevons Gap: Token Cost vs. Token Consumption

The Jevons Gap: Token Cost vs. Token Consumption
Illustrative - indexed to May 2025 baseline. Sources: Dell COO (May 2026), Nvidia (May 2025). See Section 3 for full analysis.

Hardware Cadence: The Annual Upgrade Race

Nvidia's response is an annual product cadence promising compounding performance gains:

PlatformTimelineKey Advance
Blackwell (B200/NVL72)2025 (shipping)Full production. 30ร— inference throughput vs. prior gen (MLPerf)[19]
Blackwell Ultra (GB300)Late 2026Sampling Q1 FY2027. Enhanced reasoning performance[19]
Vera Rubin202710ร— lower cost per token vs. Blackwell. Train with 1/4 the GPUs[22]
Rubin Ultra2028Announced at GTC 2026. NVL576 rack-scale systems targeting "millions of GPUs"[22]

Each generation improves efficiency per watt - but as established, efficiency gains at these magnitudes have historically increased, not decreased, total consumption. The Vera Rubin platform's 10ร— cost reduction per token will unlock use cases currently constrained by economics, driving another wave of demand.

Section 8

Investment Implications

Where the Edge Lives

The compute arms race creates a multi-layered opportunity set for seed investors, organized by where the binding constraints actually sit:

1. Power delivery and grid modernization. The 160-week transformer lead time is the single most cited bottleneck. Companies building grid-enhancing technologies, modular substations, or novel transformer manufacturing capacity sit at the critical path. JLL projects modular datacenter systems reaching $48B in annual sales by 2030.[6]

2. Onsite generation and energy storage. With 15-27 GW of onsite natural gas projected by 2030, and 20-25 GW of battery storage in datacenters, the market for behind-the-meter power systems is emerging at massive scale. Companies optimizing gas turbine deployment, long-duration storage, or fuel cell solutions for datacenter loads are well-positioned.[1]

3. Cooling innovation. Server rack power density increased 11ร— between 2020 and 2025, with a further 4ร— increase expected by 2027. By 2027, a single rack could have peak demand equivalent to 65 households.[1] Liquid cooling is now mandatory for AI workloads - Dell's PowerCool CDU C7000 and Lenovo's Neptune business (300% YoY growth) are early signals of a rapidly scaling market.[5]

4. Inference optimization. As inference overtakes training, the software layer that maximizes token output per watt becomes critical. Nvidia's Dynamo open-source library, inference routing systems, and KV cache optimization are emerging categories. Dell projects $50B in AI server revenue for FY2027, with a $43B AI server backlog.[5]

5. Custom silicon and chip supply chain. Amazon, Google, and Meta are all building custom silicon (Trainium, TPU, MTIA). The custom chip ecosystem - chip design (Marvell, Alchip), packaging (TSMC CoWoS), and HBM (SK Hynix, Samsung, Micron) - has become as critical as the GPU itself.

The "Power as Moat" Thesis

Second-Order The companies that secure power first will build the most capable AI. Power access is becoming a competitive moat for both hyperscalers and startups. This is why JLL characterizes the market as a "bring your own power" mandate in key markets including Dublin and Texas.[6] Third-Order As power becomes the binding constraint, energy cost becomes a structural part of AI product margins. Companies that innovate on power (efficiency, generation, procurement) gain a permanent cost advantage in serving AI workloads.

What the Capital Markets Are Pricing

The data center sector maintains remarkably healthy fundamentals despite the buildout scale. JLL reports 97% global occupancy and 77% of the construction pipeline pre-committed to tenants.[6] AI-focused facilities command 60% lease rate premiums over traditional datacenters. Global datacenter core fund capital formation could top $50B in 2026, with ABS/CMBS issuance projected to reach $50B - roughly doubling every year since 2020.[6]

The IEA's analysis of AI and energy company share prices since the launch of ChatGPT found that AI demand has not provided a generalized uplift to the energy sector - it's too small in context. But manufacturers of gas turbines, electrical equipment, some nuclear companies, and energy startups have seen valuations become "more strongly linked to AI."[1]

Section 9

Key Risks & Open Questions

Risk 1: The ROI Question

Goldman Sachs Research analysts are on "heightened alert for signs of market weakness" from a lack of ways to monetize AI or innovations that commoditize models.[23] Meta's stock dropped 6% after raising capex guidance because Zuckerberg's ROI explanation - "build experiences that can get to billions of people and focus on monetizing them once you get to scale" - did not reassure investors.[10] If revenue growth decelerates before capex cycles complete, the overcapacity risk is enormous.

Risk 2: Geopolitical Disruption

The IEA notes that datacenters have been "targeted in conflict zones, underscoring their role as critical infrastructure." Trade restrictions have targeted critical minerals for power electronics, batteries, and chips. China dominates supply chains for key components. Export controls (e.g., Nvidia's $4.5B H20 charge) demonstrate the fragility of the global chip supply chain.[1][19]

Risk 3: Social and Regulatory Backlash

Community pushback against datacenter projects is a "growing issue" per the IEA, driven by concerns about affordability and environmental impact.[1] Dublin has already implemented effective datacenter moratoriums. US grid stress could trigger similar responses in key datacenter markets like Northern Virginia or Texas.

Risk 4: The Efficiency Wildcard

Breakthroughs in model efficiency (like DeepSeek's R1 cost reductions in early 2025) could temporarily depress demand for new compute. Goldman Sachs explicitly models a downside scenario where demand growth CAGR falls to 14% (vs. 17% base case).[23] However, history suggests efficiency breakthroughs ultimately expand the market rather than shrink it.

Open Questions

Sources

  1. IEA, "Key Questions on Energy and AI - Executive Summary," April 14, 2026. iea.org
  2. IEEE ComSoc Technology Blog, "Hyperscaler capex >$600bn in 2026, a 36% increase over 2025," December 22, 2025. techblog.comsoc.org
  3. Goldman Sachs Research / MarketScreener, "US Data Center Power Demand Projected to Double by 2027," May 20, 2026. marketscreener.com
  4. Data Center Knowledge, "Why AI Data Center Projects Face Years of Delays After Approval," May 2026. datacenterknowledge.com
  5. Forbes, "Dell COO Says Agentic AI Is Breaking Cloud Economics, Forcing Data Center Rebuild" (Jeff Clarke interview), May 19, 2026. forbes.com
  6. JLL, "2026 Global Data Center Outlook," January 6, 2026. jll.com
  7. Futurum Group, "AI Capex 2026: The $690B Infrastructure Sprint," February 12, 2026. futurumgroup.com
  8. New York Times, "A.I. Spending Sets a Record, With No End in Sight," April 29, 2026. nytimes.com
  9. CNBC / Andy Jassy annual letter, "Amazon plans approximately $200 billion in capex in 2026," April 2026. convergedigest.com
  10. Fortune, "Microsoft, Meta, and Google just announced billions more in AI spending," April 29, 2026. fortune.com
  11. Futurum Group, "Oracle Q2 FY 2026: Cloud Grows; Capex Rises for AI Buildout," December 12, 2025. futurumgroup.com
  12. Reuters, "OpenAI, Oracle, SoftBank plan five new AI data centers for $500 billion Stargate project," September 23, 2025. reuters.com
  13. Data Center Dynamics, "Building Stargate: Talking to OpenAI about its trillion-dollar data center vision," May 2026. datacenterdynamics.com
  14. Anthropic, "Anthropic and Amazon expand collaboration for up to 5 GW," April 2026. anthropic.com
  15. Anthropic, "Expands partnership with Google and Broadcom for multiple gigawatts of capacity," April 6, 2026. anthropic.com
  16. CNBC, "Amazon to invest up to another $25 billion in Anthropic as part of AI infrastructure deal," April 20, 2026. cnbc.com
  17. Introl Blog, "xAI Colossus Hits 2 GW: 555,000 GPUs, $18B, Largest AI Site," January 3, 2026. introl.com
  18. Wikipedia / Memphis Chamber of Commerce, "Colossus (supercomputer)," May 2026. Note: As of May 6, 2026, Anthropic has agreed to rent all computing capacity at Colossus 1. wikipedia.org
  19. Nvidia Newsroom, "Financial Results for First Quarter Fiscal 2026," May 28, 2025. nvidianews.nvidia.com
  20. Nvidia Newsroom, "Financial Results for Third Quarter Fiscal 2026," November 2025. Revenue: $57.0B total, $51.2B Data Center. nvidianews.nvidia.com
  21. HeyGoTrade, "Nvidia Q1 FY27 Earnings Preview," May 18, 2026. Consensus: ~$78B revenue, ~$73B data center. heygotrade.com
  22. Data Center Knowledge, "GTC 2026: Nvidia Unveils Vera Rubin AI Platform, Eyes $1T by 2027," May 2026. datacenterknowledge.com
  23. Goldman Sachs Research, "How AI Is Transforming Data Centers and Ramping Up Power Demand," August 29, 2025. goldmansachs.com
  24. Goldman Sachs Research, "GS SUSTAIN: Data Center Power Demand: The 6 Ps Driving Growth and Constraints," November 10, 2025. goldmansachs.com
  25. Rigzone / IEA, "USA Data Center Electricity Demand Projected to Triple," November 27, 2025. rigzone.com
  26. IEA, "Electricity 2026 - Demand," 2026. US electricity adds >420 TWh over 5 years; DCs = ~50% of growth. iea.org
  27. Avid Solutions, "13 Data Center Growth Projections That Will Shape 2026-2030," January 13, 2026. avidsolutionsinc.com
  28. JLL, "North America Data Center Report Year-end 2025," February 17, 2026. 35 GW under construction. jll.com
  29. JLL, "EMEA Year End Data Centre Report 2025," April 2, 2026. Middle East: 1 GW existing, 2.2 GW under construction, 12 GW planned. jll.com
  30. World Resources Institute, "Powering the US Data Center Boom: The Challenge of Forecasting Electricity Needs," 2025. wri.org
  31. Tech Investments / Sightline Climate, "Power Bottlenecks & The AI Data Center," May 2026. techinvestments.io
  32. Bloomberg, "The US Data Center Boom Is Hitting a Transformer Crunch," April 1, 2026. bloomberg.com
  33. Gas Turbine World, "Powering the Data Center Boom," November 2025. GE Vernova and Solar Turbines deploying onsite generation. gasturbineworld.com
  34. Data Center Frontier, "TVA Files First SMR Permit for GE Vernova Reactor," June 2025. datacenterfrontier.com
  35. Ciena, "Agentic AI: Rewriting the Rules of Compute and Networking," March 25, 2026. ciena.com
  36. Benzinga, "AI Power Boom Is 'Accelerating' As Goldman Sachs Forecasts Data Center's Demand To Surge 220% By 2030," April 15, 2026. Goldman revised 2030 forecast to +220% / ~1,350 TWh. benzinga.com
  37. McKinsey, "AI power: Expanding data center capacity to meet growing demand," October 29, 2024. Projected 171-219 GW global by 2030. mckinsey.com
  38. Axios, "Behind the Curtain: Intelligence explosion - Anthropic co-founder Jack Clark predicted 60%+ chance of RSI by end of 2028," May 7, 2026. axios.com
  39. Dave Shapiro / Substack, "Recursive Self Improvement is SIX MONTHS away!" covering Dario Amodei's Davos January 2026 remarks. substack.com
  40. Medium / Codex, "Recursive Self-Improvement: Future Dream or Current Reality?" covering OpenAI's AI research agent plans, March 23, 2026. medium.com
  41. AI Multiple, "AGI/Singularity: 9,800 Predictions Analyzed," May 2026. Shane Legg: 50% chance minimal AGI by 2028. aimultiple.com
  42. LBNL / Berkeley Lab, "Queued Up: 2025 Edition," 2025. 13% of queue capacity reaches commercial operation; median wait 4.5 years. emp.lbl.gov
  43. Electrical Trader, "Transformer Shortages: Supply Chain Impact on Pricing," February 2026. Supply deficit ~100% in 2025, projected <10% by 2030. electricaltrader.com
  44. Reuters, "Grid equipment makers invest in US to ease supply shortage," December 2, 2025. GSU demand +274% (2019-2025), Wood Mackenzie. reuters.com
  45. DCF Modeling / Vertiv Holdings, "Breaking Down Vertiv Holdings," November 2025. Gross margin: 28.4% (2022) to 37.8% (Q3 2025). dcfmodeling.com
  46. Windows News / Microsoft Q3 FY2026, "Microsoft Adds 1 GW of Datacenter Capacity," May 2026. ~5 GW at FY2025 end, ~6 GW Q3 FY2026. Revenue/GW: $6.2B. 34 GW renewable PPAs. windowsnews.ai
  47. Power Magazine / Talen Energy, "Talen, Amazon Launch $18B Nuclear PPA," June 2025. 1,920 MW, 17-year Susquehanna deal. powermag.com
  48. Forbes, "Alphabet Buys Clean Energy Startup For AI Data Centers In $4.75B Deal" (Intersect Power), Dec 2025. Google: 30.8 TWh in 2024. forbes.com
  49. ENR, "$27B Meta Data Center Pushes Louisiana Toward Massive Power Expansion" (Hyperion), Apr 2026. 2-5 GW, 10 gas plants, Entergy 6.7 GW. enr.com
  50. Reuters / GE Vernova, "GE Vernova lifts 2026 outlook as AI boom fuels power equipment demand," Apr 2026. EBITDA 12-14%. Gas backlog 83โ†’110 GW. reuters.com
  51. Siemens Energy, "Fulfills all commitments," Nov 2025. Orders โ‚ฌ58.9B (+19.4%), profit margin 6%. siemens-energy.com
  52. Latitude Media, "ERCOT's large load queue has nearly quadrupled," Feb 2026. 226 GW (from 63 GW), 77% DCs. latitudemedia.com
  53. PJM Inside Lines, "PJM Completes Interconnection Reform Transition Cycle 1," Sep 2025. 160 GW studied, 46 GW remaining. insidelines.pjm.com
  54. Fierce Network, "New investments could transform the data center power equation," Sep 2025. Eaton: $340M third US plant. fierce-network.com
  55. Goldman Sachs Research, "AI is poised to drive 160% increase in data center power demand," May 14, 2024. First major DC forecast. goldmansachs.com
  56. Goldman Sachs Research, "AI to drive 165% increase in data center power demand by 2030," February 4, 2025. ~55 GW current; 122 GW by 2030. goldmansachs.com
  57. Benzinga / Goldman Sachs / Kobeissi Letter, "AI Power Boom Is Accelerating - Data Center Demand To Surge 220% By 2030," April 15, 2026. 1,350 TWh; US 60% = 750 TWh. benzinga.com
  58. IEA, "Electricity 2024 - Executive Summary," January 2024. "Could double by 2026." US DC: 260 TWh by 2026. iea.org
  59. McKinsey, "AI power demand in charts," November 6, 2024. US DC: 147 TWh (2023)โ†’580 TWh (2030). mckinsey.com
  60. McKinsey, "Data center demands," May 20, 2025. 82 GW (2025)โ†’219 GW (2030). mckinsey.com
  61. McKinsey, "The cost of compute: A $7 trillion race to scale data centers," April 28, 2025. AI DCs: $5.2T; Tech 60%/$3.1T, Energizers 25%/$1.3T, Builders 15%/$1.0T. mckinsey.com
  62. Rabobank, "Supply chain constraints are curbing US data center development," May 2026. Power, water, minerals, labor as binding constraints. Copper deficit 6M MT by 2035. rabobank.com
  63. Fortune, "America's data centers are thirsty," May 13, 2026. Texas: 49B gallons 2025, up to 399B by 2030. fortune.com
  64. MOST Policy Initiative, "Data Center Water Use," April 2026. US DCs consumed 17.4B gallons in 2023; projected 38-73B by 2028. mostpolicyinitiative.org
  65. Axios, "Behind the Curtain: Intelligence explosion," May 7, 2026. Jack Clark (Anthropic): 60%+ chance RSI by end 2028. axios.com
  66. Dario Amodei, "The Adolescence of Technology," 2026. "This loop has already started and will accelerate." darioamodei.com
  67. Sam Altman, "The Gentle Singularity," blog.samaltman.com, Feb 2026. 2026=novel insights, 2027=robots. "By end 2028, more intellectual capacity inside DCs than outside." blog.samaltman.com
  68. The Verge, "Demis Hassabis at Google I/O: foothills of the singularity," May 20, 2026. AGI 50% by 2030. theverge.com
  69. AI Core News, "Elon Musk AI 2026 Prediction," Jan 2026. AGI surpasses any human by end 2026, collective by 2027. aicorenews.com
  70. ITP.net / Shane Legg, "Google DeepMind Co-Founder Predicts AGI by 2028," Dec 2025. 50% minimal AGI by 2028 (consistent since 2009). itp.net
  71. Field et al., "AI Researchers' Views on Automating AI R&D and Intelligence Explosions," arXiv:2603.03338, Feb 2026. 25 researchers from DeepMind, OpenAI, Anthropic, Meta, Berkeley, Princeton, Stanford. 20/25: automating AI research is most severe risk. arxiv.org
  72. "The Race to Efficiency: A New Perspective on AI Scaling Laws," arXiv:2501.02156, Jan 2025. New scaling laws incorporating time and algorithmic efficiency. arxiv.org
  73. Deloitte, "Why AI's next phase will likely demand more computational power, not less," Feb 2026. deloitte.com
  74. ICLR 2026 Workshop on AI with Recursive Self-Improvement. iclr.cc
  75. "Scaling Laws, Foundation Models, and the AI Singularity: A Critical Appraisal of 2023-2025 Evidence," WJARR 2026. journalwjarr.com
  76. @negligible_cap / Evercore ISI token consumption model, May 20, 2026. Base case: ~100 quadrillion tokens (2026) โ†’ ~4 quintillion (2030), ~150% 4-year CAGR. x.com
  77. Investing.com / Evercore ISI CoreWeave note, May 19, 2026. "Annual token consumption ~4.0 quintillion by 2030. Compute capacity could reach 250 GW by 2030." investing.com
  78. @negligible_cap / sell-side GW deployment estimates, May 12, 2026. Incremental: 25 GW (2026), 32 GW (2027), 34 GW (2028). x.com
  79. Goldman Sachs, "Tracking Trillions: The Assumptions Shaping the Scale of the AI Build-Out," May 2026. $7.6T cumulative AI CapEx 2026-2031. $765B annual (2026) โ†’ $1.6T (2031). goldmansachs.com
  80. TechCrunch / Reuters / CNBC, "OpenAI co-founder Andrej Karpathy joins Anthropic pre-training team," May 19, 2026. "The next few years at the frontier of LLMs will be especially formative." techcrunch.com
  81. @tmtlongshort, May 19, 2026. "This indicates we are close to RSI... the value of compute is going to explode." Market commentary on Karpathy/Anthropic move. x.com
  82. GeekWire, "Data centers at sea: Panthalassa nets $140M led by Peter Thiel for wave-powered AI," May 2026. $140M Series B, ~$1B valuation. Autonomous floating DCs, wave-powered, seawater-cooled. Pilot facility near Portland. geekwire.com
  83. Aymeric Roucher, "The 3 Tiers of the AI Compute Race," May 20, 2026. Interactive datacenter buildout tracker with per-entity GW breakdowns (user vs provider view), built on Epoch AI data + curated China/Europe lines. m-ric.com
  84. Epoch AI, "Frontier Data Centers Hub," updated May 21, 2026. Independent database tracking major AI data centers via satellite imagery, permits, and public documents. 13 US campuses, ~2.5M H100e tracked (~15% of global AI compute stock). CC-BY license. epoch.ai
  85. Epoch AI Data Insight, "Global AI power capacity is now comparable to peak power usage of New York State," January 16, 2026. Total AI DC power ~30 GW (Q4 2025), based on chip sales ร— rated power ร— 2.5ร— overhead multiplier. epoch.ai
  86. Epoch AI, "Introducing the Frontier Data Centers Hub," November 4, 2025. Methodology: satellite imagery of cooling infrastructure, permits, hardware specs. Five DCs crossing 1 GW in 2026. Cost model: $44B/GW server power. Build times: 1-3.6 years to 1 GW. epoch.ai
  87. Epoch AI, "What you need to know about AI data centers," November 2025. Explainer: US AI DCs need 20-30 GW by late 2027; power determines DC location; cooling infrastructure analysis methodology. epoch.ai
  88. Epoch AI Data Insight, "Build times for gigawatt-scale data centers," 2026. Construction milestone tracking via satellite imagery and permits. xAI Colossus 2: 12-month target. epoch.ai

Generated by Galileo ๐Ÿ”ญ ยท May 20, 2026