Briefing · 2026-05-10

1 substack.com 2026-04-11 4 min read

Open

Your GPUs Just Got 6x More Valuable. No New Hardware Required.

Why it matters

Google Research published TurboQuant (nicknamed “Pied Piper”) on March 25, 2026 — a drop‑in compression algorithm that the author says reduces transformer working memory (KV cache) by 6× with “zero accuracy loss,” requiring no retraining or calibration.

Key details

The author reports concrete operational effects: the same GPU that previously served ~9 concurrent users can serve ~50 after TurboQuant; the piece translates 6× memory compression into roughly a 5× increase in revenue per GPU.
The newsletter highlights macro economics: server RAM prices rose ~172% over the prior 18 months, so a 6× effective RAM increase from compression sharply lowers inference costs, extends context windows, and reduces token price.
Strategic implication — compression (KV‑cache optimization) is framed as the fastest‑moving lever in the AI infrastructure war, reshaping competitive dynamics among Google, NVIDIA, middleware vendors, and enterprises running on‑prem inference.

Brief

Google Research’s TurboQuant (published March 25, 2026 and quickly nicknamed “Pied Piper”) is presented as a drop‑in compression method that reduces transformer working memory — the KV cache used during inference — by 6× with no retraining, no calibration, and claimed zero accuracy loss. Nate argues this isn’t merely a cost story: a 6× reduction in memory demand can let a GPU that served ~9 concurrent users handle ~50, roughly translating to a ~5× revenue boost per GPU, while counteracting a ~172% increase in RAM prices over the prior 18 months. The post emphasizes that KV‑cache compression effectively makes the transformer’s context RAM cheaper and larger, enabling longer context windows and lower token costs, and contends that such compression will decisively reshape who wins the AI infrastructure race (cloud providers, chipmakers, middleware, and self‑hosted enterprises).

By Nate from Nate’s Substack

2 substack.com 2026-03-29 12 min read

Open

AI Outlook – Nvidia vs Broadcom vs AMD

Why it matters

OpenAI enterprise data shows API token consumption for reasoning rose 320x year‑over‑year; Goldman reports China enterprise token usage grew 162% from June to December (reported Mar 29, 2026).

Key details

UBS cited a 99.7% decline in cost-per-token over three years and warned that compute capacity — not just gigawatts — is the binding constraint as workloads diversify and latency becomes the next bottleneck.
Nvidia is integrating the Groq team/LPX (SRAM‑heavy) with its Vera Rubin GPUs and Dynamo software to disaggregate prefill vs. decode: management claims a ~35x increase in token‑generation performance and expects Groq LPX in production and shipping around Q3 2026 (Samsung manufacturing).
UBS models LPX could add ≈$50B to data‑center revenue on top of a ~$460–470B baseline for C2027 and suggests total Nvidia data‑center revenue could push toward ~$600B in C2027 (UBS baseline ≈$520B).

Brief

Token demand and inference latency are reshaping the AI infrastructure market. The newsletter highlights explosive token growth — OpenAI shows a 320x YoY increase in reasoning tokens and Goldman reports China's enterprise token use rose 162% from June to December — while OpenRouter data indicates an acceleration since January 2026. UBS cites a 99.7% drop in token cost over three years but warns demand growth still makes compute capacity the limiting factor; inference latency has emerged as the next key bottleneck for agentic and coding workflows.

Nvidia's response is a disaggregated inference stack: Vera Rubin GPUs handle memory‑heavy prefill/KV cache work while Groq LPX (SRAM‑centric, low‑latency decode) accelerates token generation. Jensen Huang claims ~35x token‑generation performance improvement when fused via Dynamo; sampling is underway, Samsung will fabricate Groq LPX, and shipments are targeted around Q3 2026. UBS estimates LPX could add ~ $50B to C2027 data‑center revenue, supporting upside scenarios where Nvidia data‑center revenues approach ~$600B in C2027. Countervailing forces include hyperscaler ASIC programs (Meta’s MTIA roadmap and Broadcom as a key partner). JP Morgan projects Broadcom AI revenues of $65B+ in FY26 and $120B+ in FY27 with ~10 GW deployed, implying significant hyperscaler share gains. The authors model a 10x expansion in AI demand by 2031 as plausible and note humanoid robotics components as an additional long‑run growth vector, while warning that in‑house ASICs pose a medium‑term risk to Nvidia's share despite overall market expansion.

By Tech Investments

3 Twitter/X 2026-02-17 1 min read

Open

Grid interconnection can take up to seven years; in 2024 Elon Musk's xAI avoided…

Why it matters

Grid interconnection can take up to seven years; in 2024 Elon Musk's xAI avoided that delay by trucking gas turbines on semitrucks and powering the Colossus data center's first phase in four months.

Key details

Cleanview's 'Bypassing the Grid' found dozens of permit documents, site plans, and equipment orders (dozens of projects proposed in 2025) showing many are under construction; dozens of gigawatts could come online in 1–2 years, ~75% powered by natural gas — equivalent to adding the power demand of five New York Cities; one supplier is providing a few hundred MW of ship/warship engines to an Ohio data center.

Brief

Developers are bypassing years-long grid interconnection delays by deploying mobile power plants—mobile gas turbines and jet engines—to run new data centers. xAI used semitrucked gas turbines in 2024 to energize Colossus in four months. Cleanview's 'Bypassing the Grid' documents dozens of permitted projects that could add dozens of gigawatts in 1–2 years, about 75% gas-fired, equal to five New York Cities of demand.

By @curious_founder

4 YouTube 2024-05-22 Video

Open

Energy Storage Interconnection - Challenges and Solutions

Why it matters

Interconnection backlog surged in 2023 to roughly 2,600 GW (a ~30% year‑over‑year increase), driven by solar, wind and storage; the U.S. West (outside CA) had ~706 GW in queue and CAISO ~523 GW, and more than 1.2 TW of projects entered queues after the Inflation Reduction Act (including ~540 GW of storage).

Key details

Average project dwell time in queues is about five years and roughly 70% of projects are ultimately withdrawn; between 2000–2017 only ~14% of queued capacity reached commercial operation.
FERC’s 2023 interconnection reforms push cluster (first‑ready, first‑served) studies, higher financial readiness requirements, improved queue transparency, and new cost‑allocation approaches; some RTOs/utilities (e.g., ERCOT’s connect‑and‑manage, CAISO cluster/priority paths) are already experimenting with similar approaches.
Interconnection costs and cost allocation are growing constraints: PJM median per‑kW interconnection costs rose from roughly $18–$30/kW (2000–2009) to much higher levels later, and PJM showed storage interconnection costs as high as ~$335/kW versus natural gas at ~$24/kW; Massachusetts’ queue in 2022 represented about $8 billion of planned investment that risks being lost.

Brief

The webinar (presentation/webinar format) convened experts from DOE, Sandia, PNNL, Clean Energy States Alliance and IEEE to diagnose the energy‑storage interconnection bottleneck and surface policy, technical and standards solutions. Presenters documented a nationwide queue surge—driven by rapid declines in lithium‑ion cost (cited falling from ~$270/kWh mid‑2022 to ~$180/kWh by end‑2023), IRA‑driven project additions, and aggressive state decarbonization goals—which has produced years‑long wait times, high withdrawal rates (≈70%) and large, uneven interconnection upgrade costs (examples: PJM storage interconnection ≈$335/kW vs natural gas ≈$24/kW). Practical impacts at the state level were highlighted using Massachusetts (solar+storage = 93% of the queue) and national maps showing concentrated activity in Texas, California and the U.S. West.

Speakers outlined a layered solution set. Policy and market fixes include FERC’s 2023 reforms (cluster studies, readiness requirements, transparency and revised cost allocation), state experiments (California’s CPUC limited‑generation profiles; Oregon’s export‑capacity approach; New Jersey capacity‑based fees), and ERCOT’s connect‑and‑manage practice. Operational and technical remedies focus on better hosting‑capacity analysis, standardizing export‑control methods (non‑export, limited export, managing inadvertent export), and using group/cluster studies and flexible interconnection arrangements to speed outcomes. Standards and conformity work are essential: IEEE 1547 and its supplements (15479 for storage, 15474 for intentional islands) and the transmission IBR family (2800) are being updated to provide minimum performance requirements and commissioning/testing guidance, while UL 1741 certification and conformity assessment programs help utilities verify equipment. The DOE‑led i2x effort and lab partnerships (PNNL, Sandia, LBNL) have produced a transmission road map (volume 1) and are developing a distribution road map with stakeholder comment periods and targeted technical assistance to translate these reforms into implementable, equitable processes.

By Clean Energy Group / Clean Energy States Alliance

5 substack.com 2026-04-27 4 min read

Open

Flash Note: Defense Production Act

Why it matters

On April 20, 2026 the President signed Presidential Determination 2026-10 invoking Defense Production Act (Section 303) for grid infrastructure and supply-chain capacity, formally listing transformers, transmission lines, conductors, substations, high‑voltage circuit breakers, power control electronics, protective relay systems, capacitor banks, and electrical core steel as essential to national defense.

Key details

The determination cites Executive Order 14156 (Declaring a National Energy Emergency) issued January 20, 2025 and was motivated by rapidly rising grid backlogs and long lead times (transformer lead times previously crossed 18 months).
Market evidence of accelerating demand: GE Vernova’s electrification segment had a 1Q26 quarterly net addition to backlog nearly as large as the entire annual additions from 2022–2025, signaling accelerating—not abating—growth.
Korean transformer suppliers dominate marginal large‑transformer capacity: Hyosung Heavy’s Memphis plant is the only U.S. facility able to produce 765 kV ultra‑high‑voltage transformers; HD Hyundai Electric’s Montgomery, AL expansion targets 150 units/year by 2027; LS Electric opened in Bastrop, TX; Iljin won a record $333M U.S. order and its first European ultra‑HV contract. The Korean “Big Four” backlog was $23.9B as of Q3 2025 (≈5–6 years of work).

Brief

Citrini frames the April 20, 2026 Presidential Determination (2026‑10) as a formal Defense Production Act (Section 303) response to a deepening U.S. grid‑equipment bottleneck, invoking Executive Order 14156 (Jan 20, 2025) and classifying transformers, high‑voltage components, substations, control electronics, protective relays, capacitor banks and electrical core steel as essential to national defense. The note highlights accelerating backlog metrics — GE Vernova’s electrification segment posted a 1Q26 quarterly backlog addition nearly equal to the annual additions from 2022–2025 — and persistent transformer lead times that have exceeded 18 months. It details supplier dynamics: Hyosung’s Memphis plant is the only U.S. 765 kV capable facility; HD Hyundai Electric aims for 150 units/year from Montgomery by 2027; LS Electric opened in Bastrop, TX; Iljin secured a $333M U.S. order and its first European ultra‑HV contract. The Korean Big Four held a combined $23.9B backlog in Q3 2025 (about five to six years of work), justifying DPA action to shore up capacity.

By Citrini

6 substack.com 2026-05-06 9 min read

Open

Don’t Reject Data Centers. Negotiate Harder.

Why it matters

Loudoun County, VA: data centers are forecast to generate roughly $1.3 billion for the FY2027 budget—about 45% of county tax revenue—and the county’s real property tax rate fell from $1.145 per $100 in 2016 to $0.805 in 2025 (≈30% decline); one parcel recently sold for $6 million per acre.

Key details

Chandler, AZ rejected a proposed $2.5 billion Active Infrastructure data center that developers said would deliver $10.2 million in annual property tax revenue and supplier commitments including 750 jobs at ≥$75,000, closed‑loop water, on‑site power, and noise monitoring.
State subsidies are large and rising: at least 37 states offer sales/use tax exemptions for data centers; Virginia’s 2008 exemption projected to cost $1.54M now cost $1.6B in FY2025 (FY2024 = $1.02B) and forgave ~$2.7B from 2015–2024; Columbus/Google received a ~$54M 15‑year abatement for a $300M facility that created ~20 permanent jobs.
Policy prescription from Greg Miller (Center for Land Economics): build and welcome data centers for U.S. AI competitiveness, stop state‑level subsidies, and use local negotiation leverage—require closed‑loop water, grid supply or mitigation, and stronger PILOTs/Community Benefit Agreements to fund resident electricity relief and capital projects.

Brief

Data centers are reshaping local fiscal landscapes and political choices: Greg Miller (Progress and Poverty, May 6, 2026) uses case studies—Loudoun County, Chandler AZ, Columbus OH and state incentive programs—to argue that data centers will be built and can be net fiscal positives if cities negotiate aggressively. He cites Loudoun’s ~$1.3B FY2027 data‑center revenue (≈45% of county receipts) and declining property tax rates as evidence of upside, while noting centers are high‑revenue but low‑employment land uses. Miller documents growing state giveaway costs (Virginia’s exemption rose from a $1.54M 2008 projection to $1.6B in FY2025; $2.7B forgiven 2015–2024) and examples of excessive local abatements (Google/Magellan: $300M facility, ~$54M 15‑year abatement for ~20 jobs). His prescription: permit and welcome centers for U.S. AI capacity, end broad state subsidies, and use local tools—closed‑loop water, guaranteed grid contributions, PILOTs or CBAs targeted at electricity relief and capital projects—to ensure community benefit.

By Progress and Poverty

7 datacenterdynamics.com 2026-02-27 1 min read

Open

Your 100MW hyperscale AI blueprint

Why it matters

The whitepaper '100 MW Hyperscale AI Blueprint' (Data Centre Dynamics, published 27 Feb 2026) presents a reference architecture for 100 MW hyperscale AI facilities designed to support high-density AI workloads while managing execution and operational risk.

Key details

For investors and developers the report identifies speed to market, capital efficiency, and infrastructure resilience as decisive factors shaping returns on AI data center projects.

Brief

The 100 MW Hyperscale AI Blueprint (Data Centre Dynamics, 27 Feb 2026) outlines a reference architecture for hyperscale AI data centers built to host 100 MW of high-density AI workloads. It emphasizes design and operational strategies that mitigate execution and operational risk and recommends prioritizing speed-to-market, capital efficiency, and infrastructure resilience for investable projects.

By DCD Investment & Markets Channel

8 divenewsletter.com 2026-04-23 6 min read

Open

GE Vernova’s gas turbine backlog reached about 100 GW as of April 23, 2026; CEO…

Why it matters

GE Vernova’s gas turbine backlog reached about 100 GW as of April 23, 2026; CEO Scott Strazik said turbine dollar-per-kilowatt pricing should be “very healthy” in Q2 and the company also reported big order increases for grid and wind equipment.

Key details

CenterPoint Energy plans to energize roughly 8 GW of data center load by 2029; CEO Jason Wells told investors on the company’s Q1 earnings call that Houston is now “firmly established” for hyperscalers.
PJM’s market monitor opposed FERC waivers requested for Constellation’s Three Mile Island restart needed to fully deliver output when the unit returns (possibly in 2027); Constellation has a 20-year agreement to sell the plant’s energy, capacity and clean attributes to Microsoft for data center supply.
A 50-MW distributed battery deployment via a Base Power partnership aims to reduce price spikes and peak loads for Guadalupe Valley Electric Cooperative in South Texas; meanwhile NEMA warns distribution transformer backlogs are running a year or more amid equipment shortages.

Brief

Utility Dive's April 23, 2026 Daily Dive newsletter aggregates industry moves: GE Vernova reported a roughly 100 GW gas-turbine backlog and said dollar-per-kW turbine pricing should accelerate in Q2, alongside large upticks in grid and wind equipment orders — a dynamic underscored by BNEF reporting a 66% surge in gas plant build costs. CenterPoint Energy told investors it will energize about 8 GW of hyperscaler data center load by 2029, citing Houston’s emergence as a preferred market. PJM’s market monitor opposed FERC waivers tied to Constellation’s Three Mile Island restart (needed to immediately deliver output), noting Constellation already has a 20-year contract to sell all attributes to Microsoft. Operational relief efforts include a planned 50-MW distributed battery for Guadalupe Valley Electric Cooperative; manufacturers warn transformer backlogs now run a year or more.

By Utility Dive

9 datacenterdynamics.com 2026-03-19 1 min read

Open

Stream Now: Emerging frontiers - Mapping the next data center hotspots

Why it matters

Data Centre Dynamics hosted a livestream on 19 March 2026 titled "Emerging frontiers - Mapping the next data center hotspots" examining where capacity will be built next.

Key details

Panelists identified power constraints, land scarcity and grid congestion in traditional hubs as drivers pushing investors toward secondary cities and frontier economies; AI workloads, sovereign cloud initiatives and connectivity upgrades are key factors reshaping site selection, financing and capital allocation.

Brief

Data Centre Dynamics' 19 March 2026 livestream 'Emerging frontiers — Mapping the next data center hotspots' examines how power constraints, land scarcity and grid congestion in established hubs are redirecting investment to secondary cities and frontier economies. Speakers assess AI workload demand, sovereign cloud programs, connectivity upgrades and financing models for large-scale builds amid policy uncertainty and infrastructure risk.

By DCD Investment & Markets Channel

10 substack.com 2026-04-10 9 min read

Open

🍪 TWiC: Intel with TeraFab+Sambanova+Google, Samsung and Anthropic 🚀, AGI CPU in China

Why it matters

Intel’s market cap topped $300B in early April 2026 — roughly 3x growth in 6–9 months — as its EMIB packaging gains traction while TSMC CoWoS remains capacity-constrained.

Key details

On Apr 7, 2026 Intel joined Elon Musk’s TeraFab consortium (SpaceX, xAI, Tesla) to help deliver the project’s 1 TW/year compute target, providing advanced nodes, packaging and volume manufacturing expertise.
Intel + SambaNova announced a heterogeneous inference stack combining GPUs for prefill, Intel® Xeon® 6 as host/“action” CPUs and SambaNova SN50 RDUs for decode; the RDU mixes SRAM, HBM and DRAM and “predetermines” data paths to reduce required decode-chip counts.
Google and Intel agreed to deploy Xeon CPUs plus Infrastructure Processing Units (IPUs) — offloading networking/storage/security tasks to free CPU cycles for AI orchestration, positioning x86 Xeon strongly in AI datacenters.

Brief

Intel’s recent string of moves — market cap >$300B, joining Musk’s TeraFab (Apr 7, 2026) and partnerships with SambaNova and Google — signal a potential commercial turnaround centered on packaging and heterogeneous datacenter stacks. With TSMC CoWoS constrained, Intel’s EMIB packaging has seen accelerated demand through 2026; TeraFab’s 1 TW/year ambition offers Intel a whale customer if wafer starts materialize. The Intel–SambaNova stack pairs GPUs for prefill, Intel Xeon 6 as host/action CPUs and SambaNova SN50 RDUs for decode; the RDU’s SRAM+HBM+DRAM approach and “predetermined” data-paths aim to cut the number of decode chips versus SRAM-limited LPUs (a contrast drawn with Groq’s inelastic LPU design). Separately, Google+Intel deployments of Xeons plus IPUs (DPU-like offloads) reinforce x86’s role in heavy AI orchestration workloads.

Macro demand is amplifying these hardware bets: Samsung projects Q1 2026 profit of 57T won ($38B), a 755% YoY increase driven ~95% by memory, validating a multi‑year DRAM supercycle; Anthropic reported a $30B ARR (Apr 6, 2026) and booked 3.5 GW of Google TPU capacity starting 2027 while hedging across AWS/TPU/NVIDIA. Finally, Arm’s intent to sell finished AGI CPUs into China (rather than license IP) creates a fast route for Chinese datacenters to access high‑core, AI‑optimized server processors without transferring chip design IP — a notable strategic loophole with geopolitical implications.

By Vik's Newsletter

11 substack.com 2026-03-26 28 min read

Open

Radical Candor: The Grid of the Future Runs on Transparency

Why it matters

Customer trust is the core product: J.D. Power scored U.S. residential electric utility satisfaction at 499/1,000 in 2025 (the study's historic low) while average monthly residential bills rose 34% since 2020 to $189/month.

Key details

Cost-of-service regulation (COSR) creates perverse incentives to hoard information and overbuild; the author advocates outcome-based Performance-Based Regulation (PBR) that ties utility earnings to lower retail rates and reliability outcomes.
The author proposes permissioned, near-real-time visibility into distribution and transmission constraints (feeder-level loading, thermal limits, hosting capacity, interconnection pipeline) for verified market participants under NDAs/CEII-style credentialing.
Interconnection inefficiency is large: at end of 2024 >10,000 projects representing >2,000 GW sat in U.S. queues; historically ~70% of MISO queue projects never get built — better visibility would reduce wasted siting and downstream transmission spend.

Brief

Radical Candor: The Grid of the Future Runs on Transparency frames the next structural transformation of U.S. electric utilities around trust and data transparency. Michael Lee argues that utilities currently sell an “invisible” product — trust in reliable, affordable power — yet operate under cost-of-service regulation (COSR) incentives that reward capital deployment and information control. He cites concrete metrics: a J.D. Power 2025 customer satisfaction score of 499/1,000 and average residential bills up 34% since 2020 to $189/month. That divergence feeds political backlash even as utility stocks remain high. Lee contends that outcome-based Performance-Based Regulation (PBR) — which ties utility earnings to delivering reliability at lower customer cost — would flip incentives so utilities want third parties to site, dispatch, and optimize resources efficiently rather than protect rate-base-funded builds.

Lee lays out actionable, technical changes. He calls for a permissioned, tiered data platform exposing feeder-level loading, thermal constraints, live hosting-capacity maps, planned outages, and dynamic interconnection-pipeline status to authenticated developers, aggregators, and regulators under CEII/NDA frameworks. He documents the scale of the current mismatch: more than 10,000 queued projects (>2,000 GW) at end-2024, with ~70% of MISO projects historically not built, and cites PJM baseline transmission spend (e.g., $6.7B RTEP in 2024; 2025 projected $8–12B, five-year totals approaching $40B) as partly driven by suboptimal siting from opaque information. Operational reforms Lee recommends include standardizing and modularizing distribution hardware, treating long-lead procurements as financial options, normalizing utility ROE and tying upside to performance, and requiring line‑item engineering justification (hours of thermal exceedance, alternatives evaluated) for every capital dollar. He also highlights policy and consumer-friction risks: MassSave’s $4.5B 2025–27 plan funded via bill surcharges (policy charges in MA rose $15→$59/month, 2014–2025) undermines affordability and consent for mass-device enrollment. The post points to international and vendor precedents (AEMO’s Connections Simulation Tool, DOE i2X, Tapestry et al.) and argues that transparency, credentialed access, and platform-style information infrastructure are both technically feasible and essential to earn the mass-market permission needed for a distributed, flexible grid.

By Distributed Grid from Michael Lee

12 substack.com 2026-04-24 3 min read

Open

Load Growth Challenges: Texas Grid Roundup #91

Why it matters

Published Apr 24, 2026 in Texas Energy and Power Newsletter (Grid Roundup #91), the piece flags rapid data‑center driven demand growth in Texas and a spring wave of ERCOT reports, legislative hearings, and policy initiatives about who pays for new infrastructure.

Key details

ERCOT’s preliminary 2026 Long‑Term Load Forecast projected load reaching 368 GW by 2032 — described in the article as 'more than four times the grid’s current all‑time demand record' — a number energy analyst Travis Kavulla told Bloomberg 'can’t actually happen.'
ERCOT CEO Pablo Vegas offered a much lower, more realistic projection of 111 GW by 2032 in a board presentation, highlighting large forecast variability and raising questions about transmission, interconnection, and cost allocation for the coming buildout.

Brief

The newsletter examines wide uncertainty around Texas load growth driven largely by rapid data‑center expansion and a cluster of ERCOT reports, hearings, and policy moves in spring 2026. ERCOT’s preliminary 2026 Long‑Term Load Forecast shocked observers by projecting 368 GW by 2032 — over four times the current all‑time peak — a result Bloomberg‑quoted analyst Travis Kavulla called implausible. ERCOT CEO Pablo Vegas subsequently presented a far lower 111 GW by 2032 to the board, underscoring large model and assumption sensitivity. The piece frames two central challenges: how much incremental capacity and transmission will be required, and who should bear the cost, while noting ongoing industry conversations (including a podcast episode on batteries with Fluence VP Suzanne Leta) about how resources and markets must adapt.

By Texas Energy and Power Newsletter

13 Twitter/X 2026-05-03 1 min read

Open

WoodMac forecasts the US data center electrical equipment market grows from $20B…

Why it matters

WoodMac forecasts the US data center electrical equipment market grows from $20B in 2025 to $65B by 2030; data centers reach 40% of total US electrical equipment demand by 2030 in the accelerated case, up from ~2% in 2020.

Key details

US data center capacity expands from 24 GW to 110 GW between 2026 and 2030; 600 GW of pipeline still searching for power versus 183 GW with signed utility agreements; equipment lead times are 18–36 months and manufacturers are re-opening year-old POs with ~20% price hikes.
Hyperscale unit growth 2025→2030: padmount transformers 1,573→9,395; PDUs 1,966→11,745; MV switchgear and ATS 786→4,698 each; data centers projected to consume 8× more electricity than EVs through 2030, adding ~400,000 GWh.

Brief

WoodMac projects the US data center electrical-equipment market will expand from $20B in 2025 to $65B by 2030, driven by capacity growth from 24 GW to 110 GW (2026–2030), a 600 GW pipeline seeking power versus 183 GW with agreements, sustained 18–36 month lead times, and sharp hyperscale equipment and electricity demand growth.

By @ShanuMathew93

14 substack.com 2026-04-22 32 min read

Open

Transmission Takes a Decade, Load Doesn't — with Raina Hornaday

Why it matters

ERCOT’s all-time demand peak is 85.5 GW, while at the end of 2025 the interconnection queue contained 432 GW of generation requests and ERCOT received ~225 GW of new large-load requests in 2025.

Key details

Raina Hornaday has developed more than 1 GW of renewables across Texas over 20+ years and founded Caprock Renewables and Fortress Microgrid; Caprock built utility-scale projects of about 150 MW, 200 MW and a 300 MW site that required a new substation and two years of construction.
Hornaday calls the current trend the “energization of land”: developers and landowners are maximizing sites for solar, batteries, data centers and storage, and her site-selection methodology relies on nodal price data and substation availability.
Transmission is the binding constraint: long planning and construction lead times (often measured in years to a decade) are driving a surge in distributed generation, microgrids and behind-the-meter solutions because load growth (AI/data centers, Permian demand) cannot wait.

Brief

Texas is in the middle of simultaneous, rapid demand growth and an unprecedented pipeline of generation proposals, and the mismatch between generation ambitions and transmission capability shapes how projects are being built. ERCOT’s all‑time demand peak is 85.5 GW, yet the market entered 2026 with roughly 432 GW of generation in the interconnection queue and about 225 GW of new large‑load requests filed in 2025. Raina Hornaday — founder of Caprock Renewables and Fortress Microgrid and a developer responsible for more than 1 GW of Texas projects over two decades — describes transmission as a long lead, capital‑intensive constraint that “load doesn’t want to wait for.” Her utility‑scale portfolio includes 150–300 MW solar projects; one 300 MW site required a new substation and two years of construction, illustrating the multi‑year timelines for bulk transmission work.

Because transmission expansion lags, Hornaday and other developers are shifting toward distributed generation, co‑located solar+storage, microgrids and behind‑the‑meter supply for fast‑moving customers (notably AI/data centers and Permian loads). Site selection is increasingly nodal‑data driven — choosing nodes with attractive prices and substation access — and smaller projects (sub‑10 MW batteries or behind‑the‑meter builds) avoid the full interconnection study process, shortening lead times. Battery energy storage has become a critical stopgap: ERCOT’s Real‑Time Co‑Optimization Plus Batteries (launched December 2025) and a large buildout of storage (capacity roughly doubled in the prior year) have damped price volatility since the 2021 Uri event and provide fast dispatchable capacity while transmission is planned and built. Nevertheless, not all storage projects proceed; cancellations through early 2026 reflect permitting and siting opposition, local fire‑safety concerns, financing and supply chain issues.

Hornaday also highlights the local economic and workforce benefits of renewables (agrivoltaics, “solar sheep,” school‑district interactions), the loss of Chapter 313 incentives that changed project economics, and ongoing political risk — e.g., Senate Bill 819’s failed 2025 effort and likely future iterations. Her prescription: more education, standardized siting and permitting practices, and policy alignment so Texas can pair rapid load growth with flexibility resources and transmission investments required for long‑term reliability.

By Texas Energy and Power Newsletter

15 substack.com 2026-05-06 10 min read

Open

Power Delivery As The Next Physics Wall In AI Datacenters

Why it matters

Vikram Sekar (Vik's Newsletter, published 2026-05-06) argues datacenters are shifting to 800V DC because resistance and I2R losses make low-voltage (48V) rack power impractical at AI scale.

Key details

Example math: a 600 kW rack at 48V requires ~12,500 A. With a 1% voltage budget (0.48 V) the allowable resistance is ~38 µΩ; I2R losses at that current and resistance are ~6 kW and busbar copper volume/weight becomes ~1,000 cu. in and ~147 kg per rack.
At 800V the same 600 kW needs ~750 A. A 1% budget (8 V) yields ~11 mΩ headroom; practical busbars can be 1"×1" (~84 cu. in, ~12.3 kg) and I2R losses drop to ~0.56 kW (order-of-magnitude improvement over 48V).
Architecture/standards implications: any DC stage above the 60 V SELV threshold requires galvanic isolation (transformer), so 48V sits just below the SELV ceiling allowing non-isolated downstream converters; this shapes conversion topologies and vendor sockets.

Brief

Sekar lays out why power delivery is the next “physics wall” for AI datacenters and why the industry is moving toward 800V DC rack architectures. The core physics is simple: for fixed power, lower voltage means much higher currents, which amplify resistive losses (I2R), force massive copper busbars, and erode voltage-headroom. Using his worked example, a 600 kW rack at 48V needs ~12,500 A, a 1% voltage-drop budget implies ~38 µΩ total resistance, ~6 kW dissipated, and roughly 1,000 cu. in (~147 kg) of copper busbar per rack. Moving to 800V reduces current to ~750 A, lets busbars be ~1"×1" (~84 cu. in, ~12.3 kg), and cuts dissipated power to ~0.56 kW.

Those physics drive architectural choices: keep voltages high as long as possible and perform point-of-load (PoL) conversion—vertical power delivery—close to GPUs. Conversion chains run from high-voltage grid lines (100 kV) down to medium/low voltages (10–30 kV substations, 400–480V AC halls) then to rack DC (48V historically, now 800V in new builds) and finally to VRM voltages (~0.8–1V). Each conversion stage adds loss (e.g., 97% per stage → ~91% over three stages), and isolation rules (SELV 60 V DC) force transformer-based topologies above 60 V, shaping which semiconductor and converter designs compete. Early 800V rollouts require greenfield or DC-sidecar approaches (OCP Mt. Diablo, NVIDIA Kyber), and analysts and vendors (Wood Mackenzie, SemiAnalysis, Citrini, ON Semi) expect sharply higher power‑semiconductor content and supply-chain shifts as deployments scale.

By Vik's Newsletter

16 YouTube 2025-04-22 Video

Open

Load Growth and Electric System Reliability (4.22.25)

Why it matters

NERC's 2024 Long-Term Reliability (LTR) assessment (released Feb 2024) finds over half of North America at risk of energy shortfalls over the next 10 years, with rapid load growth a primary driver.

Key details

Peak demand is accelerating: NERC reports the highest demand growth in its last 20 years of LTRs and forecasts a 12 GW increase in peak demand for the upcoming summer (more than twice last summer's increase).
Interconnection queue expanded by 44 GW (≈12%) year-over-year and is dominated by solar and battery projects, but actual project builds fell short of prior-year projections for most resource types except batteries, which exceeded expectations.
Thermal generator retirements are significant: 78 GW of expected thermal retirements are included in the 10-year assessment, plus an additional ~37 GW announced or in early retirement stages (not yet reflected in planning), with large exposure in market areas such as PJM and MISO.

Brief

Mark Olsen, manager of reliability assessments at the North American Electric Reliability Corporation (NERC), presented a webinar (Clean Energy States Alliance, April 22, 2025) summarizing findings from NERC's 2024 Long‑Term Reliability Assessment and related reliability leadership work. The session was a technical briefing-style presentation that emphasized probabilistic methods (loss-of-load hours and expected unserved energy) and a 10-year planning horizon. Olsen showed NERC's risk map (high/elevated/normal categories) and explained that more than half of the continent faces growing energy risk driven primarily by a step-change in demand growth, continued retirements of thermal plants, and uncertainty/timing in bringing new resources online.

Olsen walked through specific metrics and implications: the interconnection queue grew by about 44 GW (12%) and is dominated by solar and battery projects, but actual adds lagged projections except for batteries. NERC accounted for roughly 78 GW of thermal retirements in its baseline and flagged another ~37 GW announced but not yet removed from planning, concentrating risk in market areas such as PJM and MISO. Winter reliability is emerging as a particular concern—winter peak growth is outpacing summer growth in many regions, winter shortfall events can last much longer (example winter events modeled up to ~12 hours), and short-duration batteries (2–4 hours) have limited ability to solve extended winter energy deficits. Transmission planning activity has increased (≈28,000 circuit miles in development vs a historical average near 18,000), but construction bottlenecks, permitting and siting issues (affecting ≈1,200 miles) slow realized delivery benefits. Olsen highlighted the interregional transfer capability study showing transmission can reduce some shortfalls but will not eliminate the need for additional resources. His recommendations for state and federal policymakers: pursue an all-of-the-above resource strategy, carefully manage generator deactivations, improve permitting and siting coordination, mature probabilistic and multi-weather-year modeling, account for grid-enhancing technologies, and plan for longer-duration storage and firm resources to cover growing winter and multi-day energy needs.

By Clean Energy Group / Clean Energy States Alliance

17 substack.com 2026-04-22 4 min read

Open

Research|TPU 8t/8i and Virgo Network: the biggest networking upgrade since TPU v4 — and another major win for opti…

Why it matters

A single Virgo fabric spans over 130,000 TPUs, delivers about 47 Pb/s (petabits/second) of bisection bandwidth, and reduces latency by ~40% (Google’s Virgo Network figures cited in the April 22, 2026 writeups).

Key details

Google frames the datacenter design in three layers — scale-up within the pod, scale-out across pods (the Virgo/DCN fabric), and Jupiter for front-end and storage — confirming the recent changes target the scale-out DCN rather than replacing the 3D torus.
TPU 8t brings a multi‑fold increase in data-center network (DCN) bandwidth and TPU 8i is emphasized as the complementary piece that 'closes the loop' (the author frames 8i as a major win for optics and the networking upgrades).
Engineering implication: the system constraint has shifted from raw compute to bandwidth, validating the 'AI hypercomputer' view that compute, storage, and networking must be co‑designed (a prediction the author says they made months earlier).

Brief

Virgo Network and TPU 8t/8i mark a major networking and optics-driven shift in Google's AI datacenter architecture: a single Virgo fabric now spans >130,000 TPUs, provides ~47 Pb/s bisection bandwidth, and cuts latency by roughly 40%. Google explicitly splits the stack into pod-level scale-up, Virgo scale-out across pods, and Jupiter for front end/storage, validating that the aggressive changes are to the DCN (scale-out) rather than the 3D torus. TPU 8t reports a multi‑fold DCN bandwidth increase and TPU 8i is called out as the complementary advancement for optics. The result is a clear move from compute-limited designs to bandwidth-limited systems, underscoring the need to co-design compute, storage, and networking for modern AI hypercomputers.

By FundaAI

18 substack.com 2026-03-18 20 min read

Open

I’m Bullish On DERs. I’m Bearish On the Infrastructure Around Them.

Why it matters

There are >5 million distributed solar installations in the U.S. (SEIA, 2024), and companies like Base Power (residential batteries 25–50 kWh/home) and David Energy are already monetizing DER flexibility—Base Power raised >$1.3 billion and had multiple Texas co-op agreements by early 2026.

Key details

Distribution signals are largely absent: wholesale markets publish real-time LMPs, but feeder/transformer thermal constraints are rarely measured, priced, or shared, so DER software cannot target local relief.
ERCOT’s ADER pilot (launched 2022) exposed a hardware gap: grid-grade requirements (2-second telemetry, ±10% device validation, 5-minute basepoints) and an initial 80 MW cap led to ~15 MW qualified by late 2024; after Base Power qualified in summer 2025 ERCOT raised the cap to 160 MW then 200 MW and seven ADERs participated by December 2025.
Multiple structural barriers stem from cost-of-service regulation (COSR): utilities hold customer-level data and have weak incentives to share it because third-party DERs can defer ratebase investments; performance-based regulation (PBR) would realign incentives to encourage data sharing and DER integration.

Brief

Distributed energy resources (DERs) — solar, residential batteries, EV chargers, smart thermostats — have demonstrable commercial value, but the distribution-layer infrastructure, data flows, and regulation prevent that value from being realized. The market evidence includes >5 million U.S. solar installs (SEIA, 2024) and firms like Base Power (25–50 kWh batteries per home; >$1.3B raised; Texas co-op deals by early 2026) and David Energy (aggregating thermostats, batteries, EV chargers across NY, NJ, MA, TX) that monetize wholesale signals where they exist. By contrast, distribution constraints (feeder/transformer thermal limits) are rarely instrumented, priced, or exposed, so DERs cannot respond to the most valuable local events.

Technical and structural gaps are clear: ERCOT’s ADER pilot revealed that residential hardware and telemetry were not grid-grade (requirements: 2 s telemetry, ±10% validation, 5-minute basepoints), producing only ~15 MW qualified by late 2024 until vendors built to spec and ERCOT raised the cap to 200 MW in 2025. OEM APIs and cloud backends are designed for consumer convenience, not low-latency per-device validation; OEM lock-in and warranty/fee restrictions add commercial friction. Static NEM and many community-solar crediting schemes ignore time and location value. A persistent “topology floor” — lack of real-time device-to-substation/feeder/phase mapping — plus utilities’ incentives under cost-of-service regulation to favor ratebase investments keeps valuable flexibility idle. The author argues PBR, better data sharing, grid-grade hardware/standards, dynamic operating envelopes (as piloted in Australia), and upgraded topology sensing are needed to unlock DER value and avoid unnecessary capital-intensive network builds.

By Distributed Grid from Michael Lee

19 Twitter/X 2026-05-04 1 min read

Open

The US currently operates ~97 GW of nuclear capacity (~20% of electricity) and…

Why it matters

The US currently operates ~97 GW of nuclear capacity (~20% of electricity) and may need an additional 100–300 GW by 2050 driven in part by data center and AI load growth.

Key details

The US went from 64% of global enrichment capacity in 1985 to importing 27% of enrichment services from Russia in 2023; under 1% of uranium is domestically sourced.
Conversion and enrichment are chokepoints: ConverDyn Metropolis Works idled 2017–2023 and restarted in 2024 at ~7 kt/year (15 kt nameplate, meeting ~50% of demand); DOE awarded $2.7B in Jan 2026 and firms (Centrus 2029, Orano 2031, General Matter 2034) still fall short of demand—McKinsey estimates $105B–$170B to enable a 300 GW buildout, with reprocessing an added $20B–$45B and ~90,000 t of spent fuel in dry casks.

Brief

US nuclear fuel supply chain faces deep import dependence and capacity shortfalls: the country runs ~97 GW (~20% of electricity) but may need 100–300 GW by 2050. Conversion (ConverDyn at ~7 kt/y), limited domestic uranium (<1%), and slow enrichment/fabrication buildout (DOE $2.7B; projects into 2034) create a gap McKinsey prices at $105–170B plus $20–45B if reprocessing proceeds.

By @da_sails

20 substack.com 2026-03-18 4 min read

Open

Deep|LITE: Long-Term Upside Potential as a Core Player in AI Optical Interconnects

Why it matters

As of Mar 17, 2026 Lumentum (LITE) shares were up ~76% YTD with a $44.6B market cap after NVIDIA’s ~$2B strategic investment in early March and LITE’s inclusion in the S&P 500 on Mar 23, 2026.

Key details

Management frames growth on four pillars—Cloud Transceivers, Optical Circuit Switches (OCS), Scale-Out CPO, and Scale-Up CPO—underpinned by InP manufacturing, OCS optical engines, and the UHP laser platform as the company’s structural moat.
InP/EML scale: EML shipments expanded >8x from FY20 to FY26E, Lumentum plans to add 50%+ EML unit capacity by end-CY26 vs end-CY25, and management forecasts ~85% CAGR in AI data center InP optical lane demand from CY26–CY30.
Near-term commercial traction includes 1.6T volume shipments starting summer 2026 (CW laser vertical integration targeted in CQ3), a multi-year OCS agreement with >$400M backlog for 2H CY26 and a target >$1B OCS run rate in 2027, with OCS unit CAGR >150% from CY25–CY28.

Brief

Lumentum (LITE) is positioned as a central supplier for an industry transition toward optical scale-up, with management highlighting four revenue engines—Cloud Transceivers, OCS, Scale-Out CPO, and Scale-Up CPO—all relying on InP/EML, OCS engines, and UHP lasers. At OFC 2026 the company emphasized manufacturing scale: EML shipments grew >8x FY20–FY26E and Lumentum plans >50% more EML unit capacity by end-CY26. The firm is advancing 400G/lane Differential Drive EML for 3.2T (8×400G), arguing SiPho lacks bandwidth for 400G/lane, and projects ~85% CAGR in InP optical lane demand from CY26–CY30—implying an industry bottleneck at InP light sources. Commercially, 1.6T ramps to volume this summer, CW laser vertical integration is due in CQ3, and a multi-year OCS deal leaves >$400M backlog for 2H CY26 with a >$1B run rate target in 2027.

By FundaAI

21 Epoch AI 2025-11-04 7 min read

Open

OpenAI Stargate: where the US sites stand

Why it matters

Stargate is a $500 billion OpenAI–Oracle–SoftBank build‑out of seven US AI sites with over 9 GW of planned total facility power (comparable to New York City peak demand) and an estimated compute footprint equivalent to ~20 million H100 GPUs (end‑2025 H100‑equivalent basis).

Key details

Abilene, TX is the most advanced site: ~0.3 GW operational as of Apr 22, 2026 (≈250,000 H100‑equivalents), four of eight buildings active, projected to reach 1.2 GW (~1.0M H100‑equivalents) by Q4 2026; an earlier planned 2.1 GW expansion was canceled and Microsoft is building an adjacent 900 MW campus with Crusoe.
Other site projections and timelines: Shackelford County, TX 2.0 GW (4.2M H100‑eq) — Q4 2028; Doña Ana, NM 2.2 GW (4.6M) — Q4 2028; Milam County, TX 1.2 GW (2.5M) — Q4 2028; Port Washington, WI 1.3 GW (2.6M) — Q4 2028; Saline Township, MI 1.4 GW (2.9M) — Q4 2028; Lordstown, OH <0.3 GW (capacity/completion uncertain).
Design and risk notes: builders are using on‑site natural‑gas microgrids at ≥3 sites and closed‑loop liquid cooling at ≥6 sites to avoid grid interconnection delays and reduce evaporative water use; SoftBank owns hardware at Milam and Lordstown while Oracle owns the other sites; financing, equipment procurement, and local political opposition (e.g., Lordstown ban, Michigan pushback) could delay completion (target window through 2029).

Brief

Stargate is a nationwide, $500 billion data‑center program led by OpenAI with Oracle and SoftBank that targets seven US sites totaling more than 9 GW of planned facility power — roughly the scale of New York City’s single‑hour peak demand — and an aggregate compute capacity the authors estimate at ~20 million H100‑equivalents. Abilene, TX is furthest along (~0.3 GW operational as of Apr 22, 2026; 250k H100‑eq; four of eight buildings live) and is expected to reach 1.2 GW by Q4 2026. The six other sites (Shackelford 2.0 GW, Doña Ana 2.2 GW, Milam 1.2 GW, Port Washington 1.3 GW, Saline Township 1.4 GW, Lordstown <0.3 GW) are mostly slated for Q4 2028 handovers. Developers are favoring on‑site natural‑gas microgrids and closed‑loop liquid cooling to shorten grid interconnection timelines and limit water evaporation, but those choices raise cost and regulatory scrutiny; procurement, financing, and local opposition remain key risks to the 2029 build‑out target.

By Elliot Stewart, Ben Cottier

22 datacenterdynamics.com 2026-03-17 1 min read

Open

Whitepaper: Unlocking stranded power in AI data centers

Why it matters

DG Matrix whitepaper (published 17 March 2026 by Data Centre Dynamics) identifies 'stranded power' inside data centers as a structural constraint on AI infrastructure growth caused by mismatches between facility power architecture and dynamic AI workloads.

Key details

The paper advocates software-defined power infrastructure that dynamically routes energy across grid supply, on-site storage, and distributed resources to unlock additional deployable compute within existing utility limits.
It highlights four focus areas: causes of stranded power, how AI workload behavior exposes distribution gaps, dynamic energy-routing solutions, and the economic impact of reclaiming unused power capacity.

Brief

DG Matrix's whitepaper (published 17 March 2026 by Data Centre Dynamics) analyzes how 'stranded power' within facilities limits AI data-center growth and shows that conventional distribution can't match dynamic AI workloads. It proposes software-defined power infrastructure to dynamically route energy across grid, storage and distributed resources, unlocking additional deployable compute within current utility limits and improving economics.

By DCD Cloud & Hybrid

23 substack.com 2026-03-29 4 min read

Open

Executive Briefing: 33% of the world's helium supply just went offline. Your AI infrastructure plan doesn't accoun…

Why it matters

About 33% of the world’s helium supply came from a single industrial complex in Qatar that was hit by Iranian missiles and was reported offline three weeks before March 29, 2026; parts are destroyed and the regional shipping strait is closed.

Key details

The five largest U.S. cloud/AI providers plan $600–$700 billion in capex in 2026 (75% earmarked for AI infrastructure); Goldman Sachs projects hyperscaler capex of $1.15 trillion for 2025–2027 — all of which assumes chips from SK/TAIWAN fabs arrive on schedule.
Semiconductor fabs in South Korea and Taiwan require helium for EUV lithography machines, wafer cooling, and vacuum-chamber leak detection with no viable substitute; liquid-helium containers used in transport will vaporize (boil off) within 48 days, creating an operational deadline.
Nate flags three transmission channels to AI clusters — helium supply interruption, rising LNG/energy costs and delayed power oversupply, and a geopolitical shift (e.g., Russia→China pipeline gas) that could favor Chinese compute economics — compounded by an industry-wide memory shortage.

Brief

Qatar’s helium shutdown — a single industrial complex producing roughly a third of global helium went offline after Iranian missile strikes reported three weeks before March 29, 2026 — exposes a critical, unpriced dependency undergirding the global AI buildout. Helium is irreplaceable in EUV lithography, wafer cooling, and vacuum leak detection at fabs in South Korea and Taiwan; transport of liquid helium faces a 48‑day boil‑off window and regional shipping routes are closed. The timing is acute: the five largest U.S. cloud/AI providers plan $600–$700B capex in 2026 (75% for AI) and Goldman Sachs forecasts $1.15T hyperscaler capex across 2025–2027, all predicated on chip delivery. Nate outlines three propagation channels (helium, LNG/energy, and geopolitical compute-cost shifts), warns of memory shortages compounding delays, and argues the event could structurally advantage Chinese fabs if energy and supply realignments persist.

By Nate from Nate’s Substack

24 substack.com 2026-03-31 3 min read

Open

Earnings Calls Paint Dramatic Energy Growth Picture

Why it matters

AI-driven demand surge: utilities and independent power producers are winning long-term supply deals from tech giants to serve AI data centers, shifting demand growth expectations.

Key details

CenterPoint Energy (Q4 earnings): CEO Jason Wells said Greater Houston peak load is expected to grow 50% from 2025 to 2029 (a revision two years earlier than prior estimates) and the company expects annual operating earnings growth at the high end of its 7–9% target through 2028 and to exceed its prior 11% CAGR base-rate revenue estimate.
ERCOT long-term forecast (released August): summer peak demand rises from 87 GW in 2025 to 145 GW in summer 2031, including ~24 GW attributed to data centers and ~8.5 GW to cryptocurrency operations.
Market implications: historically volatile merchant-generator revenues (price-driven) and steady regulated-utility sales are both becoming more robust and more predictable due to large, multi-year tech contracts.

Brief

Earnings calls in the 2026 fourth-quarter season reveal a rapid, AI-fueled shift in U.S. electricity demand: tech companies locking long-term supply deals are driving both regulated utilities and merchant generators to plan for major load increases. CenterPoint Energy told investors that Greater Houston peak load should rise 50% between 2025 and 2029—an acceleration of its prior timetable—and the company now expects operating earnings growth at the high end of its 7–9% long-term target through 2028 and to beat a prior 11% CAGR base-rate revenue projection. Those company comments align with ERCOT’s August long-term forecast, which lifts summer peak from 87 GW in 2025 to 145 GW by 2031, explicitly counting about 24 GW of data-center load and 8.5 GW of crypto. The result is greater revenue visibility across the sector as AI demand reshapes planning and procurement.

By Texas Energy and Power Newsletter

25 Redefining Energy 2026-04-13 Podcast

Open

224. From Wind farms (yield) to Datacenters (growth) - Apr26

Why it matters

Paul O'Donnell (Greencoat Renewables) said Greencoat has invested about €13.5 billion and manages just under 8 GW of renewables across roughly 445 assets globally.

Key details

Paul said Greencoat launched in 2012/13 and listed Greencoat Renewables in 2017; since announcing a new strategy at the start of March 2026 the share price has risen about 20%.
Paul told hosts the company is pivoting from pure 'yield' assets to hybridized, actively managed sites: long-term PPAs with tech firms, adding storage and trading capability to capture arbitrage and flexibility revenues.
Paul said data centres now consume about 22–23% of Ireland's electricity (his estimate) and Greencoat has been developing integrated, energy-enabled sites and partnerships to serve large-scale digital infrastructure.

Brief

Greencoat Renewables’ Paul O'Donnell outlined how the yieldco model that dominated renewable finance for a decade has moved into a new phase of active operations, hybridization and growth into digital infrastructure. He traced the firm’s origins to 2012/13 and its listing in 2017, and reported total invested capital of about €13.5 billion, with just under 8 GW under management across roughly 445 assets. That long-term accumulation of government‑backed, contract-stable assets is giving way to a more complex market: higher interest rates, shifting policy after Europe’s energy crisis, and new demand drivers such as AI and data centre build‑out are forcing owners to become trading-savvy operators.

Paul described the practical response: long-term PPAs directly with tech companies, hybridizing wind/solar sites by adding batteries for arbitrage and grid services, and partnering to scale capabilities rather than trying to internalize every function. He cited a joint venture with a battery/storage provider (referred to in the interview as 'Coato') to access pricing and innovation — the partner, he said, had recently won roughly 80% of certain Italian opportunities. On storage he emphasized falling capex and attractive incremental costs when batteries are added to existing sites, and argued revenue stacks (day‑ahead arbitrage, balancing markets, capacity/flexibility services) now underpin investment cases. Turning to data centres, Paul said they already account for an estimated 22–23% of Ireland’s electricity and present both demand and policy tailwinds: governments want data centres to source renewables and provide grid backup/flexibility. He estimated a 100 MW data centre costs ~€1.5 billion to build plus €300–500 million of enabling energy infrastructure, and argued truly off‑grid, gas‑backed data centres are unlikely in Europe. Hosts agreed the strategic pivot — which they labelled 'yield to growth' — makes sense and welcomed energy investors entering the digital infra space, while also flagging structural risks for listed yieldcos (market marking, investor alignment and competition from fixed income). The market reaction to Greencoat’s March strategy — a roughly 20% share price rise — suggested investors rewarded the clearer growth and integration story.

By Redefining Energy

26 datacenterdynamics.com 2026-03-24 3 min read

Open

Nvidia announced an LPU-based LPX rack that is fully liquid-cooled and rated to…

Why it matters

Nvidia announced an LPU-based LPX rack that is fully liquid-cooled and rated to consume up to 160 kW per rack (same power draw reported for the neighboring Vera Rubin rack); announcement appears in the DCD Data Center Cooling newsletter dated 24 March 2026.

Key details

Ecolab agreed to acquire liquid-cooling specialist CoolIT for $4.75 billion from KKR, with the transaction reported as set to close later in 2026 as KKR divests.
Frore Systems raised $143 million at a $1.6 billion valuation; the liquid-cooling vendor says its novel coldplate design can unlock efficiency gains for AI data centers.

Brief

The Data Center Cooling newsletter (published 24 March 2026) highlights accelerating moves toward high‑density, liquid‑cooled AI infrastructure: Nvidia’s new LPU‑based LPX rack is fully liquid cooled and specified to draw up to 160 kW per rack, matching the nearby Vera Rubin deployment and signalling rising per‑rack power densities. In M&A, Ecolab will buy CoolIT from KKR for $4.75 billion, a deal expected to close later in 2026 that expands Ecolab’s portfolio in direct‑to‑chip and in‑rack cooling. Funding activity continues: Frore Systems closed a $143 million round at a $1.6 billion valuation, promoting a proprietary coldplate it claims improves thermal efficiency for AI workloads. The issue also points readers to technical resources— a Zutacore whitepaper on waterless direct‑to‑chip cooling and an nVent showcase on secondary fluid network design—underscoring industry focus on liquid architectures and system‑level reliability.

By DCD Data Center Cooling Newsletter

27 Twitter/X 2026-04-25 1 min read

Open

Stanford CS153Systems Session 8 (full video posted 2026-04-25) features Scott…

Why it matters

Stanford CS153Systems Session 8 (full video posted 2026-04-25) features Scott Nolan (General Matter) who claims energy — not algorithms — is the primary limit to AI scaling, labeling the constraint 'Energy Is The Real Limit' (09:03) and giving a 'Demand Growth Reality Check' (11:16) that predicts a 'Stranded Power Era' (13:17).

Key details

Nolan argues nuclear power is the practical baseload path for data centers (16:59–18:21), identifies a fuel-cycle and enrichment capacity gap that must be closed (18:21–21:41), and addresses nuclear safety and public narrative as prerequisites for deployment (28:17–30:22).
He cites bitcoin mining as a rehearsal for flexible, large-scale compute demand (21:41–25:03), urges building durable 'primitives' not short-term pivots (25:03–28:17), and calls for coordinated government support, supply-chain scaling, and focused team/site engineering to meet AI-driven jobs and spatial/timeline scaling requirements (37:18–42:37; 53:17–59:25).

Brief

Stanford CS153Systems Session 8 (video posted 2026-04-25) presents Scott Nolan of General Matter asserting that power constraints are the central bottleneck to AI scale. He maps AI factory demand, warns of stranded power, promotes nuclear as the baseload solution while highlighting enrichment and supply-chain gaps, and urges engineered sites, government support, and lessons from bitcoin mining to meet scaling timelines.

By @AnjneyMidha

28 substack.com 2026-03-03 24 min read

Open

Are AI Datacenters Increasing Electric Bills for American Households?

Why it matters

PJM’s 2025/26 Base Residual Auction (BRA) capacity price jumped ~9.3x from $29/MW‑day (2024/25) to ~$270/MW‑day, with the federal regulator later imposing a $329/MW‑day cap; the auction equates to roughly $16 billion in total capacity payments for PJM customers.

Key details

SemiAnalysis estimates the BRA spike will add about $25–$30/month to an average PJM household bill (assuming 880 kWh/month consumption and a 40% load factor), calculated by converting $329/MW‑day to $34/MWh (3.4¢/kWh).
PJM’s capacity pricing is driven by a central simulated supply‑demand construct (the VRR curve) that is highly sensitive to forecast inputs; PJM’s Internal Market Monitor (IMM) found removing all datacenters from the forecast would cut capacity payments by $9.33 billion (64%), and keeping only energized datacenters would lower payments by $7.74 billion (53%).
Supply‑side methodology changes and retirements reduced PJM offered capacity by ~35 GW over four years; accounting changes to natural‑gas plants alone removed ~14 GW from the capacity construct.

Brief

SemiAnalysis compares PJM (13‑state Eastern interconnection) and ERCOT (Texas) to assess whether AI datacenter growth is driving higher household electric bills. The analysis shows the immediate driver in PJM is not wholesale energy scarcity but the structure of PJM’s forward capacity market. PJM’s 2025/26 BRA cleared at about $270/MW‑day (versus $29/MW‑day in 2024/25), with regulators capping later auctions at $329/MW‑day. Because capacity is procured via a centrally modeled Variable Resource Requirement (VRR) curve determined by PJM’s own demand and supply forecasts, modest changes in projected datacenter load (PJM/IMM estimate ~7.9 GW incremental datacenter demand in 2025/26 and ~12 GW in 2026/27) re‑shaped the VRR near the clearing point and produced a simulated price shock. The IMM’s alternate runs show removing datacenters would reduce capacity payments by $9.33 billion (64%), demonstrating the auction’s sensitivity to forecast inputs. Translating the capped BRA into retail terms yields roughly $34/MWh of capacity cost (3.4¢/kWh), adding about $25–$30/month to an average PJM household (880 kWh/month, 40% load factor).

ERCOT’s experience contrasts sharply. ERCOT runs an energy‑only market with a real‑time Operating Reserve Demand Curve (ORDC) scarcity adder (price cap ~$5,000/MWh). Although ERCOT’s April 2025 Long‑Term Load Forecast showed an eye‑popping 77.9 GW of potential datacenter load by 2030, ERCOT applied aggressive haircuts (≈49–55% reductions) and delayed in‑service dates, and forward energy markets priced only an ~11–17% increase across 2026–2030 horizons—far below PJM’s capacity shock. Operational performance reinforced the divergence: Winter Storm Fern (Jan 24–27, 2026) saw PJM lose ~21 GW of generation despite high capacity payments, while ERCOT held with no major outages; DOE issued emergency orders and identified ~35 GW of backup generation at datacenters/industrial sites. SemiAnalysis attributes PJM’s outcome to market design and forecasting errors (including methodology changes that removed ~14 GW of gas capacity and ~35 GW of offered capacity declines over four years), construction and GPU supply‑chain delays that make PJM’s datacenter build‑out forecasts optimistic, and regulatory fragmentation (13 states + FERC) that slows reform. The piece concludes the principal fault lies in policy and market structure rather than in AI demand per se, and highlights regulatory asymmetry (ERCOT’s faster reforms like SB 6 versus PJM’s multi‑jurisdictional constraints) as a durable, investable difference for hyperscalers and power sector participants.

By SemiAnalysis

29 divenewsletter.com 2026-05-06 5 min read

Open

American Electric Power is reviewing exits from PJM and SPP over slow generation…

Why it matters

American Electric Power is reviewing exits from PJM and SPP over slow generation interconnection amid surging customer demand; AEP utilities have contracts for 63 GW of new large load across their multistate footprint by 2030 (Utility Dive, May 6, 2026).

Key details

Xcel Energy’s CEO Bob Frenzel said the company’s tariff deal with Google creates a template for courting large loads while doubling down on renewables as the primary supply strategy.
Tennessee Valley Authority generation is now 41% nuclear; interim CEO Mike Skaggs said TVA will clarify its position on new nuclear technologies and coordinate with the federal administration and its board on future paths.
Duke Energy added 2.7 GW of contracted data center load in Q1 2026, bringing total executed data center agreements to 7.6 GW — nearly two-thirds of which are already under construction — while keeping its $103 billion capital plan unchanged from 2025.

Brief

U.S. utility sector headlines (Utility Dive, May 6, 2026) center on grid strain from rapid large-load growth and the evolving role of nuclear and renewables. American Electric Power is weighing leaving PJM or SPP because slow generation interconnection is constraining its response to contracts for 63 GW of new large load by 2030. Xcel’s tariff agreement with Google is being positioned by CEO Bob Frenzel as a repeatable model to attract hyperscalers while accelerating renewables. TVA reports nuclear now supplies 41% of its power and interim CEO Mike Skaggs seeks clarity on new nuclear technologies and federal coordination. Duke added 2.7 GW of data-center contracts in Q1 (total 7.6 GW, ~66% under construction) and maintained a $103 billion capital plan. Meanwhile, NERC’s Level 3 alert requires seven prescribed actions by Aug. 3, 2026 to mitigate immediate risks from computational load losses.

By Utility Dive

30 substack.com 2026-05-01 26 min read

Open

AI Value Capture - The Shift To Model Labs

Why it matters

Agentic AI crossed a practical inflection in December 2025, driving token value up and consumption higher—SemiAnalysis reports Anthropic ARR rising from $9B to over $44B year-to-date and inference gross margins expanding from <40% to >70%.

Key details

Hardware and software advances have slashed token production cost: Blackwell-class chips can produce ~30x more tokens/sec on frontier workloads vs Hopper a year earlier; software stacks (wideEP + disagg + MTP) can lift tokens/sec on the same B300 GPU ~1k → ~8k → ~14k (up to 14x by software alone).
SemiAnalysis cites internal usage metrics showing annual Anthropic token spend as high as $10.95M, token consumption ~5 billion tokens/month per employee (≈5x Meta) with Claude Code input:output ratios ~300:1 and cache hit rates >90%, making blended token cost for Opus agentic workloads closer to $0.99/MTok versus sticker $5–$25/MTok.
Memory has become a major cost and bottleneck: DRAM pricing rose ~6x over the past year; 1-year H100 rental contract prices were ~40% up from the October 2025 trough; SemiAnalysis estimates Nvidia’s SOCAMM cost at ~$8/GB in 1Q26 with a plausible exit-2026 SOCAMM cost >$13/GB and a working assumption around ~$10/GB.

Brief

AI Value Capture - The Shift To Model Labs focuses on how agentic AI, hardware advances, and software optimizations have recomposed profit pools so that frontier model labs (e.g., Anthropic) are now capturing the lion’s share of value. The brief documents a rapid structural change beginning December 2025: agentic workflows (code agents, multi-turn assistants) drive much higher token utility and demand, increasing model monetization while the cost of producing tokens has plunged. SemiAnalysis quantifies this with internal examples—annual Anthropic token runs up to $10.95M, per-employee consumption near 5B tokens/month, Claude Code input:output ratios ≈300:1 and cache hit rates >90%—and links those usage patterns to Anthropic ARR growth (reported from $9B to >$44B) and gross margin expansion (<40% → >70%).

The piece documents the technical levers behind the shift. New accelerators (Blackwell/GB300/VR NVL72) and ASICs (TPUv7, Trainium3) plus middleware improvements (wideEP, disaggregation, MTP) multiply tokens/sec/gpu: software alone can produce ~14x gains on B300; combined with hardware, SemiAnalysis reports GB300 NVL72 configurations delivering ~17x higher throughput vs H100 in FP8 and up to ~32x in FP4. Meanwhile memory scarcity and pricing have surged (DRAM up ~6x year-over-year, one-year H100 rental +40% since Oct 2025), making Nvidia’s socketed SOCAMM a crucial pricing lever—SemiAnalysis models SOCAMM at ~$8/GB in 1Q26 with potential to exceed $13/GB by end-2026 and assumes ~$10/GB as a working figure. System-level economics are surprising: capex per watt barely changes from GB300 ($37.4/W) to Rubin (VR NVL72 $38.1/W) despite large TDP and FLOP increases, leaving big room between cost-based rental floors (~$4.92/hr/GPU for a 15.6% IRR) and value-based ceilings (~$12.25/hr/GPU parity, conservative ~$9.63/hr). The authors present a ‘One Chart To Rule Them All’ that combines floor and ceiling constraints with Neocloud IRR curves to illustrate how incremental pricing by Nvidia or TSMC could shift captured value. Finally, SemiAnalysis argues that although TSMC and Nvidia could extract more given N3 and memory tightness (TSMC N3 >100% utilization expected H2 2026; DRAM fabs >90%), both firms have so far held pricing in part for strategic/regulatory reasons—meaning short-term value accrues disproportionately to model labs, inference providers, neoclouds and memory vendors unless system suppliers move to value-based pricing.

By SemiAnalysis

31 substack.com 2026-05-09 12 min read

Open

Power Bottlenecks & The AI Data Center

Why it matters

Sightline Climate found 12 GW of US data‑center capacity announced for 2026 across 140 projects, but only ~5 GW is under construction and ~11 GW remains in the “announced” stage; 25% of announced projects lack a disclosed power strategy (tweet cited on All‑In podcast).

Key details

High‑voltage (HV) transformer lead times have stretched from ~24–30 months pre‑2020 to roughly 5 years today, creating a primary bottleneck for grid‑connected AI data centers; electrical equipment is <10% of capex but is the gating constraint.
Bring‑Your‑Own‑Power (BYOP)/on‑site generation is rising: regulators in July 2025 approved Williams to build onsite gas power for a Meta campus near Columbus, and Oklahoma passed 2025 legislation allowing companies to build their own power.
Gas turbine supply is constrained: GE Vernova reports ~3‑year lead times with limited 2029–30 slots (≈10 GW remaining across those years), while HSBC projects global turbine production capacity rising from ~70 GW in 2024 to >90 GW by 2029.

Brief

Power infrastructure — not GPUs — is emerging as the immediate choke point for the AI data‑center buildout. Sightline Climate data (cited on the All‑In podcast) shows 12 GW announced for US 2026 capacity across 140 projects while only ~5 GW is under construction; many projects lack concrete power plans. The largest single constraint is high‑voltage transformers and substation lead times, which have lengthened from ~24–30 months pre‑2020 to about five years today. That mismatch means hyperscalers that locked PPAs and electrical orders 3–4 years ago will get priority while others face multi‑year waits, idle shells, and depreciating GPU hardware.

Developers are increasingly using on‑site generation to avoid HV interconnection delays. BYOP and islanded microgrids are moving from stopgap to strategic choices: regulators approved Williams’ onsite natural‑gas plan for Meta in July 2025, and Oklahoma legalized self‑build power in 2025. Traditional gas turbines remain capacity‑constrained (GE reports ~3‑year waits; HSBC forecasts turbine capacity rising 70 GW→>90 GW by 2029), are noisy and polluting, and face long delivery schedules. Fuel cells — notably Bloom Energy’s SOFCs (native 800 V DC, 1.5 GW installed, Oracle order up to 2.85 GW, claimed 5 GW/yr manufacturing) — offer quieter, low‑emission, transformer‑avoiding alternatives. Goldman’s LCOE work shows fuel cells are sensitive to gas prices (≈40% more expensive at $4/MMBtu, but ~7% cheaper at $13.3/MMBtu). ABB reports switchgear capacity is expanding, making transformers and generation the primary bottlenecks for near‑term AI data‑center deployment.

By Tech Investments

32 divenewsletter.com 2026-03-21 5 min read

Open

Weekender: As data centers go off-grid, utilities face new cost and planning risks

Why it matters

Opinion by Brandon Owens and Morgan Bazilian (Utility Dive Weekender, March 21, 2026): industry disclosures indicate that by the end of the decade a meaningful share of new data center capacity could be partially or fully self‑supplied (i.e., off‑grid), creating planning and revenue risks for utilities.

Key details

EIA data: electric‑sector natural gas use fell 3% in 2025, attributed in part to rising solar and battery deployments, even as total U.S. gas consumption hit a record in 2025 because other sectors’ use increased.
EIA high‑demand scenario: accelerated data center buildout could drive 2025–2027 annual load growth up to 15% higher in Texas and 4.7% higher in PJM, and could contribute to a potential 79% ERCOT price increase in 2027 under that scenario.
Utility and regulatory responses: PPL Electric reached a $275 million rate‑case settlement that includes a data‑center tariff raising average residential bills 4.9% and requiring large loads to sign agreements of at least 10 years; separately FERC approved SPP’s merger of interconnection and transmission planning to speed new generation connections.

Brief

As data centers pursue on‑site generation and self‑supply, utilities face new cost, planning and reliability risks, write Brandon Owens and Morgan Bazilian in Utility Dive’s March 21, 2026 Weekender. Industry disclosures predict a sizable share of new capacity could be partially or fully off‑grid by the end of the decade, complicating load forecasts and revenue recovery. The Energy Information Administration reported electric‑sector gas use fell 3% in 2025 due to more solar and batteries even as overall U.S. gas consumption reached a record; its modeling also shows a high‑demand data center buildout could raise 2025–2027 load growth by ~15% in Texas and 4.7% in PJM and, in an extreme case, contribute to a 79% ERCOT price spike in 2027. Utilities and regulators are responding with long‑term contracts and tariffs (e.g., PPL’s $275M settlement, 4.9% residential bill impact, 10‑year minimum agreements) and planning reforms such as FERC’s approval of SPP planning merger.

By Utility Dive

33 Twitter/X 2026-04-30 1 min read

Open

On 2026-04-30, Javier Blas reported Belgium plans to nationalize all of the…

Why it matters

On 2026-04-30, Javier Blas reported Belgium plans to nationalize all of the country's nuclear power reactors and that decommissioning activities are "halted with immediate effect."

Key details

Bart De Wever tweeted an agreement with ENGIE to define conditions and launch studies for a full takeover of the Belgian nuclear park.
The government frames the move as a choice for "secure, affordable and sustainable" energy to cut dependence on fossil imports and increase control over domestic supply.

Brief

Belgium plans to nationalize all its nuclear reactors and has halted decommissioning immediately. An accord with ENGIE will define terms and start studies toward a full takeover of the Belgian nuclear park, a move the government calls necessary for secure, affordable, sustainable energy and reduced dependence on fossil imports.

By @JavierBlas

34 substack.com 2026-03-12 14 min read

Open

The Great AI Silicon Shortage

Why it matters

AI token demand surge is driving a compute shortage: Anthropic added $6B of ARR in February 2026 from Claude Code, and on‑demand GPU prices (including Hopper‑class cards) continue to rise as clusters are fully allocated.

Key details

TSMC N3 capacity is the critical bottleneck: SemiAnalysis models estimate AI-related demand will consume ~60% of N3 wafer output in 2026 and ~86% in 2027, with effective N3 utilization exceeding 100% in H2 2026.
Major accelerator roadmaps converged on N3 in 2026: NVIDIA moves Rubin to N3P (from Blackwell on 4NP), Google TPUv7 is on N3, AWS Trainium3 on N3P, and AMD’s MI350X/MI400 tiles use N3 — creating a shock to N3 wafer starts.
Memory (HBM) is the next choke point: HBM uses ~3x wafer capacity per bit vs. commodity DRAM (potentially ~4x with HBM4), customers target ~11 Gb/s HBM4 pin speeds, and several accelerators move from 8‑Hi to 12‑Hi stacks.

Brief

The Great AI Silicon Shortage frames a 2026 supply shock driven by explosive AI compute demand and constrained advanced wafer and memory capacity. SemiAnalysis shows a rapid industry shift onto TSMC’s N3 family — NVIDIA’s Rubin (N3P), Google TPUv7 (N3), AWS Trainium3 (N3P), AMD MI350X/MI400 tiles and even Vera CPUs — leading AI workloads to claim ~60% of N3 wafer output in 2026 and an estimated ~86% in 2027. TSMC is running N3 above nameplate in H2 2026 and is accelerating capex after only surpassing its prior peak in 2025, but cleanroom space limits immediate scale‑up. Memory is the secondary bottleneck: HBM consumes roughly 3× wafer capacity per bit versus commodity DRAM (rising toward 4× with HBM4), customers are targeting ~11 Gb/s for HBM4 pins, and accelerators are increasing stack height (8‑Hi → 12‑Hi). SemiAnalysis quantifies reallocation scenarios — e.g., 5% of smartphone N3 wafer starts (from 437k) could yield ~0.1M Rubin GPUs — but notes packaging (CoWoS/OSAT/EMIB) and HBM availability must also be solved. The piece highlights market consequences: hyperscalers prioritizing N3 allocation, potential foundry diversification to Samsung/Intel, and elevated DRAM/HBM pricing and negotiations shaping 2027 supply dynamics.

By SemiAnalysis

35 datacenterdynamics.com 2026-03-09 1 min read

Open

Your streaming link: Engineering the future of cooling

Why it matters

DCD streamed a three‑part event on Tuesday, 10 March 2026 with sessions at 9am, 10am, and 11am ET (1pm GMT for the 9am slot), hosted by James Raddings (Digital Portfolio Lead, DCD).

Key details

Speakers included Chris Stocker and Dr Ramsatish Kaluri (Siemens), Jaskeerat Singh (Nortek Data Center Cooling), Chris Palmer (Ark Data Centres), David Hall (Tillion Data Centers), James Repass (Fleet Data Centers), Sybrand Pretorius (Johnson Controls) and Kyle Graebner (Armada); topics covered physics‑based simulation, digital twins, advanced controls, grey‑to‑white‑space liquid cooling integration, and modular/prefabricated deployments for high‑density AI workloads.

Brief

Engineering the future of cooling — DCD streamed a three‑part broadcast on Tuesday, 10 March 2026 (9am, 10am, 11am ET) featuring Siemens, Nortek, Ark, Tillion, Fleet, Johnson Controls and Armada speakers. Sessions emphasized engineering methods (physics‑based simulation, digital twins, advanced controls), integrating liquid cooling from grey to white space, and modular/prefabricated solutions to scale high‑density AI deployments.

By Kat Sullivan

36 divenewsletter.com 2026-02-24 5 min read

Open

PJM on Feb.

Why it matters

PJM on Feb. 24, 2026 proposed behind-the-meter market reforms intended to address data center colocation; trade groups warn the changes could undermine the economics of new combined heat-and-power (CHP) at industrial facilities.

Key details

Dominion Energy released a $65 billion, five-year capital plan (announced Feb. 24, 2026) driven by data center load forecasts and progress on the Coastal Virginia Offshore Wind project.
Con Edison is planning approximately $38 billion in capital spending through 2030 to support rising demand from electric vehicles and building electrification while addressing reliability and affordability challenges.
Entergy Texas issued a request for proposals for gas-fired generation able to be online by 2032 to meet roughly 1 GW of expected new industrial and residential demand.

Brief

The Feb. 24, 2026 Utility Dive briefing highlights shifting resource and capital plans as load growth and data center demand reshape utility strategy. PJM’s proposed behind-the-meter reforms — part of a data center colocation effort — have drawn pushback from trade groups that say the changes would harm the economics of new CHP at industrial sites. Major utilities are responding with large capital programs: Dominion disclosed a $65 billion, five-year plan tied to data-center-driven load growth and the near-completion of Coastal Virginia Offshore Wind; Con Edison projects about $38 billion through 2030 for EVs and building electrification. In Texas, Entergy issued an RFP for gas-fired units deliverable by 2032 to meet ~1 GW of new demand. Separately, analysts report easing supply constraints for gas equipment after capacity expansions at GE Vernova, Siemens and Mitsubishi Heavy Industries.

By Utility Dive

37 substack.com 2026-02-25 41 min read

Open

Who Pays for Texas Grid Growth? - Roundtable Discussion

Why it matters

Roundtable on Feb 25, 2026 with hosts Micalah Spenrath, Joshua Rhodes and Matt Boms focused on ERCOT’s historic load growth driven by data centers, population and electrification.

Key details

Texas T&D has been spending roughly $9 billion/year (about $5B distribution, $4B transmission) — roughly $80 billion over the past decade — even before the recent load surge.
TAEBA analysis cited in the discussion estimates ~2.5 GW of distributed energy resources (DERs) over 10 years could save about $1,850 per ratepayer by deferring T&D upgrades and adding wholesale market value.
Capital costs have jumped: typical combined-cycle gas plant capex assumptions rose from ~$1,000/kW historically to about $2,500–$4,500/kW in recent studies; pad‑mount transformer prices have increased ~200%, raising short‑term rate‑shock risk.

Brief

Texas’s electricity system is entering a period of rapid, concentrated load growth—driven by data centers, population increases, and electrification—that is colliding with sharply higher capital costs and traditional utility incentives. Participants Micalah Spenrath, Joshua Rhodes and Matt Boms summarized the problem: transmission and distribution (T&D) spending in Texas has already averaged about $9 billion per year (roughly $5B distribution, $4B transmission) over the past decade, and utilities are filing major rate cases (the episode flags an Oncor $830M request and references resiliency programs of roughly $1B/year for some TDUs). At the same time, generation and hardware costs have escalated: combined‑cycle gas capex assumptions now often sit in the $2,500–$4,500/kW range (up from ~$1,000/kW previously) and pad‑mount transformer prices have reportedly risen ~200%. Those cost shifts increase the risk of short‑term “rate shock” if spending is front‑loaded while demand remains uncertain.

The panel argued the choice architecture matters: when infrastructure is built primarily for a single customer (“driveway” upgrades) those customers should bear the capital; when upgrades provide broad system value, costs can be socialized. They highlighted commercial examples—reported proposals where large tech loads (Meta, Microsoft) offer to fund dedicated plants (a cited ~366 MW El Paso gas plant)—and suggested borrowing mechanisms from Alberta where new loads pay upfront and are repaid over time if they persist. The Texas Advanced Energy Business Alliance (TAEBA) DER study shared on the call modeled ~2.5 GW of customer‑side DERs over 10 years and attributed about $1,850 in savings per ratepayer by combining T&D deferral benefits with wholesale market value. Panelists recommended policy fixes: expand price signals at the distribution edge, authorize larger utility pilots for virtual power plants and advanced DERs, and reorient incentives from pure cost‑of‑service toward performance‑based regulation. Those steps, they argued, can defer expensive poles‑and‑wires investments, align cost causation with the parties driving growth, and reduce the likelihood that broad ratepayers shoulder the risk of potentially transient load buildouts.

By Texas Energy and Power Newsletter

38 Twitter/X 2026-05-09 6 min read

Open

Colossus 1 contained ~220,000 NVIDIA GPUs

Why it matters

Colossus 1 contained ~220,000 NVIDIA GPUs: ~150,000 H100s, 50,000 H200s, and 20,000 GB200s, forming a heterogeneous cluster that produced an 11% MFU for training versus 40%+ MFU at Meta/Google.

Key details

Heterogeneity caused severe straggler effects in distributed training (100k GPUs must finish a step before advancing) and exposed NCCL/ring-topology limits at the 100k+ scale; Google avoided this with custom OCS topologies.
GB200 (Blackwell) power-smoothing hardware clashes with xAI’s Hopper-optimized software stack, reportedly causing chips to overheat or 'melt' unless the modeling stack is rewritten for the new silicon.
xAI moved training to Colossus 2 (a 100% Blackwell homogeneous cluster) and leased entire Colossus 1 to Anthropic for inference (single-tenant, 300MW), converting a low-MFU training asset into ~$2.60/GPU-hour rental income.

Brief

xAI handed its 220,000‑GPU Colossus 1 to Anthropic because the mixed-generation, heterogeneous cluster (≈150k H100, 50k H200, 20k GB200) was inefficient for distributed training: straggler effects, NCCL/ring‑topology latency at the 100k+ scale, and GB200 power‑smoothing issues that require a rewritten stack produced just ~11% MFU versus 40%+ at Meta/Google. xAI preserved training on Colossus 2, a homogeneous Blackwell fleet, and leased Colossus 1 as a single-tenant inference farm (300 MW) to Anthropic. Inference tolerates heterogeneity and removes multi‑tenant jitter, letting the asset earn an estimated ~$2.60/GPU‑hour or $5–6B/year—nearly hedging xAI’s annualized Q1‑2026 losses—and positioning SpaceXAI favorably ahead of a planned SpaceXAI IPO (~$1.75T) while materially boosting Anthropic’s deliverable capacity in the May 2026 compute race.

By @jukan05

39 YouTube 2024-12-28 Video

Open

So You Want to Build a Nuclear Reactor

Why it matters

The 2020 U.S. DOE advanced reactor competition picked two winners: X-energy (a TRISO-fueled pebble-bed SMR) and TerraPower’s Natrium (a sodium-cooled SMR with molten-salt heat storage).

Key details

X-energy’s design uses TRISO fuel: microscopic fuel kernels with carbon/ceramic coatings (described as 'grain-of-sand' kernels assembled into golf-ball-size pebbles), enabling continuous pebble refueling and very high outlet temperatures for both electricity and process heat.
TerraPower’s Natrium is a sodium‑cooled fast-spectrum reactor paired with a molten-salt thermal storage system; its chosen site is a former coal plant in Wyoming, it has an accepted environmental impact statement and a non‑nuclear construction permit, and the project team has submitted a ~1,700‑page Preliminary Safety Analysis Report to the NRC.
Commercial partners and funding: the DOE program provides major funding; Amazon has invested in the Natrium project and agreed to offtake electricity for AWS data centers; Dow Chemical is investing in the X‑energy Seadrift, Texas site to transition a chemical plant from coal to nuclear heat/electricity.

Brief

Professor David Rusic (Illinois Energy Prof) gives a presentation-style update on U.S. small modular reactor (SMR) progress, focusing on the two DOE-backed winners from the 2020 competition: X‑energy’s TRISO-fueled pebble-bed design and TerraPower’s Natrium sodium‑cooled reactor with molten‑salt thermal storage. He explains TRISO fuel as microscopic kernels coated with carbon/ceramic shells, assembled into golf‑ball‑size pebbles that permit continuous on-line refueling and strong containment of fission products; high neutron energies in these fourth‑generation concepts enable transmutation of long‑lived wastes. Natrium uses a sodium coolant and external molten‑salt storage so a plant can run continuously while dispatching electricity on demand from stored heat. Rusic walks through the multi‑year, capital‑intensive permitting sequence (site ownership, environmental impact statement, NRC preliminary safety analysis — ~1,700 pages for Natrium — non‑nuclear and nuclear construction permits, as‑built documentation, then operating license). He names sites and partners: Natrium’s Wyoming coal‑to‑nuclear conversion (Amazon committed as offtaker) and X‑energy’s Seadrift, Texas project adjacent to Dow Chemical, with X‑energy’s TRISO fuel assembly facility planned near Oak Ridge, TN. He judges Natrium’s schedule as potentially operational by 2029 (grid delivery ~2031) and X‑energy likely later (optimistic 2029, more likely ~2032), noting fuel‑fabrication and licensing remain key bottlenecks.

By Illinois EnergyProf

40 substack.com 2026-05-05 7 min read

Open

Jensen Huang: Your Ambition Is ‘Not High Enough’

Why it matters

At the Milken Institute Global Conference in Los Angeles (interviewed Monday, reported May 5, 2026), Jensen Huang said agentic AI has made AI “useful” in the last several months and cited Anthropic’s Claude Code as a turning point.

Key details

Huang estimated agentic/decision-making systems require roughly 1,000× more computation than generative models, driving GPU consumption “through the roof” and pushing prices for GPUs sold four to five years ago higher.
Nvidia’s upcoming Vera Rubin racks are described as ~3 tons, ~1.5 million parts per rack, containing silicon photonics, 3D packaging and liquid cooling — costing about $4–5 million per rack and deployed by the rack-theater scale (football-field sized) in data centers.
In the last 3–6 months Huang says gross margins at AI-native firms (including OpenAI and Anthropic) have “gone extremely positive,” prompting a race for capacity; he also argued AI will create jobs and enable trillion-dollar-scale reindustrialization (chip plants, computer plants, AI factories).

Brief

Nvidia CEO Jensen Huang told an audience at the Milken Institute Global Conference (interviewed Monday; article published May 5, 2026) that the recent rise of agentic AI — exemplified by Anthropic’s Claude Code — has made AI “useful” in the last several months and driven a dramatic surge in compute demand. He estimated agentic systems need roughly 1,000× the computation of generative models, causing even four- to five-year-old GPUs to appreciate in price. Huang detailed Vera Rubin server racks as ~3‑ton assemblies with ~1.5 million parts, silicon photonics, 3D packaging and liquid cooling, costing $4–5 million each. He said gross margins at AI-native firms have inflected positive over the past 3–6 months, spurring capacity races, and argued AI will create jobs, accelerate scientific discovery (what took months can take a day), and require massive reindustrialization — raising ambition by ~100×.

By Tae Kim from Key Context Newsletter

41 Twitter/X 2026-04-30 1 min read

Open

Joe Tegtmeyer researched nearly 100 new Giga Texas permits (shared with his X…

Why it matters

Joe Tegtmeyer researched nearly 100 new Giga Texas permits (shared with his X Subscribers and Patreons) and highlighted one permit that explicitly references DH5, indicating at least five Data Halls in the Cortex 2.0 build-out (post published 2026-04-30).

Key details

According to Tegtmeyer’s summary of permits and Tesla’s statements, Cortex 2.0 was partially operational as of late April 2026 (likely with Data Halls 1, 2 and possibly 3 active) and is expected to reach full operation this fall, targeting over 130,000 H100‑equivalent GPUs to support FSD/Autopilot training, Optimus humanoid development, and other AI/robotics workloads.
The DH5 permit specifically covers structural steel design work foundational for housing the high-density power, cooling, and racking systems required for a modular data‑hall (Data Hall 5) within the Cortex 2.0 supercomputing facility.

Brief

Cortex 2.0: Joe Tegtmeyer reports that his review of nearly 100 Giga Texas permits (shared with subscribers) includes one referencing DH5, implying at least five data halls. Tesla indicated partial operation in late April 2026 (likely DH1–3), with full build‑out expected this fall and a target capacity above 130,000 H100‑equivalent GPUs for FSD, Optimus, and other AI/robotics workloads; the DH5 permit covers structural steel for power, cooling, and rack infrastructure.

By @JoeTegtmeyer

42 datacenterdynamics.com 2026-03-23 1 min read

Open

Power availability is becoming the defining constraint on digital growth

Why it matters

Power availability is now the primary constraint on digital infrastructure growth (DCD Mission Critical Power, published 2026-03-23), with rising demand from AI, cloud and high-performance workloads stressing grid capacity, deployment timelines, and facility electrical architectures.

Key details

DCD's Critical Power Supplement recommends technical responses including grid-interactive architectures, on-site generation, distributed energy and hybrid electrical architectures, solid-state and advanced UPS systems, modular infrastructure, microgrids, and intelligent energy management to support higher-density AI workloads and speed to power.

Brief

Power availability is fast becoming the defining constraint on data center growth as AI, cloud and high-performance workloads increase demand and strain grid capacity, deployment timelines and electrical architecture. DCD's Critical Power Supplement (23 March 2026) examines solutions — grid-interactive designs, on-site generation, distributed/hybrid electricals, solid-state and advanced UPS, modular builds, microgrids and intelligent energy management — to enable higher densities and resilience.

By DCD Mission Critical Power

43 Twitter/X 2026-05-04 1 min read

Open

The U.S. completed only 888 miles of 345 kV+ high-voltage transmission in 2024…

Why it matters

The U.S. completed only 888 miles of 345 kV+ high-voltage transmission in 2024, down from nearly 4,000 miles in 2013, after a decade-long construction lull.

Key details

Planned transmission spending has surged: PJM's plan rose from $920 million in 2021 to almost $12 billion in 2025, and U.S. utilities plan $1.4 trillion of spending by 2030 with nearly half for transmission and distribution.
Demand-driven projects are driving new builds: Texas has four 765 kV lines under development for the Permian Basin; Virginia is building a 115-mile 765 kV line because Dominion receives 2–3 GW of new data center requests per month — yet some previous backers now oppose public cost-sharing despite deep-pocketed private beneficiaries.

Brief

High-voltage transmission is surging in the U.S. after a decade-long lull: 888 miles of 345 kV+ lines were completed in 2024 (vs. ~4,000 in 2013). Planned spending jumped—PJM from $920M (2021) to ~$12B (2025)—and utilities plan $1.4T by 2030 (about half T&D). Some former boosters resist public cost-sharing even as private demand (data centers, Permian) is ready to pay.

By @xiaowang1984

44 substack.com 2026-03-20 7 min read

Open

An Interview with Nvidia’s Ian Buck: Why AWS Is Buying a Million GPUs

Why it matters

Amazon Web Services has committed to buying more than 1 million Nvidia GPUs “through 2027,” with Nvidia VP Ian Buck expecting a million-plus GPUs delivered by year-end 2027.

Key details

The expanded AWS–Nvidia deal includes deployment of Groq 3 LPUs for ultralow‑latency inference and joint adoption of ConnectX networking and AI Ethernet / Spectrum‑X to support east‑west datacenter traffic.
Nvidia expects multi‑chip, multi‑datacenter ramps (Blackwell and Vera Rubin families); customers are placing orders in advance, Vera Rubin’s first ramp is “happening in the second half,” and Nvidia cites ~6+ months wafer‑to‑rack manufacturing lead time.
Ian Buck said Nvidia and AWS are jointly engineering fleet health for large customers (including helping scale OpenAI), and Jensen Huang estimated LPUs might handle roughly 25% of high‑throughput agentic workloads; the deal also includes RTX PRO 4500 Blackwell Server support to accelerate Amazon EMR.

Brief

Nvidia VP Ian Buck characterized the expanded Nvidia–AWS partnership as a multi‑chip, hyperscale effort that will deliver more than one million Nvidia GPUs through the end of 2027 and add Groq 3 LPUs plus ConnectX / Spectrum‑X networking for high‑performance east–west traffic. Buck said AWS and Nvidia are coordinating long‑range forecasting, with Vera Rubin ramps “in the second half” and roughly six months from silicon wafer to racks. The companies are jointly monitoring fleet health and swarming engineering to support large customers (Buck cited work with OpenAI), while Jensen Huang estimated LPUs could handle ~25% of certain ultra‑high‑throughput agentic inference workloads; the agreement also includes RTX PRO 4500 Blackwell Server acceleration for Amazon EMR and reflects AWS’s larger datacenter push amid a $200 billion 2026 capex outlook.

By Tae Kim from Key Context Newsletter

45 substack.com 2026-03-25 36 min read

Open

Texas Growth Is Running Into Power Grid Limits with Katie Coleman

Why it matters

Katie Coleman (managing partner, O’Melveny & Myers LLP) says Texas faces a surge of large interconnection requests (data centers, crypto, AI hyperscalers) that strain ERCOT’s transmission planning and resource allocation (podcast published Mar 25, 2026).

Key details

Transmission is a regulated, rate‑payer funded function while generation is private‑risk; Coleman warns unresolved transmission cost allocation and planning could shift burdens onto residential/commercial ratepayers.
Industrial customer profile: >50% of industrial sites are 'high load factor' and largely inflexible; ~25–30% actively provide demand response or ancillary services (ERS, responsive reserve), and for some manufacturers electricity is up to 70% of production costs.
Winter Storm URI (2021) was driven primarily by generation performance problems (Coleman cites ≈50% of the gas fleet on outage and a then reserve margin around 25%), not purely insufficient installed capacity — highlighting importance of operational readiness and weatherization.

Brief

Katie Coleman, a leading Texas energy regulatory attorney and managing partner in O’Melveny & Myers’ Austin office, explains in a March 25, 2026 Energy Capital podcast how a wave of very large load interconnection requests—driven by AI data centers, hyperscalers, and third‑party site developers—has exposed transmission planning and cost‑allocation limits in ERCOT. Coleman emphasizes the market split: generation investment is borne by private capital under the energy‑only design, whereas transmission remains a regulated activity that the Public Utility Commission authorizes and ultimately pays for through customer rates. That asymmetry is creating a policy squeeze as planners try to decide which speculative or staged projects are “real” and how much transmission to build now versus later without unfairly loading existing ratepayers.

Coleman walks through how different industrial customers behave in scarcity events: more than half of industrial sites are high‑load, reliability‑constrained operations (not price responsive), roughly 25–30% can and do participate in demand response or ancillary services (Emergency Response Service, responsive reserve), and some operators can opportunistically curtail when prices spike. She disputes narratives that large users “profit” from emergency pricing, noting ancillary revenues are typically mitigation, not net profit, for firms that may spend hundreds of millions annually on power (up to 70% of production costs for some). Coleman points to Winter Storm URI (2021) as an example where outages resulted from generation performance (she cites about half the gas fleet offline and a healthy reserve margin beforehand) rather than the market construct. On policy, she highlights Senate Bill 6’s requirement for earlier financial commitments for interconnection studies, but warns a pending PUC proposal for $100,000/MW non‑refundable fees and other pay‑to‑play measures—plus the 75 MW threshold—could disadvantage traditional manufacturers. She urges stability and market predictability, and notes ERCOT work to allow interconnection credit when customers bring local generation as a crucial part of a pragmatic solution to serve rapid load growth while preserving market discipline.

By Texas Energy and Power Newsletter

46 substack.com 2026-04-20 4 min read

Open

Connecting the Regulatory Dots Shaping Texas Energy | Reading and Podcast Picks - April 20, 2026

Why it matters

PUCT Chairman Thomas Gleeson and ERCOT CEO Pablo Vegas told a Texas Senate hearing in April 2026 they are managing roughly 410 GW of load applications — nearly five times today’s ERCOT capacity — and about 90% of those applications are data centers.

Key details

Regulators and the legislature are deploying three major policy tools: a Dispatchable Reliability Reserve Service (DRRS) mandated in H.B.1500 and codified at PURA §39.159(d),(e) to price long‑duration reliability; a deliberate ‘denominator effect’ strategy to grow the load base and spread fixed grid costs; and a first‑of‑its‑kind batching/flexible‑connection interconnection process with strict project maturity criteria, bankable energization timelines, and as‑available service options.
Renewables are creating new rural revenue streams in Texas — despite political pushback (Sen. Lois Kolkhorst: “Nobody wants the renewables…in their districts”) — with San Antonio Express‑News reporting that projects are helping some landowners retain and monetize land.

Brief

Texas energy regulation in spring 2026 is focused on managing rapid, AI‑driven load growth and who will pay for the necessary grid build‑out. A CSIS report by Arushi Sharma Frank frames a regulatory scramble: at an April Senate committee hearing PUCT Chair Thomas Gleeson and ERCOT CEO Pablo Vegas said they were processing ~410 GW of load applications (≈5× current ERCOT capacity), ~90% from data centers. Policymakers are responding with targeted measures — a Dispatchable Reliability Reserve Service (DRRS) (H.B.1500; PURA §39.159(d),(e)) to preserve revenue for gas and long‑duration storage, a ‘denominator effect’ approach to dilute fixed costs by expanding the load base, and a novel batching/flexible connection interconnection regime that enforces project maturity, bankable energization timelines, and as‑available service to accelerate energization. The newsletter also highlights local impacts: San Antonio Express‑News coverage shows renewables providing critical income for rural landowners, even amid political resistance, and flags other trends such as threats to solar and insurance cost pressure from climate risks.

By Texas Energy and Power Newsletter

47 datacenterdynamics.com 2026-04-27 1 min read

Open

Whitepaper: Unlocking stranded electrical capacity for AI infrastructure

Why it matters

AI workloads are increasing power volatility and exposing a gap between secured utility capacity and what facilities can continuously deliver; many sites already hold more electrical capacity than they can deploy (whitepaper word count: 260).

Key details

The DG Matrix whitepaper (published 2026-04-27 by DCD Mission Critical Power) recommends software-defined power—coordinating grid supply, batteries, and distributed resources—to increase usable capacity and unlock additional AI compute without expanding utility connections.

Brief

The DCD Mission Critical Power whitepaper (DG Matrix, published 27 April 2026) argues that conventional peak-driven electrical architectures create stranded capacity as AI workloads raise power volatility. It presents software-defined power methods—coordinating grid supply, battery storage and distributed resources—as practical strategies to raise usable capacity and deploy more AI compute without new utility connections.

By DCD Mission Critical Power

48 substack.com 2026-03-27 9 min read

Open

🍪 TWiC: TurboQuant, AGI CPU, GF Sues Tower, CPU Price Hikes, 400G SiPho, AAOI, ++

Why it matters

TurboQuant (a Google Research KV-cache compression paper first posted on arXiv ~2025) resurfaced in Mar 2026 and triggered a memory/storage selloff—Micron (MU) fell ~25% from highs and SanDisk declined ~8% despite Micron reporting FQ2 revenue of $8.05B and a record 75% gross margin.

Key details

Arm launched its first fully in‑house processor, the Arm AGI CPU (announced Mar 2026): built on TSMC 3nm with 136 Neoverse V3 cores, 300 W TDP; air‑cooled racks hold 8,160 cores (60 CPUs) and liquid‑cooled racks >45,000 cores (>330 CPUs); CEO Rene Haas forecasted $15B in annual AGI revenue within five years and positioned Meta as a lead customer.
Intel and AMD raised CPU prices by roughly 10–15% year‑to‑date (Mar 2026) while lead times stretched from ~2 weeks to up to 6 months for some SKUs, tightening supply as Arm predicts agentic AI will drive ~4× more CPU utilization per deployed GW.
Tower Semiconductor + Coherent demoed 400 Gbps/lane SiPho on Tower’s production process at OFC 2026 (clear 420 Gb/s PAM4 eye) using Coherent InP CW lasers; shortly after GlobalFoundries filed suit against Tower alleging infringement of 11 patents and seeks US import bans—Tower shares fell ~7.5%, GF ~4.6%.

Brief

In the Mar 27, 2026 newsletter, several semiconductor and photonics shifts converged. TurboQuant — Google Research’s KV‑cache compression paper that had been on arXiv for about a year — resurfaced and spooked memory investors, contributing to a sharp pullback even as Micron reported FQ2 revenue of $8.05B, a record 75% gross margin, and FQ3 guidance of $8.8B. Arm unveiled its first in‑house Arm AGI CPU built on TSMC 3nm with 136 Neoverse V3 cores and a 300 W TDP; Arm described rack densities of 8,160 cores (air) and >45,000 cores (liquid) and projected $15B in AGI CPU revenue within five years, with Meta as an anchor customer. Supply stress is visible elsewhere: Intel and AMD raised CPU prices ~10–15% YTD and lead times widened to as much as six months. In photonics, Tower Semiconductor and Coherent demonstrated 400 Gbps/lane (420 Gb/s PAM4) on Tower’s silicon‑photonic process using Coherent InP CW lasers, potentially delaying the need for TFLN modulators; days later GlobalFoundries sued Tower over 11 patents, seeking import bans and prompting share price moves (Tower −7.5%, GF −4.6%). Applied Optoelectronics secured a $53M 800G order (Q2→mid‑Q3) after a $200M+ 1.6T win. Separately, helium spot prices surged over 50% after attacks on Qatar’s Ras Laffan (≈17% LNG capacity hit), spotlighting a material risk for lithography, ion implantation, and leak detection in fabs.

By Vik's Newsletter

49 divenewsletter.com 2026-04-25 5 min read

Open

Weekender: Sudden data center load losses prompt NERC alert, recommendations

Why it matters

NERC issued an alert and recommendations after sudden data center load losses that it described as a series of “widespread and unexpected” customer-initiated reductions.

Key details

The events occurred in 2024 and 2025 and involved instances where 1,000 MW or more dropped off the bulk power system during individual incidents.
The incidents involved data center customers initiating the load reductions, prompting NERC to warn of reliability risk to the bulk power system and to recommend actions to address those risks.

Brief

Sudden load losses at data centers prompted the North American Electric Reliability Corporation (NERC) to issue an alert and recommendations after a series of “widespread and unexpected” customer‑initiated reductions in 2024 and 2025. Utility Dive reports that multiple events saw 1,000 MW or more drop off the bulk power system in single incidents, raising concerns about coordination between large commercial customers and grid operators and the potential for cascading reliability impacts. NERC’s response flags the need for steps to mitigate similar future occurrences; the newsletter highlights the scale (≥1,000 MW per event) and timeframe (2024–2025) but does not detail the specific technical measures NERC recommended.

By Utility Dive

50 e.economist.com 2026-03-27 6 min read

Open

The Climate Issue: Europe must confront its energy problem

Why it matters

Spain has diversified its electricity mix with nuclear, flexible gas and large solar/wind fleets: gas-fired plants set the wholesale electricity price 26% of the time in 2025, down from ~50% in 2019 (source: Ember).

Key details

Italy remains highly gas-dependent: gas plants set the price 85% of the time in 2025, and Italy’s average wholesale price so far in 2026 has been €130/MWh versus Spain’s €47/MWh.
Renewables alone cannot solve northern Europe’s needs because solar is seasonally weak, wind is variable and grid/battery/backup capacity (often gas or future molecules) adds large fixed costs; policymakers must debate how to minimise and allocate those fixed costs as geopolitical shocks (2022 Russia–Ukraine war, 2026 Iran war) recur.

Brief

Europe's energy problem has resurfaced after the 2022 Russia–Ukraine shock and the 2026 Iran war, exposing uneven progress across member states. Spain has made measurable gains — a mix of nuclear, flexible gas plants and expanded solar and wind meant gas-set prices fell to 26% of hours in 2025 (from ~50% in 2019, Ember). By contrast Italy still relies heavily on gas (gas set the price in 85% of hours in 2025) and has averaged €130/MWh wholesale in 2026 to date versus Spain’s €47. The piece argues that expanding renewables helps but cannot be the whole solution: northern winters limit solar output, wind is variable and connecting offshore farms requires costly grids; batteries and backup generation create large fixed costs that policymakers must actively manage and decide who bears.

By The Economist

51 divenewsletter.com 2026-03-27 4 min read

Open

PJM's proposed data center colocation policy drew criticism at FERC from Vistra…

Why it matters

PJM's proposed data center colocation policy drew criticism at FERC from Vistra and the Data Center Coalition, which warned on March 27, 2026 that “even a customer that brings sufficient co-located generation to meet its load cannot avoid curtailment risk.”

Key details

Winter Storm Fern exposed transmission limits and price separation in late winter: some regions saw power prices of “hundreds of dollars per MWh” while neighboring areas experienced negative prices, a gap highlighted by Liza Reed of the Niskanen Center.
Coastal Virginia Offshore Wind (CVOW) began delivering power with a single 14.7-MW turbine online now toward a 2.6-GW planned project; PJM transmission upgrades are required before the full fleet can be deliverable.
FERC reports about 50 GW of data center capacity online at the end of 2025, with the Midcontinent ISO (MISO) posting the strongest growth — a 43% annual growth rate since 2020 — and over one-third of data center developers now planning 100% onsite power by 2030.

Brief

Utility Dive's March 27, 2026 briefing covers grid stress, policy debate and rapid data center growth. At FERC, PJM's data-center colocation proposal was sharply criticized by Vistra and the Data Center Coalition, which warned colocated onsite generation still faces curtailment risk. Winter Storm Fern revealed interregional transmission shortfalls and extreme price divergence — some hubs spiking into the “hundreds of dollars per MWh” while adjacent areas saw negative prices, underscoring calls for expanded interregional transfer capacity. Offshore, Coastal Virginia Offshore Wind has started producing electricity from a single 14.7-MW turbine as part of a 2.6-GW buildout, though PJM transmission upgrades are needed for full deliverability. Meanwhile FERC data show roughly 50 GW of data center capacity online by end-2025, with MISO growing fastest (≈43% annual growth since 2020) and a notable industry shift toward onsite power (≈>33% of developers target 100% onsite by 2030).

By Utility Dive

52 substack.com 2026-04-19 5 min read

Open

Weekly|AI Infra, SpaceX Part 2, ASML & TSMC Earnings, TMTB AMA Next Monday, PLTR, AMZN Acquiring GSAT

Why it matters

TSMC (1Q26) raised full‑year 2026 USD revenue growth guidance to >30% YoY and is planning 2026 CapEx at the high end of $52–$56 billion, citing Agentic AI demand (coverage Apr 17, 2026).

Key details

ASML (1Q26) revised 2026 revenue guidance up to €36–40 billion (from €34–39bn) and flagged a potential 2027 EUV supply shortage, underscoring capacity limits in the semicap upcycle (coverage Apr 17, 2026).
Anthropic’s ARR reportedly jumped from >$9 billion at end‑2025 to >$30 billion in April 2026, signaling much faster monetization of agentic AI than street models anticipated and shifting bottlenecks to supply side elements (nodes, memory, CPU, interconnect).
FundaAI’s AI Infra 2026 deep dive models TPU networking shifts — OCS scale‑up port ratios moving from ~1.5:1 to 2:1–10:1 and scale‑out from ~0.2:1 to 1:1 — finds TPU rack BOM TCO >2× better than GB200, and concludes networking/DCN (3D→4D→6D torus roadmap) is now the binding constraint; the report became FundaAI’s most‑read in three days.

Brief

FundaAI’s weekly briefing synthesizes a catalyst‑heavy week in which Agentic AI demand materially repriced the hardware and space stacks. Key reported numbers: TSMC raised 2026 revenue growth guidance to >30% YoY with 2026 CapEx at the high end of $52–$56bn; ASML lifted 2026 revenue to €36–40bn and warned of a 2027 EUV shortage; Anthropic’s ARR climbed from >$9bn at end‑2025 to >$30bn in April 2026. Using bottom‑up SOTP and TPU rack BOM analysis, FundaAI models network port‑ratio shifts (OCS scale‑up ~1.5:1 → 2:1–10:1; scale‑out ~0.2:1 → 1:1) and finds TPU TCO >2× better than GB200, concluding that DCN/interconnect — not raw FLOPS — is the immediate bottleneck. Implications: expanded TAM for optics, transceivers, memory and semicap equipment, and major near‑term catalysts including earnings season and Google Cloud Next (Apr 22–24, 2026).

By FundaAI

53 Twitter/X 2026-05-04 1 min read

Open

Futurum Group and NVIDIA frame AI as a five-layer stack

Why it matters

Futurum Group and NVIDIA frame AI as a five-layer stack: energy, chips, infrastructure, models, and applications.

Key details

The five largest U.S. hyperscalers are projected to spend up to $690B on infrastructure in 2026 (nearly double 2025); energy and cooling have overtaken silicon as the primary bottleneck.
Inference on NVIDIA Blackwell is roughly 35× cheaper per million tokens than on Hopper, but aggregate compute demand keeps rising because reasoning models and agentic workflows consume far more tokens; the build-out is creating high demand for electricians, HVAC technicians, steelworkers, and grid engineers (many earning six-figure salaries) and highlighting national sovereignty gaps for countries without energy, fabs, or domestic models.

Brief

Futurum Group and NVIDIA's report frames AI as a five-layer stack—energy, chips, infrastructure, models, applications—and warns the five largest U.S. hyperscalers could spend up to $690B on infrastructure in 2026 (nearly double 2025). Energy and cooling now outpace silicon as the main bottleneck; Blackwell inference is ~35× cheaper than Hopper, even as token-heavy reasoning and agents drive rising compute needs and reshape workforces and national sovereignty.

By @kimmonismus

54 datacenterdynamics.com 2026-05-05 1 min read

Open

Spain’s largest campus: New 100 MW roadmap

Why it matters

Nabiax’s ADC Madrid campus is planned as a 100 MW, AI-ready facility designed to support up to 100% liquid cooling with a target delivery in Q4 2027.

Key details

The campus includes secured grid access, on-site substations and live construction sequencing intended to remove power and permitting bottlenecks for hyperscalers expanding in Spain (DCD whitepaper, published 5 May 2026).

Brief

Nabiax’s ADC campus in Madrid outlines a 100 MW, AI-ready data center designed for up to 100% liquid cooling and scheduled for Q4 2027 delivery. The DCD whitepaper (5 May 2026) emphasizes secured grid access, on-site substations and live construction to accelerate deployment and address power/permitting constraints for hyperscalers in Spain.

By DCD Data Center Construction Channel

55 divenewsletter.com 2026-03-28 4 min read

Open

Weekender: Bechtel, Kiewit tapped for Japan-backed $33B Ohio power generation project

Why it matters

Bechtel and Kiewit were selected as lead contractors for a Japan-backed, $33 billion power-generation project in Pike County, Ohio

Key details

The investment—first announced in October—will fund construction of about 10 gigawatts of new energy capacity intended to power a forthcoming data center
The project is expected to pour billions into Pike County and was reported by Construction Dive in its Weekender on March 28, 2026

Brief

Bechtel and Kiewit were tapped as lead builders for a Japan-backed, $33 billion power-generation buildout in Pike County, Ohio, aimed at supplying roughly 10 gigawatts of new capacity for a forthcoming data center. First announced in October, the investment will channel billions into local infrastructure and construction work to create the generation capacity needed to serve hyperscale computing demand. Reported in Construction Dive's Weekender on March 28, 2026, the assignment positions two major U.S. contractors on a large-scale energy project tied directly to data‑center development and highlights the scale of private international capital flowing into U.S. power and data‑infrastructure projects.

By Construction Dive

56 Twitter/X 2026-05-04 1 min read

Open

NERC issued an 'essential action' Level 3 alert on 2026-05-04 identifying…

Why it matters

NERC issued an 'essential action' Level 3 alert on 2026-05-04 identifying data-center loads as a grid risk; Texas data centers are singled out for causing instabilities and will likely require new reliability standards.

Key details

Of the three Level 3 alerts NERC has issued in the past few years, one (last year, 2025) targeted inverter-based resources (wind and solar); in Texas these have caused mass instabilities because developers often configure farms cheaply without key electronic controls.
Latitude Media frames the Level 3 alert as a signal that NERC is moving toward formal reliability standards because the AI-driven power boom from data centers is outpacing grid infrastructure.

Brief

Fred Stafford (@fredstaffordcs) warns that NERC's May 4, 2026 'essential action' Level 3 alert targets data-center loads—especially in Texas where data centers have destabilized the grid and need new standards. He reminds readers that one of three recent Level 3 alerts (last year) concerned inverter-based wind and solar, which have caused mass instabilities when built without key electronic controls; Latitude Media links the alert to AI-driven power demand.

By @fredstaffordcs

57 datacenterdynamics.com 2026-04-01 3 min read

Open

ByteDance (TikTok owner) has reportedly pulled out of plans to lease additional…

Why it matters

ByteDance (TikTok owner) has reportedly pulled out of plans to lease additional capacity in Ireland and will not take a second data‑centre there (report published 1 April 2026).

Key details

Nebius announced plans for a 310 MW AI data centre campus in Lappeenranta, Finland, with initial capacity expected to be live in 2027.
Swedish IT firm Binero acquired a leased data centre from E.On located outside Stockholm; financial terms of the transaction were not disclosed.
Other industry moves flagged in the 1 April 2026 DCD EMEA newsletter: CoreWeave secured an $8.5 billion loan for GPU purchases, Hyperscale Data bought 48.5 acres in southwest Michigan for campus expansion, and TierPoint CEO Jerry Kent is stepping down.

Brief

The DCD EMEA weekly review (1 April 2026) highlights shifting demand and fresh supply moves across Europe and beyond. ByteDance reportedly cancelled plans to lease a second Irish data centre, removing a high‑profile hyperscaler demand signal for the market. On the supply side, Nebius has committed to a large‑scale 310 MW AI data centre in Lappeenranta, Finland, targeting initial capacity in 2027 — a project aimed squarely at high‑density AI workloads. In Sweden, Binero has acquired a leased E.On facility outside Stockholm (terms undisclosed), reflecting continued consolidation of regional operator assets. The newsletter also flags major financing and land‑acquisition activity — CoreWeave’s $8.5bn GPU loan and Hyperscale Data’s 48.5‑acre Michigan purchase — and includes industry content on rising rack densities and waterless liquid‑cooling approaches for AI factories.

By DCD Europe, Middle East & Africa Newsletter

58 substack.com 2026-04-01 14 min read

Open

The Industrial Resilience Stack

Why it matters

Published Apr 1, 2026 by Crucible Capital, the piece defines an “Industrial Resilience Stack” across five layers: Dispatch & Optimization, Verification & Ground Truth, Financial Infrastructure, Materials & Supply, and Defense & Infrastructure Security.

Key details

Urgency is driven by AI deployment pressures: GPU rack densities approaching ~1 MW, EIA-reported 18 GW of battery storage installed in 2025, and >90 GW of U.S. datacenter capacity in the pipeline — with the interconnection queue estimated to cost the industry over $1 trillion in delayed value at $3–4M per MW/year in lost revenue.
Crucible highlights portfolio and target companies that deliver ground truth and operating layers: Shatterdome (battery dispatch), Mercury Computing (non-firm utility capacity pilots with Duke Energy and Dominion covering ~30% of US datacenter market), Aravolta (GPU utilization/depreciation monitoring), Reliability Engine (chemical/thermal monitoring), Matium and Pillar (commodity trade finance/hedging), Supra (onshore gallium/scandium recovery using ion-specific supramolecular polymers), and Hextronics (persistent autonomous aerial ISR).
Financial fragility examples: Blue Owl gated redemptions after investors pulled >15% of net assets and secondary buyers bid at 20–35% haircuts — demonstrating lenders need better real‑time collateral valuation tools for GPU-backed loans.

Brief

Crucible Capital argues that the AI-era buildout has exposed decades of deferred industrial maintenance and just-in-time fragility, creating an investment opportunity they call the Industrial Resilience Stack. The fund maps five concrete layers where durable value accrues: (1) Dispatch & Optimization — software/market layers that operate batteries, grid and compute more efficiently (e.g., Shatterdome, Mercury Computing, which runs pilots with Duke and Dominion); (2) Verification & Ground Truth — real‑time telemetry and verified asset valuation for GPUs and cooling (e.g., Aravolta, Reliability Engine); (3) Financial Infrastructure — trade finance, hedging, and transaction ledgers for industrial materials (Matium, Pillar); (4) Materials & Supply — onshoring and new processing tech for critical minerals (Supra’s ion‑specific supramolecular receptors for gallium/scandium, TRISO fuel fabrication, laser‑infused copper/aluminum); and (5) Defense & Infrastructure Security — OT/IT convergence, persistent ISR (Hextronics/Hex) and AI-native forensics. Crucible backs companies that produce ground truth — verified pricing, provenance, and asset condition — because they believe execution commoditizes while verification gains value (citing Christian Catalini’s economics of AGI). The memo uses concrete metrics (1 MW+ rack densities, 18 GW battery installs in 2025, >90 GW pipeline, $3–4M/MW-year lost revenue, DoD 6% supplier visibility, 30% rise in cyberattacks in 2024) to argue the maintenance bill is imminent and that startups embedding into verification, finance, materials, and security will capture durable moats.

By Crucible Capital

59 divenewsletter.com 2026-03-28 6 min read

Open

Weekender: PJM data center colocation plan takes fire from Vistra, data center group, others

Why it matters

Vistra, the Data Center Coalition and other industry parties formally criticized PJM’s data center colocation proposal in filings with the Federal Energy Regulatory Commission (FERC), arguing the design exposes colocated customers to curtailment risk.

Key details

The Data Center Coalition told FERC: “Even a customer that brings sufficient co‑located generation to meet its load cannot avoid curtailment risk,” adding “It is unclear why a customer would pursue this pathway at all.”
The dispute arrives as data center demand is rising rapidly — FERC reported about 50 GW of data center load online at the end of 2025, with MISO experiencing ~43% annual growth in data center capacity since 2020 — increasing the stakes for market and interconnection rules.

Brief

PJM’s proposal to create a colocation pathway for data centers has drawn sharp objections from generators and industry groups — notably Vistra and the Data Center Coalition — in comments filed with the Federal Energy Regulatory Commission. Critics say the proposal still exposes colocated customers to curtailment even when they bring on-site generation sized to meet their load, undermining the commercial value of the pathway and potentially deterring participation. That challenge matters as data center capacity has surged (FERC counted ~50 GW online by end of 2025, with the Midcontinent ISO seeing ~43% annual growth since 2020), creating urgent pressure on market rules, interconnection processes and reliability planning as operators decide how to allocate curtailment and reliability risk for large, flexible loads.

By Utility Dive

60 datacenterdynamics.com 2026-04-02 1 min read

Open

Power availability is becoming the defining constraint on digital growth

Why it matters

DCD Mission Critical Power (Data Centre Dynamics) published the 316-word Critical Power Supplement on 2026-04-02, warning that power availability is becoming the primary constraint on digital infrastructure growth due to rising demand from AI, cloud, and high-performance workloads.

Key details

The supplement highlights technology shifts: solid-state UPS and advanced UPS architectures are being adopted to improve efficiency, flexibility, and reliability at scale.
Operators and energy providers are prioritizing distributed energy (microgrids, on-site generation), grid-interactive and hybrid electrical architectures, modular infrastructure, and intelligent energy management to support higher-density AI workloads, faster deployment timelines, and long-term resilience.

Brief

The DCD Critical Power Supplement (published 2 April 2026) argues that escalating AI, cloud and HPC demand is making power availability the defining limit on digital growth. It surveys technical responses — solid-state and advanced UPS designs, grid-interactive and hybrid electrical architectures, on-site generation, microgrids, modular infrastructure and intelligent energy management — to enable higher densities, faster 'speed to power' and greater resilience.

By DCD Mission Critical Power

61 divenewsletter.com 2026-04-29 5 min read

Open

PJM reopened its interconnection queue (first cycle since being effectively…

Why it matters

PJM reopened its interconnection queue (first cycle since being effectively closed in 2022) with more than 800 projects totaling ~220 GW seeking connection; gas-fired generation leads the list with ~106 GW.

Key details

MISO’s summer capacity auction saw offered capacity rise 3.4% year‑over‑year to 141 GW (up from 136.3 GW), as new supply—partly solar additions—outpaced demand growth and pushed capacity prices lower.
CAISO and inaugural EDAM participant PacifiCorp report readiness for the Western day‑ahead market after 90 days of parallel operations; PacifiCorp’s Mike Wilding said the market is 'working as intended.'
TVA and Plus Power will build the Crawfish Creek battery: a 200 MW / 800 MWh BESS in Alabama, one of TVA’s first grid‑scale storage projects and part of its plan to deploy up to 1.5 GW of storage by 2029.

Brief

Utility Dive’s April 29, 2026 Daily Dive highlights major grid planning and market developments: PJM reopened its revamped interconnection queue (first cycle since 2022) with >800 projects totaling about 220 GW, led by roughly 106 GW of proposed gas‑fired generation. In the Midcontinent, MISO’s summer auction showed offered capacity climbing 3.4% YOY to 141 GW (from 136.3 GW), a supply surge—aided by solar additions—that has driven capacity prices down. On the West Coast, CAISO and PacifiCorp report confidence in EDAM’s imminent launch after 90 days of parallel operations validating the day‑ahead market design. Separately, TVA and Plus Power announced the Crawfish Creek 200 MW / 800 MWh battery project in Alabama, supporting TVA’s target of up to 1.5 GW of storage by 2029. An opinion piece warns an $80B PJM capacity bill risks grid strain without faster replacement generation, grid‑enhancing tech and permitting reform.

By Utility Dive

62 divenewsletter.com 2026-05-05 5 min read

Open

NERC on May 5, 2026 issued a Level 3 alert saying computational/data center loads…

Why it matters

NERC on May 5, 2026 issued a Level 3 alert saying computational/data center loads pose “immediate risks” to reliability and ordered certain registered entities to complete seven specified actions by Aug. 3, 2026 to address data center load losses.

Key details

The Pennsylvania House on May 5, 2026 unanimously passed an advanced transmission technology (ATT) bill that could require utilities including PPL Electric, PECO Energy and FirstEnergy to integrate ATTs into proposed projects; similar laws now exist in at least nine states.
California’s Department of Justice has subpoenaed Golden State Wind as part of an investigation into offshore wind lease buyouts tied to the Trump lease deal, with the California Energy Commission warning it expects litigation over potential legal violations.
Dominion Energy said its 2.6-GW Coastal Virginia Offshore Wind (CVOW) began producing some electricity in March 2026, is expected fully online by 2027, could yield roughly $5 billion in fuel savings over 10 years, while the company reported a 67% jump in fuel and energy-related costs in Q1.

Brief

NERC's Level 3 alert (published May 5, 2026) highlights computational and data center loads as immediate reliability risks and requires certain transmission and distribution participants to complete seven defined actions by Aug. 3, 2026 to mitigate load-loss impacts. The newsletter also reports policy movement in Pennsylvania, where the state House unanimously approved an advanced transmission technology bill that could obligate utilities such as PPL Electric, PECO and FirstEnergy to deploy ATTs — part of a wave of similar laws across at least nine states. In California, the DOJ has subpoenaed Golden State Wind amid an investigation and anticipated litigation over offshore lease buyouts tied to the Trump lease deal. Corporate items include Schneider Electric attributing revenue growth partly to the AI boom, and Dominion Energy saying its 2.6‑GW Coastal Virginia Offshore Wind began producing in March, should be fully operational by 2027, and may save about $5 billion in fuel over a decade.

By Utility Dive

63 substack.com 2026-04-30 3 min read

Open

Alphabet Q1 2026. Enterprise AI demand Now The Primary Growth Driver For Google Cloud,

Why it matters

On April 29, 2026 Alphabet reported Q1 revenue of $109.9 billion, up 22% year‑over‑year from $90.2 billion (19% in constant currency), beating the $107.2 billion Street consensus by $2.7 billion.

Key details

Net income surged 81% to $62.6 billion and EPS hit $5.11 versus a $2.62 analyst consensus; shares rose more than 7% in after‑hours trading.
Google Cloud (described as Alphabet's 'AI conversion engine') generated $20.2 billion in revenue, up 63% YoY and above the $18.05 billion Street target; CEO Sundar Pichai said the business is compute‑constrained and enterprise AI demand is now Cloud’s primary growth driver.
The company reported its 11th consecutive quarter of double‑digit revenue growth, backlog nearly doubled sequentially to $462 billion, and operating margin expanded to 36.1%.

Brief

Alphabet's Q1 2026 results, reported April 29, were unambiguously strong: revenue was $109.9 billion (22% YoY, 19% in constant currency) and net income rose 81% to $62.6 billion, with EPS of $5.11 versus a $2.62 consensus. Google Cloud posted $20.2 billion in revenue, growing 63% YoY and beating the $18.05 billion Street target; management characterized Cloud as compute‑constrained and said enterprise AI demand has become the primary growth driver. The quarter marked Alphabet's 11th consecutive double‑digit revenue growth quarter, backlog nearly doubled sequentially to $462 billion, and operating margin expanded to 36.1%, prompting a >7% after‑hours share pop.

By William Martin Keating from Semicon Alpha

64 divenewsletter.com 2026-03-26 5 min read

Open

FERC reports 50 GW of data center capacity was online in the U.S. at the end of…

Why it matters

FERC reports 50 GW of data center capacity was online in the U.S. at the end of 2025, with the Midcontinent Independent System Operator (MISO) region posting the strongest growth—about 43% annual growth in data center capacity since 2020; ERCOT, SPP and Southeast regions also grew rapidly.

Key details

Over one-third of data center developers now plan to meet 100% of site power onsite by 2030, signaling a shift toward onsite generation and resilience strategies.
State legislation in Michigan and New York would restrict utility ownership of participating distributed energy resources (DERs) in virtual power plants (VPPs) and require reasonable access for third‑party aggregators.
DOE issued a $50 million funding opportunity for tribal energy projects that scores proposals on factors including how much they support the supply of firm, reliable power; meanwhile the EIA reported utility‑scale solar and wind reached a record 17% of U.S. net generation in 2025 (19% when small‑scale solar is included).

Brief

Data center growth and grid policy headlines dominated Utility Dive’s March 26, 2026 Daily Dive. FERC found 50 GW of data center capacity online at the end of 2025, with MISO exhibiting roughly 43% annual growth since 2020 and rapid increases also in ERCOT, SPP and the Southeast—pressuring transmission and local planning. More than one‑third of developers now expect 100% onsite power by 2030. At the state level, Michigan and New York lawmakers are advancing virtual power plant bills that would bar utility ownership of participating DERs and require reasonable access for third‑party aggregators. The DOE opened a $50 million tribal energy funding opportunity prioritizing projects that supply firm, reliable power. Separately, the EIA reported utility‑scale solar and wind hit a record 17% of U.S. generation in 2025 (19% including small‑scale solar).

By Utility Dive

65 datacenterdynamics.com 2026-05-01 1 min read

Open

Whitepaper: The impact of AI on data center design

Why it matters

Cadence whitepaper published 1 May 2026 on Data Centre Dynamics identifies AI-driven pressures — higher rack densities, rapid power fluctuations, cooling constraints and limited capacity — that increase operational risk and threaten uptime.

Key details

Recommended technical responses include digital twins for planning, commissioning and operations, adoption of liquid cooling for high-density racks, and smarter infrastructure planning to reduce stranded capacity and maximise efficiency.
The 300-word DCD Compute, Storage & Networking Channel whitepaper is available for download and focuses on future-proofing facilities to accommodate next-generation AI growth.

Brief

Cadence's whitepaper (published 1 May 2026 on Data Centre Dynamics) warns that AI workloads are forcing higher rack densities, rapid power swings, and cooling and capacity shortfalls that risk uptime. It advocates digital twins for planning/commissioning/operations, liquid cooling for dense racks, and smarter infrastructure planning to cut stranded capacity and improve efficiency.

By DCD Compute, Storage & Networking Channel

66 YouTube 2024-04-11 Video

Open

Energy Market update: Great Britain - Transmission (Wendel Hortop: Modo Energy)

Why it matters

Modo Energy’s GB index (average battery revenue annualized to £/MW/yr) fell to ~£27,000/MW/yr Nov–Feb (excluding Capacity Market); with typical Capacity Market add-ons (~£13,000/MW/yr on average) some contracted assets saw ~£40,000/MW/yr over the same period.

Key details

Frequency response went from >90% of battery revenue historically to as little as 5–10% of revenue from November onward; batteries are now earning most of their income from wholesale trading and the Balancing Mechanism (BM).
GB grid has ~3.9 GW of battery storage online today, with another 2–3 GW likely this year and industry forecasts suggesting >10 GW by 2027; system peak demand averaged ~37 GW this winter (peak <40 GW), ~10% below levels a few years earlier.
Wholesale intraday spreads this winter averaged ~£60–70/MWh (a roughly one-third drop in volatility versus recent winters), and the highest wholesale price seen this winter was ~£250/MWh—reducing arbitrage opportunities for batteries.

Brief

Wendell Hortop (GB Market Lead, Modo Energy) and host Quenton presented a Transmission Today interview-style market update focused on what happened to battery storage revenues in Great Britain over the recent winter and what it implies for the year ahead. Modo’s index (annualized revenue per MW) averaged about £27,000/MW/yr from November to February excluding Capacity Market payments; adding typical Capacity Market value (≈£13,000/MW/yr on average) pushed some assets toward ~£40,000/MW/yr. Those figures contrast with 2023’s ~£57,000/MW/yr and 2022’s ~£153,000/MW/yr—showing a steep drop driven by lower wholesale volatility, reduced peak prices, and growing battery supply (≈3.9 GW online today with 2–3 GW expected this year and >10 GW plausible by 2027).

Hortop attributed the revenue squeeze to three interacting forces: saturation of frequency response (its share fell from >90% historically to ~5–10% this winter), a materially smaller intraday spread (≈£60–70/MWh average this winter, about a third of prior winters), and lower system peak demand (average peak ~37 GW). Interconnectors (≈9.2 GW total, including the 1.4 GW Viking Link) and returning French nuclear and Norwegian hydro suppressed UK peak prices. Operational practice in the control room amplified the problem—batteries were often ‘skipped’ in favour of single, simpler-to-dispatch units (e.g., Dinorwig or pumped hydro). Positive developments include the Open Balancing Platform (launched December), introduction of bulk dispatch, and the March change increasing the dispatch visibility from 15 to 30 minutes. The long-term aim is no fixed‑duration rule: dispatch decisions driven by real‑time state‑of‑charge and algorithmic least‑cost selection. Hortop’s outlook: winter hurt revenues but the Balancing Mechanism should become the main growth channel for battery value as more capacity joins the BM, and summer negative‑price events could restore arbitrage spreads—while asset returns will continue to depend heavily on location and contract terms.

By Modo Energy

67 wordpress.com 2026-05-01 7 min read

Open

What I Learned about Hyperscalers’ AI Spend

Why it matters

Microsoft, Meta, Amazon and Alphabet told investors they will spend roughly $705–710 billion in capex in 2026 (article cites ~700B), nearly double combined 2025 spend (~$370B); three raised 2026 guidance while Amazon held a prior $200B forecast.

Key details

Company-level 2026 guides: Meta $125–145B (from $72B in 2025, ~+85% mid), Microsoft ~$190B (from ~$118B, ~+61%; Microsoft says ~$25B of that is component price inflation), Alphabet $180–190B (from $91B, ~+102%), Amazon ~$200B (from $88B TTM, ~+60%).
Quarterly capex jumped: Meta Q1 capex 13→20B, Alphabet 17→36B, Microsoft cash capex Q3 FY26 17→31B, Amazon ~25→44B; two‑thirds of Microsoft’s recent capex went to short‑lived assets (GPUs/CPUs) that depreciate in ~3–5 years, pressuring operating margins.
Financial engineering and off‑balance‑sheet commitments are large: Moody’s noted ~$662B in signed-but-not-commenced data center lease commitments; Meta’s contractual commitments rose by ~$107B in one quarter (implied ~190B at Mar 31, 2026); Alphabet disclosed $232.7B non‑cancelable commitments and a $467.6B backlog (≈55% expected to convert to revenue in 24 months).

Brief

Hyperscalers are dramatically accelerating AI‑driven infrastructure spend: Microsoft, Meta, Amazon and Alphabet collectively guided roughly $700–710B of 2026 capex (about double 2025’s ~$370B), with Microsoft (~$190B), Meta (125–145B), Alphabet (180–190B) and Amazon (~$200B) leading. Much of the increase reflects component price inflation (Microsoft cites ~$25B) and capacity additions; Q1/Q3 year‑over‑year quarter moves show capex roughly doubling (e.g., Alphabet Q1 2025→Q1 2026: $17B→$36B). Two‑thirds of some recent spend is on short‑lived GPUs/CPUs, accelerating depreciation and squeezing free cash flow (Amazon TTM FCF collapsed to ~$1B; Microsoft and Alphabet FCF down 22% and 38%). The picture is clouded by non‑cash equity gains (Alphabet $36.8B, Amazon $16.8B, Microsoft $5.9B) and large off‑balance‑sheet commitments (Moody’s ~$662B leases, Meta +$107B commitments in one quarter), revealing significant prepayments, SPV funding and financial engineering alongside real AI engineering investment.

By On my Om

68 substack.com 2026-05-02 15 min read

Open

Cerebras Deep Dive

Why it matters

Cerebras Wafer-Scale Engine 3 (WSE3) is built on TSMC 5nm with ~4 trillion transistors, ~900,000 AI compute cores, 44 GB on-wafer SRAM and ~21 PB/s memory bandwidth — the company claims up to 15x faster inference vs an Nvidia B200.

Key details

Disaggregated inference partnership with AWS (binding term sheet) will co-design a solution pairing Cerebras CS-3 wafers with AWS Trainium3; Cerebras projects ~5x more token throughput in the same footprint and up to 15x speed improvements on leading open-source models; AWS will expose the capability via Amazon Bedrock.
OpenAI Master Resale Agreement signed 24 Dec 2025; deliveries began 23 Jan 2026 and OpenAI’s Codex‑Spark on Cerebras went public 12 Feb 2026. OpenAI committed 750 MW of Cerebras inference capacity (~$20B total, ≈$6.7B/year) with an option to expand to 2 GW by 2030; Cerebras received a $1B working-capital loan and issued OpenAI a 33.5M-share warrant tied to purchase milestones.
Practical limits and economics: a 1T-parameter model requires ~45 CS-3 wafers (SRAM capacity), Cerebras can cluster up to 2,048 wafers; networking 45 CS-3 nodes can cost ~$100M+ while a single Nvidia GB200 rack (≈$3.5M) holds >6 similar models — Cerebras nodes are estimated $2–3M each and draw up to 27 kW.

Brief

Cerebras builds wafer-scale AI accelerators (WSE3) that keep large SRAM on‑chip to attack the memory‑bandwidth bottleneck in autoregressive decoding. The WSE3 — a TSMC 5nm wafer with ~4 trillion transistors and ~900k AI cores — exposes 44 GB of SRAM and ~21 PB/s bandwidth; Cerebras cites up to 15× latency improvements versus an Nvidia B200 for inference and extreme speedups (authors claim >1,000× in niche workloads). To address SRAM capacity limits, Cerebras clusters wafers (≈45 CS‑3s to host a 1T‑parameter model; theoretical max 2,048 wafers) and implements manufacturing redundancy to route around defects, at high capital and power cost (node ~$2–3M, ~27 kW).

Commercially the company has two anchor partnerships: a binding AWS term sheet to deliver a disaggregated inference stack pairing CS‑3 with Trainium3 (targeting ~5× token throughput in the same footprint and AWS Bedrock availability), and an OpenAI MRA signed 24 Dec 2025 (deliveries from 23 Jan 2026; Codex‑Spark public 12 Feb 2026). OpenAI committed 750 MW (~$20B over 3 years, ≈$6.7B/yr) with expansion optionality to 2 GW; financing includes a $1B loan and a 33.5M‑share warrant. The article flags two structural weaknesses — SRAM capacity and very high cost per deployment — and positions Cerebras as a low‑latency, premium offering for customers who pay for speed, while noting current modest scale (Abu Dhabi revenues), ~40% gross margin, and ~‑40% EBIT driven by R&D and equity compensation.

By Tech Investments

69 Twitter/X 2026-04-26 1 min read

Open

The U.S. is the world's largest uranium consumer but produced just 677,000 lb…

Why it matters

The U.S. is the world's largest uranium consumer but produced just 677,000 lb U3O8 in 2024 (≈1.4% of reactor requirements) and less than 1% of total needs; in 2023 foreign suppliers covered ~95% of U.S. reactor purchases (Canada, Kazakhstan, Australia, Russia, Uzbekistan).

Key details

Critical processing is a single-point-of-failure: 7 mining operations across 3 states, only 1 conventional ore mill, 1 conversion facility, and 1 LEU enrichment plant (the enrichment plant is foreign-owned); there is no commercial-scale HALEU enrichment in the U.S. (HALEU production currently limited to Russia and China).
Outlook and policy: domestic production is projected to reach 4M+ lb U3O8 by 2030 (≈6–7× 2024 output) but remains small versus the 433M lb needed from 2025–2035; Congress committed $2.7B to rebuild the domestic uranium and fuel supply chain, and the gap is structural—rebuilding will take decades (USGS Fact Sheet 2025-3057, Apr 2026).

Brief

U.S. uranium supply chain is heavily import-dependent: in 2023 foreign suppliers met ~95% of reactor purchases, while domestic mines produced 677,000 lb U3O8 in 2024 (~1.4% of demand). Critical processing and enrichment capacity is singular and partly foreign-owned, there is no commercial HALEU in the U.S., and a projected rise to 4M+ lb by 2030 still falls far short of the 433M lb needed from 2025–2035; Congress has pledged $2.7B but rebuilding will take decades (USGS Apr 2026).

By @da_sails

70 substack.com 2026-04-30 2 min read

Open

Review | GOOG / MSFT / AWS: CapEx Raised — 2026 Finally Becomes the Year of Cloud Acceleration

Why it matters

Google Cloud (GCP) revenue grew +63% YoY in 1Q26 and GCP backlog reached $460B, nearly doubling quarter-over-quarter.

Key details

Google raised 2026 CapEx guidance from $175–185B to $180–190B and said 2027 CapEx will be meaningfully higher, citing near-term compute constraints that limited Cloud revenue.
At-scale TPU monetization began in 1Q26 with TPU hardware deliveries into select customers' data centers; Google pairs custom TPUs and its Axion CPU with NVIDIA GPUs (including the Vera Rubin NVL72) in its compute portfolio.
Gemini Enterprise paid MAUs grew +40% QoQ; management expects AI improvements (better intent understanding) to expand ad coverage and monetize longer, more complex queries.

Brief

Google’s 1Q26 update shows AI-driven cloud acceleration: GCP revenue rose +63% YoY and backlog jumped to $460B (nearly doubled QoQ). Management lifted 2026 CapEx guidance to $180–190B (from $175–185B) and signaled 2027 CapEx will be meaningfully higher, saying the business is currently compute-constrained and that some Cloud revenue was lost for lack of capacity. At-scale TPU monetization has begun in 1Q26 with deliveries into customer data centers; Google is offering its custom TPUs alongside the Axion CPU and NVIDIA GPUs (including the Vera Rubin NVL72). Gemini Enterprise paid MAUs increased +40% QoQ, and management believes AI will expand ad monetization for longer, more complex queries. (AWS/MSFT detail in the post was behind a paywall.)

By FundaAI

71 Twitter/X 2026-02-26 2 min read

Open

Google is planning an 850 MW data center in Texas powered by AES Clean Energy’s…

Why it matters

Google is planning an 850 MW data center in Texas powered by AES Clean Energy’s co-located renewables, per an AES December filing that ties the site to a project with 600 MW of solar and 945 MW of wind.

Key details

A recent Texas rule change lets a co-located load piggyback on a generation project's interconnection agreement; Google appears to be using AES’ interconnection to bypass ERCOT’s large-load queue and reach service in ~18 months.
ERCOT’s large-load queue totaled 225 GW at the end of 2025; Google’s approach (wind phase expected online August 2027) matches the faster timelines Amazon and Meta have sought by building on-site gas plants, but uses mostly renewables instead of fossil fuel generation.
Cleanview-sourced documents referenced by the author claim this arrangement enables near round‑the‑clock clean energy with grid backup when output drops and that building the same data center in Virginia could have pushed commissioning to the early 2030s.

Brief

Google’s new 850 MW Texas data center will be powered via a co‑located AES Clean Energy buildout that, according to an AES filing in December, comprises 600 MW of solar and 945 MW of wind. Thanks to a recent Texas rule change allowing a co‑located load to use a generation project’s interconnection, Google can reportedly piggyback on AES’ grid connection and avoid ERCOT’s huge large‑load queue (225 GW at end‑of‑2025), cutting deployment time to about 18 months. AES’s wind phase is expected online August 2027. The author contrasts this path with Amazon and Meta’s move to build on‑site gas plants to meet fast timelines, arguing Google demonstrates you can achieve similar speed and near round‑the‑clock clean power by combining co‑located renewables with grid reliability; the claim is based on Cleanview‑sourced documents and the AES filing.

By @curious_founder

72 substack.com 2026-03-30 6 min read

Open

DeepSeek Part Two: Why AI Skeptics Are Wrong. 10 Stocks That Will Win.

Why it matters

Tae Kim (Key Context) wrote on 2026-03-30 that current market angst (Iran war, “peak AI capex”) echoes last year’s tariff/DeepSeek panic and is likely short‑lived as AI inference demand surges.

Key details

Nvidia chief scientist Bill Dally said ~90% of data‑center power is now going to inference; GTC conversations with Meta/Google/Nvidia staff indicated a severe inference compute shortage despite hyperscalers’ “hundreds of billions” in 2026 capex.
Arm CEO René Haas estimates agentic AI increases token demand ~15× and says CPU orchestration needs per GW have quadrupled from ~30 million cores to ~120 million cores; Intel’s CFO is negotiating 3–5 year supply agreements with hyperscalers.
Nvidia is positioned to benefit: networking revenue grew 263% YoY, the company acquired Groq to target ultra‑low‑latency inference, Vera Rubin GPUs are due later in 2026, and ecosystem advances (Anthropic’s rumored “Mythos,” OpenAI model work, Jeff Dean on larger context windows) will drive more compute. Logan Bartlett estimates U.S. knowledge work as a $6.2 trillion addressable market for AI agents.

Brief

Tae Kim argues (Key Context, published 2026-03-30) that short‑term macro fears — from the Iran war to “peak AI capex” talk — obscure a structural pivot: inference is now the dominant workload. Citing Bill Dally and on‑the‑ground GTC conversations with Meta/Google/Nvidia engineers, Kim reports ~90% of data‑center power is going to inference and hyperscalers’ planned “hundreds of billions” in capex still can’t meet demand. Agentic AI (notably coding assistants) multiplies token demand roughly 15× per Arm CEO René Haas and has driven CPU orchestration needs from ~30M to ~120M cores per GW; Intel is locking multi‑year supply deals. Nvidia is central — networking sales +263% YoY, Groq acquisition for ultra‑low‑latency inference, and Vera Rubin GPUs slated for 2026 — while model improvements (Anthropic “Mythos,” larger context windows, multimodal data) and Amazon’s GPU rearchitected ads point to sustained, large‑scale demand (U.S. knowledge work ≈ $6.2T).

By Tae Kim from Key Context Newsletter

73 divenewsletter.com 2026-05-09 5 min read

Open

Weekender: Eversource CEO: ‘We are resisting data centers’

Why it matters

Eversource Energy CEO Joe Nolan said May 2026 that the company is “resisting data centers,” arguing they are “of no value to our residential customer — actually, any customer” and will raise energy prices.

Key details

NERC issued a Level 3 alert citing computational loads as “immediate risks,” requiring certain grid participants to complete seven specified actions by Aug. 3, 2026 to address potential data center load losses.
American Electric Power is reviewing exits from PJM and SPP amid interconnection delays; its utilities hold contracts for about 63 GW of new large load by 2030.
Dominion’s 2.6 GW Coastal Virginia Offshore Wind began producing some electricity in March 2026, is expected fully operational by 2027, and the company projects roughly $5 billion in fuel savings over 10 years, even as its Q1 fuel and energy-related costs rose 67%.

Brief

Eversource Energy is taking an oppositional stance toward new data center load growth: CEO Joe Nolan said in May 2026 the assets provide “no value” to residential or other customers and will push up rates. That stance sits alongside system-level alarm — NERC issued a Level 3 alert saying computational loads pose “immediate risks” and ordering seven actions by Aug. 3, 2026 to mitigate data center-related load losses. The sector is also wrestling with interconnection and market strain: AEP, with ~63 GW of contracted large load through 2030, is weighing exits from PJM and SPP over slow generation interconnection, while PJM is floating capacity-market overhaul options. Meanwhile, Dominion’s 2.6 GW Coastal Virginia Offshore Wind has started producing (March 2026), aims for full operation by 2027, and expects about $5 billion in fuel savings over a decade even as Q1 energy costs jumped 67%.

By Utility Dive

74 substack.com 2026-04-30 11 min read

Open

[AINews] The Inference Inflection

Why it matters

Leaders are treating inference as a strategic, high‑growth resource: Noam Brown called “inference compute is a strategic resource, currently undervalued” and Sam Altman said “we have to become an AI inference company now” (AINews, Apr 30, 2026), framed as a reaction to a successful GPT‑5.5 launch.

Key details

Intel’s Q1 commentary and SemiAnalysis transcripts flag rising CPU demand: a 5–6 year refresh cycle from 2020–21 (when firms bought roughly $100B of CPUs) is reaching end‑of‑life, producing underinvestment and potential CPU shortages for inference and simulation workloads.
NVIDIA’s Jensen Huang declared the “inference inflection” and estimated compute demand growth: inference token/compute needs up ~10,000×, usage up ~100×, and he argued total computing demand may have risen by ~1,000,000× in two years.
Infrastructure and workload shifts: prefill/decode disaggregation is now common and vendors are pursuing specialized inference paths (article notes Nvidia–Groq, Intel–SambaNova references and Amazon pursuing Cerebras‑style approaches), reshaping GPU/CPU serving architectures.

Brief

Inference compute has moved from supporting experimentation to becoming the dominant production constraint across AI stacks. AINews positions recent comments from industry figures (Noam Brown, Sam Altman, Jensen Huang) and Intel’s Q1 signals as evidence of an "inference inflection": inference token and compute needs have exploded (Jensen cited ~10,000× per‑task compute growth and suggested an aggregate feeling of up to 1,000,000× over two years, with usage up ~100×). SemiAnalysis and pod transcripts underline a CPU refresh cycle issue — enterprises bought ~ $100B of CPUs in 2020–21, creating underinvestment and rising utilization now that simulation, RL gyms, and production agents increasingly run on CPUs.

The market response is visible across models, runtimes, and kernels. The newsletter highlights new model families (Mistral Medium 3.5 dense ~128B with 128K context; IBM Granite 4.1 open weights 30B/8B/3B with cost‑efficient Granite 8B), vendor moves toward prefills/decode disaggregation and specialized inference hardware, and software/hardware co‑design wins: vLLM+Blackwell throughput (DeepSeek V3.2 at 230 tok/s, 0.96s TTFT), Qwen FlashQLA’s 2–3× forward speedups, and serving fixes (e.g., GLM‑5 LayerSplit improving prefill throughput by up to 132%). The piece argues the combination of rising inference demand, falling open‑model pricing, and harness engineering (agent harnesses improving pass@1 and token efficiency) is reshaping procurement, architecture, and economics for AI in 2026.

By AINews

75 divenewsletter.com 2026-04-29 5 min read

Open

CAISO expects a “solid launch” for EDAM (the first Western day‑ahead market)…

Why it matters

CAISO expects a “solid launch” for EDAM (the first Western day‑ahead market); PacifiCorp’s Mike Wilding said 90 days of parallel operations before launch gave confidence the market is “working as intended” (reported Apr 29, 2026).

Key details

Octopus Energy and Lunar Energy rolled out battery‑enabled retail plans in Texas that include deeply discounted home batteries designed to provide grid support and backup power to customers.
Consumers Energy saw sales growth driven by industrial loads while costs tied to a coal plant emergency order have risen; at least one analyst flagged regulatory strategy concerns after DTE Energy said it may pause future rate‑hike requests if its current request is approved.
CenterPoint Energy plans to energize 8 GW of data center load by 2029 (CEO Jason Wells said Houston is a choice location for hyperscalers), and TVA with Plus Power will deploy a 200 MW / 800 MWh BESS at Crawfish Creek, Alabama — one of TVA territory’s first grid‑scale batteries and part of a board plan for up to 1.5 GW of storage by 2029.

Brief

The April 29, 2026 Load Management briefing highlights major operational and market moves across U.S. power systems: CAISO’s new EDAM day‑ahead market is expected to launch smoothly after 90 days of parallel operations, per PacifiCorp’s Mike Wilding. Retail innovation in Texas continues as Octopus Energy and Lunar Energy introduce plans that pair deeply discounted home batteries with grid‑support services and backup capability. Utilities are also managing shifting demand and costs — Consumers Energy’s sales were lifted by industrial loads even as emergency coal plant orders raised costs and prompted analyst questions about regulatory strategy following DTE’s comments on pausing future rate requests. On infrastructure, CenterPoint targets 8 GW of data center load by 2029, and TVA plus Plus Power announced a 200 MW / 800 MWh Crawfish Creek BESS as part of a path to 1.5 GW of storage. Meanwhile EIA data show retail electricity revenue per kWh jumped sharply in several states (Virginia +26.3%, Ohio +21.9%, Pennsylvania +19.5%) and averaged a 9% U.S. price increase in February.

By UD: Load Management

76 Epoch AI 2026-05-01 8 min read

Open

AI Chips: why they cost as much as a car, and why companies can't get enough

Why it matters

TSMC dominates fabrication of leading-edge AI chips (most fabs in Taiwan; new fabs in Arizona, Japan, Germany) while ASML (Netherlands) is the sole supplier of EUV lithography tools and HBM memory is supplied mainly by Samsung, SK Hynix, and Micron — HBM was a binding constraint on production in late 2025.

Key details

By Q4 2025 the five largest chip designers had shipped roughly 20 million AI chips; Nvidia accounted for ~50% of units and >66% of total AI computing capacity, Google was second, and Huawei (using an alternative chain after US export controls) held about 6% in 2025.
Nvidia flagship prices rose from $5,700 (2016) to $34,000 (2022) but cost-effectiveness improved: the H100 (2022) delivered ~17× more computation per dollar than the P100 (2016); compute-per-dollar has roughly doubled every 2.5 years and chips/computing account for ~54–62% of AI firms' spending.
Power and efficiency tradeoffs: a single high‑end chip draws ~1,000 W; total AI datacenter capacity reached 'tens of gigawatts' by late 2025 (comparable to New York peak); energy efficiency has improved ~40%/year (doubling ≈2.7 years) — e.g., Nvidia B100 (2024) ≈3× computation/watt vs A100 (2020) — yet aggregate consumption rises as installed chips scale faster than efficiency gains.

Brief

AI chips sit at the center of modern AI progress: specialized accelerators (Nvidia Blackwell/Hopper, Google TPU, Amazon Trainium, Huawei Ascend) are designed by a few firms but mostly fabricated by TSMC, with critical upstream suppliers such as ASML (EUV) and HBM vendors Samsung, SK Hynix, and Micron. By Q4 2025 roughly 20 million AI chips had been shipped by the five largest designers, with Nvidia supplying ~50% of units and >66% of deployed compute capacity. Metrics matter more than sticker price: although flagship chips rose from $5,700 (2016) to $34,000 (2022), computation per dollar improved dramatically (H100 ≈17× P100), and compute-per-dollar has doubled ~every 2.5 years. Chips consume ~1,000 W each and total datacenter power reached tens of gigawatts by late 2025; energy efficiency has improved ~40% annually, but aggregate electricity use still grows because deployment outpaces per-chip gains.

By The Epoch AI Team

77 divenewsletter.com 2026-04-04 5 min read

Open

Weekender: Large load tariffs proliferate as states take more active role in data center regulation

Why it matters

State regulators approved 29 large-load tariffs in 2025, and utilities and regulators expect more such tariffs to be proposed as states take a more active role in data center regulation (Utility Dive, published Apr 4, 2026).

Key details

Ratepayer and municipal groups are already pushing back on cost allocation: American Municipal Power (AMP) and the Office of the Ohio Consumers’ Counsel opposed the proposed ROE and rates for a $1.1 billion AEP–FirstEnergy transmission project at FERC, arguing ratepayers are exposed to data center-driven transmission costs.
The policy outcomes remain uncertain: experts say it’s too soon to tell whether large-load tariffs are achieving intended goals, even as related trends — 1 in 3 data center campuses projected to exceed 1 GW by 2035 and a 22% drop in U.S. solar installations in 2025 (FERC) — raise pressure on affordability and grid planning.

Brief

Large-load tariffs have proliferated as states step up oversight of data center growth: regulators approved 29 such tariffs in 2025 and many more proposals are underway (Utility Dive, Apr 4, 2026). The tariffs aim to allocate grid and transmission costs associated with big, concentrated electricity demand, but experts say it is too early to assess effectiveness. Pushback is mounting from ratepayer advocates and municipal utilities — for example AMP and the Ohio Consumers’ Counsel contested the return-on-equity and proposed rates for a $1.1 billion AEP–FirstEnergy transmission project at FERC, citing exposure of customers to data center-related costs. The shift comes amid broader grid pressures: analysts expect near‑term retail price increases (LBNL/Brattle), FERC reported a 22% fall in solar builds in 2025, and industry forecasts that 1 in 3 data center campuses could exceed 1 GW by 2035, all factors that heighten regulatory scrutiny and affordability concerns.

By Utility Dive

78 datacenterdynamics.com 2026-03-19 1 min read

Open

Scale AI cooling efficiently

Why it matters

Data Centre Dynamics' Management & Operations Channel published an on‑demand episode titled "Delivering future‑ready liquid cooling solutions faster" (published 19 Mar 2026) aimed at helping operators deploy liquid cooling at scale to address rising rack densities driven by AI workloads.

Key details

The episode promises concrete objectives: improve energy efficiency with advanced liquid cooling strategies, reduce power and water impact in high‑density environments, validate designs early to avoid costly inefficiencies, and support sustainability targets without compromising performance.

Brief

Scale AI cooling efficiently promotes an on‑demand episode from Data Centre Dynamics' Management & Operations Channel (published 19 Mar 2026) that guides operators on deploying liquid cooling at scale for AI‑driven high‑density racks. It emphasizes improving energy efficiency, cutting power and water impact, early design validation to prevent inefficiencies, and meeting sustainability targets while maintaining performance.

By DCD Management & Operations Channel

79 substack.com 2026-04-29 49 min read

Open

CERAweek

Why it matters

NRG’s planning horizon has compressed to roughly 12–18 months, according to Gin Kinney (Executive VP & CAO of NRG), reflecting rapid load growth and speed-to-power pressures discussed at CERAweek (April 2026).

Key details

NRG has sourced 5.4 GW of natural‑gas turbines, signed a construction labor arrangement with Kiewit, and is under construction on three gas plants financed in part by the Texas Energy Fund (TEF); one TEF project at TH Wharton is described as well advanced.
NRG is building a one‑gigawatt virtual power plant (VPP) in Texas by aggregating smart thermostats and distributed assets (target scale discussed through the 2030s/2035 horizon); executives say customers have been willing to enroll for bill savings and grid flexibility.
Market and queue uncertainty is acute: ERCOT’s large‑load interconnection queue has jumped dramatically (nearly 300% in the prior year) and hosts figures cited as high as ~410 GW of prospective large loads versus current summer peak demand around ~85.5 GW — numbers that force new interconnection/queue processes (batch zero) and SB6 rulemaking (Project No. 58317).

Brief

CERAweek conversations captured a Texas grid under rapid and uncertain expansion, where traditional 10–20 year planning cycles have compressed to 12–18 months. NRG’s Gin Kinney described a strategic shift toward speed‑to‑power and flexibility: the company has sourced 5.4 GW of gas turbines, secured Kiewit construction capacity, and is building three gas plants supported in part by the Texas Energy Fund (TEF). At the same time NRG is scaling a one‑gigawatt virtual power plant (VPP) in Texas by aggregating smart thermostats and other distributed assets, using AI and predictive analytics in market operations and home automation to shave peaks, lower customer bills, and avoid some transmission investments.

The wider market picture driving those moves is highly unsettled. ERCOT’s queue and forecasts are volatile — panelists cited the large‑load queue growing roughly 300% last year and headline figures as high as ~410 GW of prospective large loads compared with current summer peaks of ~85.5 GW. Regulators and operators are responding with batch‑zero queue processes, SB6 implementation rulemaking (PUCT Project No. 58317), and pilots such as ERCOT’s ADER program to target nodal congestion. The consequence: many hyperscalers and data centers are adopting BYOP (bring‑your‑own‑power) or bridge‑power strategies for speed to commercial operation; a striking example is a ~350 MW facility near El Paso temporarily powered by ~800 small ~450 kW generators. That fragmentation raises logistics, maintenance, and cost‑allocation questions.

Taken together, the episode argues that distributed flexibility (VPPs, residential smart homes, EVs as potential storage) can substitute for some new centralized generation and defer T&D build—if it can be scaled, contracted, and remunerated correctly. Speakers emphasized the need for cross‑sector partnerships (generators, retailers, T&D, aggregators), clearer policy/PUC direction, and robust interconnection processes to reconcile ambitious load growth claims with the physical limits of supply chains, permitting, and transmission build timelines.

By Texas Energy and Power Newsletter

80 Epoch AI 2026-04-06 4 min read

Open

Introducing the AI Chip Owners Explorer

Why it matters

Epoch AI's AI Chip Owners explorer (published 2026-04-06) estimates US hyperscalers (Amazon, Google/Alphabet, Meta, Microsoft, Oracle) own over 70% of global AI compute, with Google leading the group.

Key details

Google is estimated to hold the equivalent of ~5 million Nvidia H100 GPUs (~25% of global total) and nearly 4 million H100-equivalent TPU compute (confidence interval 3.1M–4.5M).
Mainland Chinese customers are estimated to own just over 5% of cumulative leading-AI-chip compute as of end-2025; prior research (Grunewald & Fist) estimates >100,000 Nvidia A100/H100 units were illicitly shipped to China in 2024 (high uncertainty).
The explorer builds on Epoch AI's AI Chip Sales dataset and allocates chips to owners using analyst estimates, company financial disclosures, capital spending, and frontier-scale data-center analysis; it also notes many frontier developers (e.g., OpenAI, Anthropic) rent most compute from hyperscalers.

Brief

Epoch AI's AI Chip Owners explorer (launched 2026-04-06) maps who owns the world's leading AI chips by distributing previously estimated sales volumes (Nvidia, Google TPU, Amazon Trainium, AMD, Huawei) across major owners using analyst estimates, company disclosures, capex data, and frontier data-center analysis. The study finds US hyperscalers control over 70% of AI compute, led by Google (≈5 million H100-equivalents, ~25% of global compute) with Google TPUs contributing nearly 4 million H100-equivalents (CI 3.1M–4.5M). Mainland China holds just over 5% of cumulative compute as of end-2025; illicit imports may have added tens of thousands to >100,000 A100/H100s in 2024 per prior reports but are unlikely to close the gap. The explorer includes interactive visualizations, a methodology and dataset for researchers and policymakers.

By Josh You, Venkat Somala

81 substack.com 2026-03-20 9 min read

Open

🍪 TWiC: MU HBM4, OCI MSA, Credo Optics, VCSEL Scale-Up, DGX GB300, NBIS $27B

Why it matters

Micron supplied HBM4 to NVIDIA’s Vera Rubin (NVL576) accelerator and Jensen Huang publicly called it the “industry’s fastest HBM4”; Micron’s stock was up ~62% YTD as of Mar 2026.

Key details

The Optical Compute Interconnect MSA (OCI MSA) launched to standardize optical scale‑up and enable interoperable CPO components across vendors; Lightmatter also announced an Open Compute Project collaboration for CPO specs.
Credo announced new optical products: Cardinal 1.6T DSP, Robin 400G/800G DSPs, and ZeroFlap 800G transceivers, positioning it to supply both optical and copper markets (noted links to Meta’s MTIA 400 use case).
Lumentum showcased a 1060 nm VCSEL platform for a “slow‑but‑wide” scale‑up approach (VCSEL arrays running many parallel lanes), offering a GaAs‑based supply‑chain alternative to InP EML lasers and claiming better speed/high‑temp reliability vs 850 nm VCSELs.

Brief

Subject: Optical and AI infrastructure scale‑up — Micron HBM4, OCI MSA, Credo, Lumentum, NVIDIA GB300, Nebius‑Meta deal. The newsletter summarizes OFC and GTC headlines (Mar 2026) that signal simultaneous scale‑up in memory, optics, and enterprise AI infrastructure. Micron is now publicly supplying HBM4 to NVIDIA’s Vera Rubin (NVL576) and was praised by Jensen Huang as delivering the “industry’s fastest HBM4,” reversing earlier rumors that Micron would not ship HBM4; Micron’s stock was ~62% higher YTD.

On optics, industry players formed the OCI MSA to standardize Optical Compute Interconnects and avoid vendor lock‑in, with Lightmatter pursuing complementary OCP CPO specs. Vendors pushed new hardware: Credo released Cardinal 1.6T and Robin 400/800G DSPs plus ZeroFlap 800G transceivers; Lumentum demoed 1060 nm VCSELs for a “slow‑but‑wide” parallel‑lane scaling approach (GaAs VCSEL arrays as an alternative to scarce InP EMLs). In infrastructure, NVIDIA’s DGX Station GB300 targets on‑prem enterprise develop‑to‑deploy inference, and Nebius struck a five‑year agreement with Meta for up to $27B of AI capacity ( $12B committed, up to $15B additional), with Vera Rubin cluster deliveries starting early 2027.

By Vik's Newsletter

82 cautiousoptimism.news 2026-04-07 10 min read

Open

Rewriting capitalism for the AI era

Why it matters

Anthropic announced it has crossed a $30 billion annualized revenue run rate as of early April 2026, up from roughly $1 billion at the start of 2025 and $9 billion at year-end 2025; the company earlier hit $14B (mid-February) and $19B (end of February).

Key details

Anthropic disclosed a 3.5‑gigawatt compute deal with Google Cloud (up to ~1,000,000 TPU chips) to be manufactured by Broadcom, with most capacity coming online in 2027; Alex Wilhelm estimates the deal’s value in the tens of billions (roughly ~$35B by a data‑center GW rule‑of‑thumb).
Anthropic is already imposing peak‑usage limits and restricting third‑party ‘OpenClaw’ usage of subscriber compute as it faces near‑term compute constraints; the company says customers paying $1M+ annually doubled in under two months to more than 100.
U.S. lawmakers are pushing the MATCH Act to block sales of key semiconductor manufacturing equipment (at minimum DUV immersion lithography and cryogenic etch tools, and effectively targeting EUV/DUV), a move that pressures ASML, risks accelerating China’s domestic tooling efforts, and could reshape the global SME market.

Brief

Anthropic’s growth narrative has gone hyperbolic: the AI lab reported a >$30 billion annualized revenue run rate in early April 2026 after rising from ~$1B at the start of 2025 and $9B at year‑end 2025, with intermediate milestones of $14B and $19B in February. The company recently capped peak‑usage and limited third‑party harnessing of subscriber compute while signing a 3.5‑GW deal with Google Cloud (up to ~1M TPUs) to be manufactured by Broadcom, capacity that will mostly arrive in 2027 and that Wilhelm values at roughly $35B by his GW cost heuristic. Wilhelm cautions that Anthropic’s headline revenue may overstate GAAP‑style economics because of how cloud‑partner revenue is counted.

At the same time, geopolitics and economics are converging: U.S. legislators’ MATCH Act would curtail exports of critical SME (DUV/EUV and cryogenic etch tools), pressuring ASML and likely accelerating Chinese tooling programs. OpenAI has proposed a policy package—higher capital taxes, targeted automated‑labor levies, wage‑linked incentives, a public wealth fund, portable benefits, and AI access as public infrastructure—framed as a new social contract to manage predicted labor disruption (Dario: 50% of entry‑level white‑collar roles disrupted in 1–5 years) and political risks voiced by investors like Vinod Khosla. The piece ties rapid commercial AI scale, scarce compute, infrastructure externalities, and tax/political choices together as central determinants of whether AI’s economic gains translate into broad prosperity or concentrated capital power.

By Alex Wilhelm from Cautious Optimism

83 Epoch AI 2023-07-28 27 min read

Open

Trading off compute in training and inference

Why it matters

Epoch AI (Pablo Villalobos & David Atkinson), published 2023-07-28, quantifies tradeoffs between training compute (TC) and per-inference compute (IC) across five techniques: model/data scaling, Monte Carlo Tree Search (MCTS), pruning (IMP), repeated sampling/resampling (pass@k / n@k), and chain-of-thought/cascades.

Key details

Varying the Chinchilla-style scaling policy (using TC ≈ 6ND and IC ≈ 2N) can trade ~1.2 orders of magnitude (OOM) extra training compute to reduce ~0.7 OOM of inference compute while holding performance constant (Hoffmann et al. 2022 fit used).
MCTS experiments replicated on Hex (extending Jones 2021) show an S-curve relation: at low performance one can trade ~1 OOM training for ~1.6 OOM inference, while at high performance the tradeoff inverts (≈1.6 OOM training buys only ~1.1 OOM inference) and disappears near perfect play.
Iterative Magnitude Pruning (IMP) can reduce inference compute by ~1 OOM (pruning to ~10% density) at the cost of increasing training compute by ~0.7 OOM due to repeated re-training rounds (e.g., 20% pruning per iteration → ≈4.5× extra training work to reach 10% density).

Brief

Across techniques they find typical trade spans of order 1–2 OOM in each axis: (a) Chinchilla-style scaling can trade ~1.2 OOM training for ~0.7 OOM inference; (b) MCTS exhibits an S-curve where at low performance ~1 OOM training ↔ ~1.6 OOM inference (and the trade shrinks as models near perfect play); (c) IMP pruning to ≈10% density saved ≈1 OOM inference at ~0.7 OOM extra training because of multi-round re-training; and (d) resampling with cheap verification (e.g., code generation pass@k) extrapolates to much larger spans (authors estimate up to ~3 OOM training savings vs ~6 OOM inference increase) but caution low confidence in extrapolation. The authors categorize three tradeoff archetypes—saturating/constant span (e.g., Chinchilla), saturating/decreasing span (e.g., MCTS, limited n@k), and non-saturating/increasing span (resampling with effectively unlimited trials)—and show that some techniques combine roughly additively but often with diminishing returns, so realistic combined spans are ~2–3 OOM. Methodologically they fit empirical curves (smoothly broken power laws, log relations) and prioritize Pareto frontiers for optimal TC/IC choices. The paper emphasizes governance implications: because aggregate inference cost often exceeds training cost in deployed systems, commercial models will be biased toward low-IC configurations, but researchers or well-resourced actors can invest IC to simulate larger-model capabilities for evaluation, internal use, or small-customer deployments—affecting safety assessments and policy proposals aimed at controlling frontier capabilities.

By Pablo Villalobos, David Atkinson

84 substack.com 2026-03-24 8 min read

Open

Why hasn’t oil gotten even more expensive?

Why it matters

More than 20% of global oil normally transits the Strait of Hormuz; Yglesias estimates a supply cut of that size would require oil to rise toward ~$150/barrel to reduce consumption by ~20%, versus a roughly $100/barrel price cited on Mar 24, 2026.

Key details

A 'TACO equilibrium' (Trump Always Chickens Out) is damping futures spikes: traders sell into big rallies expecting President Trump to force a political resolution if U.S. gasoline soars, so futures markets understate real supply pain.
Grade and geography matter: the Iran-linked supply shortfall is heavier crude that feeds Asian refineries, producing shortages in distilled products (jet fuel, LPG, naphtha); examples include India rationing commercial cooking gas, airlines canceling flights for expensive jet fuel, and ship-fuel shortages.
Sanctions changes doubled the price of Russian Urals crude after being lifted, and niche blends (Labuan, Minas, Bach Ho) have traded about $10/barrel over Brent (Bloomberg), creating larger moves outside headline Brent/WTI indexes.

Brief

Matthew Yglesias (Mar 24, 2026) argues that the apparent muted move in headline oil benchmarks masks acute real-world disruption: more than 20% of global oil transits the Strait of Hormuz, and a supply loss that large would, by his reckoning, require prices nearer $150/barrel to force a 20% demand cut versus the roughly $100/barrel price cited. Short‑run gasoline elasticity is low, so consumption falls mainly via product shortages, not immediate consumer behavior. The disruption disproportionately affects heavier crudes feeding Asian refineries, causing rationing (India’s commercial LPG curbs), jet‑fuel‑related flight cancellations, naphtha and marine‑fuel shortages, and refinery windfall margins. Futures are restrained by a 'TACO' expectation that President Trump would politically intervene if U.S. pump prices spike, while post‑sanctions Urals crude doubled in price and niche blends trade ≈$10/bbl over Brent.

By Matthew Yglesias

85 Twitter/X 2026-05-04 1 min read

Open

On 2026-05-04 energybants announced a joint venture between Brookfield and The…

Why it matters

On 2026-05-04 energybants announced a joint venture between Brookfield and The Nuclear Company to deliver GW-scale nuclear, specifically deploying Westinghouse reactors and managing completion of AP1000 units in South Carolina

Key details

The author identifies themself as Chief of Staff to proven builder-CEO Jonathan Webb and says they are building the team alongside Chief Nuclear Officer @TheNuclearJoe to 'fix the nuclear plant deployment problem in the West'
The Nuclear Company’s stated strategy is to use proven, repeatable designs, disciplined project management, and emerging technologies to deliver plants on-time and on-budget at GW scale; the author pledges further updates as the program progresses

Brief

The Nuclear Company and Brookfield announced a joint venture on 2026-05-04 to deliver GW-scale nuclear power, including Westinghouse reactors and management of AP1000 completions in South Carolina. The author, serving as Chief of Staff to CEO Jonathan Webb and working with CNO @TheNuclearJoe, emphasizes repeatable, proven designs, disciplined project management, on-time/on-budget delivery, and use of emerging technologies.

By @energybants

86 Twitter/X 2026-05-02 2 min read

Open

Commodity DRAM prices have surged more than 100%; the memory industry posted…

Why it matters

Commodity DRAM prices have surged more than 100%; the memory industry posted record results driven by that price jump (source post dated 2026-05-02).

Key details

CPU makers (Intel, AMD) are targeting 300–400 GB of commodity DRAM per 'AI CPU' — up to four times typical CPU memory (typical 96–256 GB) — to support context memory for inference orchestration.
Server architectures are shifting: historic 8-GPU-to-1-CPU setups are moving to 1 CPU : 4 GPU and trending toward 1:1 for inference workloads (Intel executives cited this shift).
GPU/accelerator memory arms race continues: NVIDIA 'Vera Rubin' targets 288 GB via 8 HBM stacks; AMD MI400 targets 432 GB; Google TPU 8i slated for 288 GB HBM — all increasing DRAM/HBM demand.

Brief

AI CPUs are driving a new surge in DRAM demand as the industry pivots from training-centered GPU farms to inference architectures that require large "context memory." Vendors aim to pack 300–400 GB of commodity DDR5 into next‑generation CPUs — up to four times current CPU capacities — while GPUs and accelerators are also ballooning HBM capacity (NVIDIA Vera Rubin 288 GB, AMD MI400 432 GB, Google TPU 8i 288 GB). Server ratios are shifting from 8:1 (GPU:CPU) toward 1:4 and even 1:1 for inference coordination. Spot-price evidence shows DDR5 holding a premium (DDR5 +2.8% vs DDR4 −16% in April on a 16 GB basis), and an industry source estimates roughly a 10 percentage‑point supply shortfall. The post concludes that memory suppliers (Samsung, SK hynix) are unlikely to catch up in 2026, pushing the shortage into 2027.

By @jukan05

87 substack.com 2026-04-20 39 min read

Open

How Much Do GPU Clusters Really Cost?

Why it matters

SemiAnalysis published a Cluster TCO methodology and two free calculators (GPU Cluster TCO and Goodput Calculator) based on August 2025 pricing data and interviews with 150+ customers; the TCO formula explicitly includes GPUs, storage, networking, control plane, support, goodput, setup, and debugging.

Key details

When holding GPU price constant, SemiAnalysis finds gold‑tier providers have 5–15% lower TCO than silver‑tier providers across large training workloads; in their Large LLM Pretrain example (5,184 NVL72 GPUs, 80% of cluster used), relative multi‑year costs were Gold = 1.00x, Hyperscaler = 1.10x, Silver = 1.15x at $4/GPU‑hr.
Goodput (useful work) is modeled with three cases—G_chkpt‑hot, G_chkpt‑cold, and G_tolerant—using inputs like MTBF, time‑to‑identify failures, time‑to‑repair, checkpoint frequency, job size, and blast radius; in the Large Pretrain scenario reported goodput losses were ~6.14% (gold), 10.53% (hyperscaler), and 20.91% (silver).
SemiAnalysis details three fault‑tolerance approaches: TorchFT (open source, uses GLOO vs NCCL, observed >10% perf overhead), AWS SageMaker HyperPod checkpointless (launched Dec 2025, AWS claims ~1m45s recovery vs ~15m for checkpoint restart), and TorchPass (commercial, maintains baseline performance but requires idle spare nodes; their example used 32 idle GPUs = 0.62% of cluster).

Brief

SemiAnalysis proposes a practical, bottom‑up Total Cost of Ownership (TCO) framework for GPU clusters that goes beyond headline $/GPU‑hr. Their monthly TCO formula sums GPU rental, storage (hot/warm/cold), networking, control plane, support uplift, plus two categories of implicit costs: Goodput Expense (lost useful work from failures and inefficiencies) and amortized engineering expenses for setup and debugging. The firm published two interactive tools — a GPU Cluster TCO Calculator and a Goodput Calculator — and populated defaults from an August 2025 pricing snapshot, hands‑on tests of 80+ neoclouds, and interviews with >150 users.

They evaluate three representative provider tiers (Gold‑tier, Hyperscaler, Silver‑tier) across three scenarios: Large LLM Pretrain (5,184 NVL72 GPUs, ~80% allocated; 500 TiB hot + 10 PiB cold storage; $4/GPU‑hr input), Multimodal RL Research (2,048 B200 cluster with ~12 TB/GPU storage; neocloud $2.40 vs hyperscaler $3.10/GPU‑hr), and Inference Endpoints (512 GPUs, 1 TB/GPU). Key results: holding GPU price equal, gold‑tier TCO was roughly 5–15% lower than silver in large training; in the pretrain scenario multi‑year cost ratios were Gold 1.00x, Hyperscaler 1.10x, Silver 1.15x. Goodput modeling—with formulas for checkpoint‑hot, checkpoint‑cold, and fault‑tolerant cases—shows large jobs suffer disproportionately from MTBF, detection latency, repair time, checkpoint frequency and blast radius: in the Large Pretrain example SemiAnalysis reports goodput losses of ~6.14% (gold), 10.53% (hyperscaler), and 20.91% (silver).

On fault tolerance, they compare TorchFT (Meta open source; easier recovery but >10% comms overhead due to GLOO vs NCCL), AWS SageMaker HyperPod checkpointless (announced Dec 2025; AWS claims ≈1m45s recovery and uses model redundancy over EFA), and TorchPass (commercial; no perf hit but requires idle spare nodes — their test used 32 spare GPUs, ~0.62% of cluster). The analysis concludes that hidden line items (support tiers, orchestration premiums, setup/debug engineering, and goodput losses) often make ostensibly cheaper GPU‑hour offers more expensive in practice. ClusterMAX 2.1 (Apr 2026) adds several providers (Core42, BitDeer, FPT, Radiant/Ori, etc.), and SemiAnalysis plans broader ClusterMAX 3.0 testing and MTBF data collection this summer.

By SemiAnalysis

88 YouTube 2026-01-07 Video

Open

Will Trump’s reforms of nuclear energy lead to a wave of innovation or a nuclear disaster?

Why it matters

Executive Order 14301 (signed by President Trump) removes the Nuclear Regulatory Commission (NRC) design-review requirement for test reactors and replaces it with Department of Energy consultation focused on technological readiness, site evaluations, financial viability and a plan to achieve criticality — with an explicit goal for test reactors to reach criticality by July 4, 2026.

Key details

New Scale (the only company the presenter cites as having an innovative design approved) spent over $500 million and more than 2 million labor hours preparing its Design Certification Application: 12,000+ pages for the DCA, 14 topical reports, and ~2 million pages of supporting information across six phases, yet still has not built a reactor.
Valor Atomics is a startup led by Isaiah Taylor (a 20‑something high‑school dropout) developing a helium-cooled pebble-bed small modular reactor using TRISO-like fuel and a graphite moderator; Valor lists ~7 people as 'nuclear engineers' on LinkedIn (only one with a PhD), and plans include a low-power ~100 kW thermal test reactor (initially planned in the Philippines) and a later prototype in Utah enabled by EO14301.
Technical and safety concerns highlighted: Valor’s concept operates near ~900°C (vs ~300°C for most light-water reactors), creating questions about TRISO fuel behavior at high temperature, oxygen ingress consequences, and neutron-induced accelerated aging of reactor vessels — issues normally explored in NRC failure-mode-and-effects analyses that EO14301 sidesteps.

Brief

The presenter flags concrete safety and program‑management concerns: Valor’s higher operating temperature (~900°C) raises unanswered questions about fuel integrity, oxygen ingress, and neutron‑accelerated material aging that would normally be explored in NRC failure‑mode analyses. He also notes an operational incident tied to Valor’s head of operations in 2021 as a data point on quality culture. Economically, the talk argues SMRs struggle without mass production — citing China’s heavy investment in ~1 GW pressurized water reactors — and predicts many startups will fail for market reasons rather than regulatory blockage. The recommended middle path is to streamline but not eliminate expert review (for example, require an initial NRC phase review) so innovators can iterate while retaining independent safety assurance.

By Decarbonize!

89 substack.com 2026-04-16 4 min read

Open

Flashing Orange

Why it matters

The EU exited the winter of 2025–2026 with disastrously low natural gas stocks just as a war in Iran closed the Strait of Hormuz, removing roughly 20% of global LNG supply (article published Apr 16, 2026).

Key details

The combined area of the EU, Great Britain and Norway (EU+2) produces less than half the natural gas it consumes, leaving an approximate shortfall of ~20 billion cubic feet per day (bcf/d) — about the entire output of the Permian Basin.
Germany once operated 19 nuclear reactors that generated about 170 terawatt-hours (TWh) of baseload electricity per year; the author frames their decommissioning as a major strategic loss as Europe faces a refilling season with little margin for error.

Brief

Flashing Orange, by Doomberg (Apr 16, 2026), warns that Europe is facing a high-risk natural gas crunch: the EU finished winter 2025–2026 with very low inventories while a war in Iran has effectively closed the Strait of Hormuz and taken ~20% of global LNG offline. Doomberg aggregates EU+2 production and consumption data to show the region produces under half its gas needs, leaving a deficit near 20 bcf/d (roughly the Permian Basin's output). The newsletter contrasts this vulnerability with Germany's lost nuclear capacity — 19 reactors that once delivered ~170 TWh/year — calling the phase-out a geopolitical own goal. The author says the refilling season leaves little room for error, expects higher energy prices and continued deindustrialization, and reports having mapped major inflows and risks (the detailed findings are paywalled).

By Doomberg

90 substack.com 2026-04-30 3 min read

Open

Microsoft Q3 FY2026. The $190 Billion CapEx Bombshell

Why it matters

Microsoft Q3 FY2026 revenue $82.9B (+18% YoY), beating the ~$81.4B consensus; Microsoft Cloud revenue $54.5B (+29% YoY) and now ~two-thirds of company revenue

Key details

Azure grew 40% YoY (39% in constant currency), above Q2 guidance of 37–38% CC and up from 35% a year ago
Q3 operating income $38.4B (+20% YoY) with operating margin 46.3%; GAAP diluted EPS $4.27 (+23% YoY) — prior-quarter EPS was inflated by a large OpenAI investment gain
Q3 CapEx (including finance leases) $31.9B (below $34.9B consensus) but CFO Amy Hood guided FY2026 CapEx of $190B (≈$35B above prior consensus), which spooked markets; commercial remaining performance obligations hit $627B (+99% YoY)

Brief

Microsoft's Q3 FY2026 results beat expectations across revenue, cloud, and profit metrics: revenue totaled $82.9B (+18% YoY), Microsoft Cloud brought in $54.5B (+29% YoY), and Azure accelerated to 40% YoY growth (39% in constant currency), outperforming prior guidance of 37–38% CC. Operating income reached $38.4B (+20% YoY) with a 46.3% margin, and GAAP diluted EPS was $4.27 (+23% YoY) after a Q2 spike tied to an OpenAI investment gain (non‑GAAP trends show underlying growth). Q3 CapEx including finance leases was $31.9B (below the $34.9B consensus), but CFO Amy Hood's FY2026 CapEx guidance of $190B — roughly $35B above prior market expectations — drove a negative market reaction. Commercial remaining performance obligations surged to $627B (+99% YoY), the largest backlog Microsoft has reported.

By William Martin Keating from Semicon Alpha

91 Twitter/X 2026-04-29 1 min read

Open

NRCan unveiled a national nuclear framework at the Canadian Nuclear Association…

Why it matters

NRCan unveiled a national nuclear framework at the Canadian Nuclear Association conference on April 29, 2026, with a final strategy due by the end of 2026 built around four pillars: new builds across Canada; global export leadership; expanded uranium and fuel cycle; and next‑gen innovation across fission and fusion.

Key details

Department of National Defence is allocating $40M (2026–2027) to assess Canadian-controlled microreactors for remote and Arctic bases, emphasizing energy sovereignty and military applications.
Federal investment and sector scale: $2.2B committed to Chalk River over 10 years for SMRs, fuels, CANDU, and safety/security; Canada supplied ~24% of global uranium in 2024 (largely Saskatchewan) with a $2.6B economic impact; 17 CANDU reactors produce ~13% of Canada’s electricity and the nuclear sector contributes ~$22B annually.

Brief

Canada announced a national nuclear framework at the Canadian Nuclear Association conference on April 29, 2026, with a final strategy due by end‑2026, committing to full‑spectrum nuclear policy (new builds, exports, fuel‑cycle expansion, fission and fusion R&D). Measures include DND’s $40M microreactor assessment and $2.2B for Chalk River over 10 years; Canada already supplies ~24% of global uranium.

By @da_sails

92 datacenterdynamics.com 2026-04-24 3 min read

Open

Google announced eighth-generation TPUs (April 2026)

Why it matters

Google announced eighth-generation TPUs (April 2026): two purpose-built chips — one dedicated for training and one for inference — designed to accelerate large-model training and inference workloads.

Key details

Automaker Stellantis signed a five-year migration agreement with Microsoft Azure and intends to reduce its on-premises data center footprint by 60% as part of the cloud migration.
Anthropic is actively seeking data‑center leasing deals across Europe and Australia as it expands capacity, while OpenAI is stepping back from its Stargate Europe presence.
Vast Data closed a $1 billion Series F round at a $30 billion valuation, and DCD published an NTT guide on India data-center expansion amid rising regional hyperscale demand.

Brief

DCD's April 24, 2026 Cloud & Hybrid monthly newsletter spotlights major shifts in AI infrastructure and enterprise cloud strategy. Google unveiled eighth‑generation TPUs — two distinct, purpose‑built chips for training and inference — aimed at boosting performance for large model workloads. In enterprise moves, Stellantis signed a five‑year deal with Microsoft Azure to migrate workloads and target a 60% reduction in its data‑center footprint. Anthropic is scouting data‑centre leases in Europe and Australia as it scales capacity and OpenAI retreats from Stargate Europe. The issue also flags industry developments including Vast Data’s $1bn Series F at a $30bn valuation, Google breaking ground on a Kronstorf, Austria data centre, and an NTT guide mapping India’s rapid hyperscale expansion, underscoring ongoing geographic and vendor shifts in compute and real‑estate strategy.

By DCD Cloud & Hybrid Newsletter

93 wordpress.com 2026-05-04 2 min read

Open

How AI Is Changing the Network(s)

Why it matters

Om Malik (May 4, 2026) argues AI is 'supersizing' networks and the capital required for competitive advantage, reshaping how cloud, data center and network architectures are conceived.

Key details

Malik says the expected consumer-side surge (personal AI agents constantly querying the cloud) has been overhyped; the biggest impacts are concentrated in data centers, backbone/physical pipes and cloud infrastructure based on interviews with networking and infrastructure insiders.
He produced a 5,000-word overview titled "Say Hello to the Internet of AI," analyzing physical pipes, shifting demand profiles, and who owns the new internet.

Brief

Om Malik's May 4, 2026 essay (5,000-word), "Say Hello to the Internet of AI," argues AI is 'supersizing' network scale and the capital needed for competitive advantage. Drawing on interviews with networking and infrastructure insiders, he finds the major impacts are in data centers, cloud and backbone (physical pipes), shifting demand profiles and ownership models—more infrastructure-centric than consumer-bandwidth-driven.

By On my Om

94 datacenterdynamics.com 2026-05-08 1 min read

Open

Scale for triple-digit rack densities

Why it matters

DCD Compute, Storage & Networking Channel published 'Scale for triple-digit rack densities' on 2026-05-08, promoting a whitepaper titled 'The technology trends shaping the future of the data center' that targets triple‑digit rack densities and gigawatt‑scale expansion.

Key details

The whitepaper recommends treating infrastructure as a single 'unit of compute'—aligning power, thermal, and IT layers—to handle AI workloads' extreme compute density and overcome densification complexities.
The email is from Data Centre Dynamics Ltd (registered in England and Wales, reg. 06799556) and includes download/subscriber links to obtain the full whitepaper and guidance for next‑generation high‑performance infrastructure.

Brief

Scale for triple‑digit rack densities (DCD, 8 May 2026) promotes a whitepaper arguing AI workloads are pushing data centers toward triple‑digit rack densities and gigawatt‑scale expansion. It advises treating infrastructure as a single 'unit of compute' by aligning power, thermal and IT layers to manage extreme densification and enable next‑gen high‑performance deployments.

By DCD Compute, Storage & Networking Channel

95 cautiousoptimism.news 2026-04-21 8 min read

Open

The Cerebras IPO filing isn’t the same mess as before

Why it matters

Cerebras reported calendar 2025 revenue of $510.0M (up 76% YoY): hardware sales grew 69% to $358.4M and cloud & services grew 94% to $151.6M; cost of revenue rose 86% and gross margin fell to 39.0% from 42.3%.

Key details

Customer concentration improved but remains notable: G42 accounted for 85.0% of 2024 revenue and 24.0% of 2025 revenue; MBZUAI accounted for 62.0% of 2025 revenue and represented 77.9% of accounts receivable at 12/31/2025 (G42 was 91.0% AR at 12/31/2024).
Strategic commercial deals inked in 2026 materially de‑risk revenue: a January 2026 multi‑year OpenAI agreement to deploy 750 MW of Cerebras wafer‑scale systems (option to expand up to 2.0 GW), including a $1.0B term loan and tranched warrants; and a March 2026 AWS deal to deploy CS‑3 systems and co‑design an inference solution integrating AWS Trainium3 with Cerebras CS‑3.
Near breakeven unit economics before stock‑based comp: stripping $11.3M and $16.0M of SBC from the final two quarters of 2025 leaves the business close to breakeven; remaining performance obligations (RPOs) were $24.6B, with ~$3.69B recognized in 2026–27 and ~$10.59B in 2028–29.

Brief

Cerebras is pursuing an IPO after re‑filing with materially different dynamics: strong top‑line growth and strategic hyperscaler/lab deals have reduced its historical concentration risk. The company posted $510.0M in 2025 revenue (hardware $358.4M, cloud/services $151.6M), with sequential quarterly growth of +31% and +26% in the latest two quarters, though gross margin slipped to 39.0% as costs rose. Historically dominated by Abu Dhabi customers (G42/MBZUAI), Cerebras closed a January 2026 multi‑year agreement with OpenAI to deploy 750 MW (option to scale to 2.0 GW) that includes a $1B term loan and warrants, and a March 2026 AWS partnership to deploy CS‑3 systems integrated with Trainium3 for disaggregated inference. With $24.6B RPOs and near‑breakeven results excluding stock‑based comp, the IPO case rests on continued expansion of AI inference demand and adoption of purpose‑built wafer‑scale inference chips.

By Alex Wilhelm from Cautious Optimism

96 divenewsletter.com 2026-05-06 5 min read

Open

NERC issued a Level 3 alert on May 6, 2026, saying computational/data center…

Why it matters

NERC issued a Level 3 alert on May 6, 2026, saying computational/data center loads pose “immediate risks” to the grid and requiring certain participants to complete seven specified actions by Aug. 3, 2026.

Key details

TXU Energy’s Texas retail EV charging plan gives Ford drivers 15 hours per day of “free” home charging under the special program; Ford reports it shifted 515 MWh of vehicle charging to off‑peak periods in 2025.
Ann Arbor’s new supplemental municipal utility — the Ann Arbor Sustainable Energy Utility — has begun locally sited solar-plus-storage installations intended to boost local reliability and lower costs for subscribing customers.
American Electric Power is reviewing exiting PJM and SPP over slow generation interconnection processes; AEP’s utilities have contracts for 63 GW of new large load commitments by 2030.

Brief

Load Management weekly (Utility Dive, May 6, 2026) spotlights immediate operational and planning pressures on U.S. grids: NERC raised a Level 3 alert centered on rapidly growing computational/data center loads and ordered specific participants to complete seven actions by Aug. 3, 2026 to mitigate risk. Retail programs and local DERs are presented as practical load‑management responses — TXU Energy’s Texas EV plan provides Ford drivers 15 hours/day of “free” home charging and Ford reports shifting 515 MWh to off‑peak in 2025 — while Ann Arbor’s new Sustainable Energy Utility is deploying locally sited solar and batteries to improve reliability and lower subscriber costs. Meanwhile, American Electric Power is reconsidering membership in PJM and SPP amid slow interconnection timelines as its systems absorb contracted large loads totaling 63 GW by 2030. The newsletter also highlights calls (eg., Jigar Shah) to scale front‑of‑meter storage and strengthen cyber resilience as complementary strategies.

By UD: Load Management

97 substack.com 2026-03-30 11 min read

Open

Rising CPU Tide Is Not Likely to Lift ARM

Why it matters

At ARM's Analyst Day (Mar 24, 2026) the company positioned CPUs as central to an "agentic" AI data center, claiming CPU density would rise from ~30 million cores per GW today to ~120 million cores per GW (4x) and forecasting Data Center CPU TAM growth from $50B to >$100B by FY2031.

Key details

ARM launched its first commercial ARM AGI CPU (not yet shipping); presenter-specs show it is not competitive with AMD's Venice on published specs except possibly TDP, and ARM asserts up to 2x performance-per-watt vs x86 — a claim the author finds implausible without independent benchmarks (likely not until H2 2026).
ARM disclosed rack designs: an air-cooled rack with 60 CPUs (8,160 cores) and a liquid-cooled rack with 336 CPUs (45,696 cores). Meta and OpenAI shared the stage as announced customers, while other cited users include Cloudflare, SAP, and F5.
Financial projections are questioned: ARM projects royalty CAGR rising from 14% (FY22–26) to 20% (FY26–31), expects ~ $1B ARM AGI CPU revenue in FY28 and projects $15B of $25B revenue from AGI CPUs by FY2031. The author argues these lifts are unlikely given Qualcomm litigation, Apple’s low rates, a 2026 memory-driven device downturn, and competitive pressures.

Brief

ARM’s Mar 24, 2026 Analyst Day reframed the AI data center around CPUs, arguing the shift to "agentic" workloads will multiply CPU deployment (from ~30M to ~120M cores per gigawatt) and double the Data Center CPU TAM to >$100B by FY2031. Management unveiled a strategic move from IP licensing to Compute Subsystems and announced a commercial ARM AGI CPU (not yet shipping), two rack designs (air: 60 CPUs / 8,160 cores; liquid: 336 CPUs / 45,696 cores), and customer endorsements from Meta and OpenAI — which together catalyzed market moves in ARM, Intel and AMD shares after the event.

Beyond The Hype criticizes both product and financial narratives. The AGI CPU’s disclosed specs are described as pedestrian and not competitive with AMD Venice except possibly on TDP; ARM’s 2x perf-per-watt claim vs x86 is labeled implausible without third‑party benchmarks (expected H2 2026). The newsletter also disputes ARM’s revenue and royalty math: royalty CAGR is projected to accelerate from 14% to 20%, AGI CPU revenue is small until FY28 (~$1B), and management forecasts $15B of $25B coming from AGI CPUs by FY2031 — a mix the author says would compress margins, contradicts claimed >65% operating margins, and ignores hyperscaler preference for in‑house ASICs, Qualcomm legal setbacks, Apple’s low rates, a 2026 memory-driven device decline, and potential Ampere acquisition costs.

By Beyond The Hype from EnerTuition

98 cautiousoptimism.news 2026-04-30 8 min read

Open

Big Tech’s AI growth (mostly) impressed Wall Street

Why it matters

Alphabet Q1 2026: revenue $109.9B vs $107.1B expected and EPS $5.11 vs $2.62 expected; Google Cloud revenue $20.0B (63% YoY) with operating profit rising from $2.2B to $6.6B; Google Cloud backlog grew to over $460B; API demand rose to 16 billion tokens/min (from 10B in Q4 2025); Alphabet raised 2026 capex guidance to $180–195B and said Waymo now handles 500,000 paid rides weekly (investor reaction: stock up ~8%).

Key details

Microsoft FY26 Q3: revenue $82.9B vs $81.4B expected and EPS $4.27 vs $4.06; Azure grew ~40% and Microsoft Cloud revenue was $54B (up 29% YoY); Microsoft reports its AI business surpassed $37B ARR (up 123%), Microsoft 365 Copilot has >20M paid seats (seat additions +250%), GitHub Copilot is used by ~140,000 organizations; capex in quarter $31.9B and company expects roughly $190B calendar-year capex (investor reaction: neutral).
Policy/supply-chain note: two House Republicans are probing U.S. companies’ use of Chinese AI models — naming Anysphere (Cursor) which used Moonshot’s Kimi K2.5 and Airbnb which built on Alibaba’s Qwen — a potential constraint that could shift training to domestic labs and affect startup costs and inference pricing.
Market/operational signals: Both hyperscalers attribute rapid cloud growth to enterprise GenAI adoption (Alphabet said products built on its GenAI models grew nearly 800% YoY; Google reported many customers processing trillions of tokens; Microsoft cited capacity constraints and a 40% improvement in inference throughput for its most-used Copilot models), driving materially higher GPU/CPU capex and expanded data‑center buildouts.

Brief

Alphabet and Microsoft led Thursday’s earnings beat driven largely by enterprise AI demand: Alphabet reported Q1 2026 revenue of $109.9B (vs $107.1B expected) and EPS $5.11, with Google Cloud at $20.0B (63% YoY), operating profit rising to $6.6B, a cloud backlog >$460B, API traffic at 16 billion tokens/min, Gemma 4 at 50M downloads, and raised 2026 capex guidance to $180–195B; Waymo now does 500k paid rides/week. Microsoft posted $82.9B revenue and $4.27 EPS, Azure growth ~40%, Microsoft Cloud $54B (+29% YoY), an AI business >$37B ARR (+123%), >20M Microsoft 365 Copilot paid seats, GitHub Copilot in ~140k orgs, $31.9B quarter capex and ~ $190B calendar-year capex target. The newsletter also flags a congressional probe into U.S. firms using Chinese models (Anysphere/Cursor used Moonshot Kimi K2.5; Airbnb used Alibaba Qwen), a move that could favor domestic labs but raise costs and slow inference-price declines.

By Alex Wilhelm from Cautious Optimism

99 substack.com 2026-04-24 8 min read

Open

🍪 TWiC: TPU v8, Intel Rises Up, Marvell-Polariton

Why it matters

Google (announced at Google Cloud Next, Apr 2026) launched TPU v8 as two SKUs — v8t for training and v8i for inference — with the 8i featuring 3× more on‑chip SRAM than the training chip and higher HBM capacity; Arm Axion CPUs are used as head nodes.

Key details

Google changed datacenter networking: Virgo (using high‑radix Optical Circuit Switches) replaces Jupiter; scale-up for inference uses a Boardfly approach (4 TPU 8i per copper board, 8 boards per rack via AEC, 36 groups per pod via OCS) yielding 1,152 TPUs per pod, while training still relies on a 3D torus.
Intel reported AI‑driven business is ~60% of revenue and grew 40% year‑over‑year (per its earnings call); the company is seeing EMIB packaging backlogs, improving 18A yields, and hired Eric Demers earlier in 2026 as Chief GPU architect.
Marvell announced acquisition of Swiss startup Polariton Technologies to integrate plasmonic phase modulators: electro‑optic dielectric waveguides that use surface plasmon polaritons (SPPs) to enable very small modulators with low parasitic capacitance, higher modulation speed, and lower power.

Brief

TPU v8, Intel's comeback, and Marvell's Polariton acquisition are the focus of Vikram Sekar's Apr 24, 2026 TWiC. Google introduced two TPU v8 SKUs — v8t for training and v8i for inference — noting the 8i has substantially larger on‑chip SRAM (3× the training chip) and higher HBM capacity, and uses Arm Axion CPUs as head nodes. Google is also shifting networking: Virgo (OCS, high radix) replaces Jupiter for scale‑out, while a Boardfly approach connects 4 chips/board × 8 boards/rack × 36 groups/pod for 1,152 TPUs per pod; training retains a 3D torus. Intel’s AI business now represents ~60% of revenue with 40% YoY growth, plus EMIB packaging demand and improving 18A yields; Eric Demers was hired as Chief GPU architect in 2026. Marvell bought Polariton to deploy SPP‑based plasmonic phase shifters that promise lower capacitance, faster modulation, and reduced power for photonics.

By Vik's Newsletter