No body text on file.
Open the original to read the full piece.
Tae Kim argues (Key Context, published 2026-03-30) that short‑term macro fears — from the Iran war to “peak AI capex” talk — obscure a structural pivot: inference is now the dominant workload. Citing Bill Dally and on‑the‑ground GTC conversations with Meta/Google/Nvidia engineers, Kim reports ~90% of data‑center power is going to inference and hyperscalers’ planned “hundreds of billions” in capex still can’t meet demand. Agentic AI (notably coding assistants) multiplies token demand roughly 15× per Arm CEO René Haas and has driven CPU orchestration needs from ~30M to ~120M cores per GW; Intel is locking multi‑year supply deals. Nvidia is central — networking sales +263% YoY, Groq acquisition for ultra‑low‑latency inference, and Vera Rubin GPUs slated for 2026 — while model improvements (Anthropic “Mythos,” larger context windows, multimodal data) and Amazon’s GPU rearchitected ads point to sustained, large‑scale demand (U.S. knowledge work ≈ $6.2T).
Tae Kim (Key Context) wrote on 2026-03-30 that current market angst (Iran war, “peak AI capex”) echoes last year’s tariff/DeepSeek panic and is likely short‑lived as AI inference demand surges.
Open the original to read the full piece.