What happened
The AI industry faces a critical computing capacity shortage, shifting from frictionless scaling to allocation. Major AI companies ration access; Anthropic implements tighter usage limits on Claude during peak demand. Agentic systems, consuming significantly more compute, accelerate this strain. A global shortage of high-performance GPUs, largely from Nvidia, drives rising costs and longer procurement cycles. CoreWeave, an AI infrastructure provider, raised prices and moved to longer-term contracts as demand outstrips supply. Physical infrastructure, including data centres and power grids, also faces strain, with electricity and interconnection delays becoming binding constraints.
Why it matters
AI product development and deployment now face hard limits, shifting from innovation-led growth to compute-constrained allocation. Procurement teams encounter rising costs and extended lead times for high-performance GPUs, impacting project timelines and budgets. Platform engineers must contend with tighter usage limits on frontier models and increased variability in AI API service reliability. Founders building AI-first products face higher infrastructure costs and reduced on-demand compute availability, potentially delaying product scaling. This constraint on physical infrastructure aligns with recent reports detailing significant delays and cancellations in planned US data centre capacity.
Subscribe for Weekly Updates
Stay ahead with our weekly AI and tech briefings, delivered every Tuesday.




