AI Is Not Free – And That Was Always the Plan

Chapter 14 of 18 Practitioner · 16 min

The narrative around AI is almost entirely about what it can do for you. The conversation about what it costs – and who designed that cost – is much quieter. This is the second conversation, end to end: the trillion-dollar bet being built above you, and the bill that lands on you – as an individual, as an enterprise, and as the startup that went 100% autonomous before anyone modelled the meter.

// the crux

AI's revenue model is consumption – per token, per seat, per API call – built on the largest capital bet in tech history. The meter is always running. Understanding what it costs is the difference between deploying a tool and being deployed by one.

// in one breath

The companies building AI have committed hundreds of billions to infrastructure – and the model paying it back is your consumption, metered like a utility.
Chips and power are the two real constraints: HBM memory is now ~63% of an AI chip's cost, and the industry is signing 20-year nuclear contracts to keep the meters running.
Then the bill itself, with receipts, at three levels – from my own ~€1,000/month stack up to the startup running on autonomous agents before anyone modelled the meter.

AI promises to make you more productive. That promise, broadly, is real. The question being obscured is: productive on whose infrastructure, at whose pricing, under whose model of consumption? The companies building AI are not charities. They are making the largest infrastructure bets in modern technological history – and they need a revenue model that justifies those bets.

That revenue model is you. And me. And every enterprise paying per seat, per token, per API call.

the scale of the bet

What Is Actually Being Built

In January 2025, the Stargate project was announced: a joint venture between OpenAI, SoftBank, Oracle, and MGX, with $500 billion committed over four years to AI infrastructure in the United States. Not research. Not product development. Infrastructure – data centres, compute clusters, power grids.

The opening deployment was $100 billion, beginning immediately in Texas. The message from the announcement was unambiguous: we are building the pipes through which intelligence will flow, the same way AT&T built telephone infrastructure and Cisco built the early internet backbone. The bet is that those pipes will be essential, permanent, and profitable.

Stargate Project

$500B

4-year commitment · AI infrastructure · OpenAI + SoftBank + Oracle · Jan 2025

Microsoft

$80B

Data centre capex 2025 · plus $13B invested in OpenAI partnership

Chips and Power – The Two Things the World Does Not Have Enough Of

Building AI infrastructure at this scale requires two inputs: specialised compute and electricity. Both are under severe constraint.

On compute: NVIDIA controls roughly 80% of the market for AI training and inference hardware. An H100 GPU – the workhorse of large model training – peaked at $25,000 to $40,000 per unit in 2023-2024. NVIDIA’s data centre revenue grew from $47 billion in FY2024 to $115 billion in FY2025. Demand has consistently outpaced supply. TSMC, which manufactures NVIDIA’s chips, is racing to add capacity in new fabs across Taiwan and Arizona. They are not building fast enough.

And inside each chip, the constraint has a name: memory. Epoch AI’s component cost analysis of the chips designed by NVIDIA, AMD, Google, and Amazon shows high-bandwidth memory (HBM) growing from 52% to 63% of total chip cost between early 2024 and late 2025, while every other component’s share shrank. In absolute terms, HBM spend across those four designers nearly tripled – from roughly $12 billion in 2024 to $32 billion in 2025. The bottleneck is already repricing the hyperscalers’ plans: Microsoft raised its 2026 capex outlook by $25 billion citing component costs, and Meta raised its range by $10 billion for the same reason. When one component of one part of the supply chain moves two of the world’s largest companies to revise guidance by tens of billions, that is not a procurement detail. That is the meter being built into the silicon.

On power: the International Energy Agency (IEA) projected in 2024 that data centre and AI electricity demand would double by 2026 – from roughly 400 terawatt-hours to over 800 TWh annually. For context: 800 TWh is approximately the annual electricity consumption of Japan. The industry’s response to this has been, unusually, nuclear:

Microsoft – Three Mile Island

20-year deal to restart Unit 1 of Three Mile Island nuclear plant. Operational by 2028. Entire output dedicated to Azure AI data centres.

Announced September 2024 · Constellation Energy

Google – Kairos Power

Contract for 7 small modular nuclear reactors (SMRs), delivering ~500 MW of power. First reactor by 2030, remainder by 2035.

Announced October 2024 · Kairos Power

Amazon – Nuclear Expansion

Multiple nuclear power purchase agreements for AWS data centres across the United States, including deals with X-energy and Dominion Energy.

Multiple announcements 2024 · Amazon Web Services

IEA Projection

AI and data centre power demand projected to double by 2026. Goldman Sachs: 160% increase in data centre power demand by 2030.

IEA Electricity 2024 report · Goldman Sachs Power Up report

When the companies building AI are signing 20-year nuclear power contracts, they are not speculating. They are committing to a consumption curve that requires energy infrastructure that doesn’t yet fully exist. That commitment is priced into what you pay per API call.

what the leaders are saying

The People Running This Are Not Subtle About the Roadmap

The strategic framing from the people at the top of this industry is worth reading carefully – not for what they say AI will do, but for what they say AI will cost and how it will be consumed.

Sam Altman CEO · OpenAI

“We are approaching a moment where many instances of AI will be able to do the work of a highly-skilled professional... Intelligence will become radically cheap and broadly available.”

From “The Intelligence Age” (September 2024). Altman’s consistent framing across multiple essays and interviews positions AI intelligence as infrastructure – like electricity or bandwidth – that becomes cheap at scale and is billed as a utility. The commercial model follows directly: consumption-based pricing, subscription tiers, API billing. OpenAI already bills per token. The direction is set.

Jensen Huang CEO · NVIDIA

“Every company in the world is going to be an AI company. The question is whether they are going to transition, whether they are going to be transitioned, or whether they are going to be left behind.”

Huang has said variants of this in investor presentations and interviews throughout 2023–2025. His documented argument: AI compute is the new electricity for industry – a capital investment as essential to competitiveness as any other. One note: a specific claim circulating that Huang said “if you earn $500K, you need to spend $250K on AI bills” has not been verified from a primary source. The concept behind it (proportional AI investment relative to productivity gain) reflects his documented positions, but the specific numbers should not be treated as a direct quote without finding the original interview or transcript.

Satya Nadella CEO · Microsoft

“We are turning the world’s most advanced AI models into a new computing platform that makes every individual and every organisation on the planet more productive.”

Microsoft’s commercial model is the most concrete of all: Microsoft 365 Copilot at $30/user/month, GitHub Copilot at $10–19/user/month, Azure OpenAI at consumption-based pricing. The product already exists, the pricing is already set. Nadella is not describing a future – he is describing a revenue line that appeared in Microsoft’s earnings reports from Q1 2024 onwards.

Sundar Pichai CEO · Google / Alphabet

“AI is the most profound technology we are working on today. More profound than fire or electricity or anything that we’ve done in the past.”

Google I/O 2023. At face value this is hyperbole – at structural value it is a framing statement that justifies unlimited investment in AI infrastructure. If AI is more important than electricity, then spending on AI infrastructure is never too much. Alphabet’s 2024 capex of $52 billion – up significantly from prior years – reflects that framing translated into capital allocation.

Dario Amodei CEO · Anthropic

“A country of geniuses in a datacenter.”

From “Machines of Loving Grace” (October 2024) – his description of what powerful AI amounts to: millions of expert-level instances running in parallel. Read it as a consumption statement, because that is what it is: a country of geniuses is rented, not bought, and the datacenter is the meter. Anthropic’s own published trajectory shows how fast this compounds – by May 2026 the company reported that over 80% of its production code is authored by Claude, and its public roadmap on recursive self-improvement states that AI systems building their successors “could come sooner than most institutions are prepared for.” In his January 2026 essay “The Adolescence of Technology”, Amodei is also the most explicit of any CEO on this list about the risks of his own roadmap – which makes the infrastructure corollary harder to dismiss: whoever rents the geniuses pays for the country.

the point

These statements are not made in isolation. They come from the CEOs of companies that have committed, collectively, hundreds of billions to AI infrastructure. The framing – that AI is essential, inevitable, and will be priced like a utility – is not a neutral market observation. It is the monetisation roadmap of the infrastructure they are building. Their return on investment depends on your consumption growing.

So that is the wager above you. Now the other half: what it costs to operate inside it. I will start with my own numbers, because I think the most honest version of this conversation is one where the person giving advice is transparent about what they actually spend.

personal cost

My Stack, My Bill

I track my AI spend carefully. Not because I resent it – I think the value is real – but because I believe the only honest relationship with any tool is knowing exactly what you are paying for it. Here is what my personal stack costs each month, as of early 2026:

// Personal AI Stack · Monthly · Munich 2026

Claude (Anthropic) Primary reasoning · architecture · writing · review · this website

~€200

Cursor Primary coding environment · AI-first IDE

~€20

Codex (OpenAI) Code review · additional model perspective

~€100

Gemini Advanced (Google) Images · video · multimodal tasks

~€22

Other tools Perplexity · specialised models · occasional usage

~€100

Monthly Total

~€1,000

That is roughly what a junior developer earns per month in some European markets, spent entirely on AI tool subscriptions. Most of my active usage – around 20 hours per week – happens outside working hours: personal projects, architecture thinking, writing, learning. I do this by choice, and the productivity return justifies it for where I am in my career. But I say that clearly: this is a real cost with a real payoff that requires honest accounting. It is not a hobby expense. It is an infrastructure investment in my own capability.

The question worth asking before you build a similar stack: what would this cost look like if the tools stopped working the way they do today? What if the model that underpins your main productivity tool gets rate-limited, deprecated, or repriced? These are not hypothetical risks. They are scheduled events in an industry that is still figuring out its sustainable pricing model.

enterprise cost

What the Enterprise Bill Actually Looks Like

At the enterprise level, the bill compounds differently. The individual subscription becomes a seat count. The occasional API call becomes a committed monthly consumption tier. And the procurement conversation that starts with "we're just adding Copilot" tends to end somewhere considerably more expensive.

Let me use GitHub Copilot as a specific example, because it is the most widely deployed AI developer tool in enterprise settings and its pricing structure is unusually instructive about how enterprise AI cost actually works. One honesty note before the table: the first three rows are GitHub's published list prices. The three extension bands below them are not public pricing – they are composite figures from enterprise rollouts I have seen, included because the base subscription is never where the bill ends:

GitHub Copilot tier	Monthly cost	What it covers
Individual	$10 / user	Basic completions, chat in IDE, standard models
Business	$19 / user	Policy controls, audit logs, IP indemnity, org management
Enterprise	$39 / user	Custom models, knowledge bases, fine-tuning, advanced agents
Extensions (Standard) *	$25–60 / user Per-user monthly quota on top of base subscription	Third-party integrations: Jira, Datadog, Sentry, etc.
Extensions (Premium) *	$100–500 / user Elevated access tiers for power users	Advanced agent capabilities, higher model access quotas
Extensions (Enterprise-grade) *	Up to $1,000 / user Requires Business Unit Director approvalApproval gate	Custom agent workflows, dedicated model capacity, full API access

* Composite figures, not list prices. Base tiers ($10 / $19 / $39) are GitHub's published pricing as of early 2026. The extension bands are illustrative composites of add-on AI tooling spend observed in enterprise deployments – the specific numbers will vary by organisation, vendor mix, and negotiated terms.

That approval gate at $500+ per user per month is not a product limitation. It is a governance control that enterprises are having to build in real time, because no one designed their procurement process for a world where a developer productivity tool could cost $12,000 per person per year before the base subscription. The control exists because someone, somewhere, approved $500/month per user across a large team and the quarterly invoice produced a conversation that nobody wanted to have.

// From a real enterprise conversation

The procurement conversation that starts with "we're just adding Copilot" tends to end somewhere else. The initial subscription is visible and approved. What follows is a series of smaller approvals – a specialist AI extension here, an API integration there, a premium agent tier for the senior developers – that individually seem reasonable and collectively exceed the original budget by a margin that surprises nobody in retrospect.

A mid-sized technology company deploying GitHub Copilot Business ($19/user) across 200 developers, adding Microsoft 365 Copilot ($30/user) for the broader organisation at 500 seats, plus Azure OpenAI for three internal integrations, is at $500,000 to $700,000 annually before any additional AI tool spending. Add data, marketing, and operations AI tools, and you are at $1M+ per year routinely. The line items were all reasonable. The total required a board conversation.

None of this criticises the tools themselves – the value is, in many cases, real. The point is how utility models expand. You do not decide once to spend X on electricity. You decide to add each appliance, and the bill follows behaviour. Enterprise AI billing is heading exactly this direction – and the companies designing these pricing models know it.

the startup experiment

What Happens When You Go All-In on Autonomous Agents

The most instructive cost story in AI right now is not the individual developer or the cautious enterprise. It is the startup that decided to take the pitch seriously and build with 100% AI development from day one.

This is a pattern happening at multiple companies, in different forms. The trajectory looks like this:

// A pattern, not a single company – composite of real accounts

"90% of the new code that serves 1M+ users is being written by AI. In October, AI was a code assistant. By December, we were directing 5 to 10 fully autonomous coding agents at a time. Engineering now comes down to four things: problem definition, system architecture, QA, and debugging."

This is a real description of a real team. The productivity numbers are real. The outcome – serving 1M+ users with a fraction of the traditional engineering headcount – is genuinely remarkable. The investors are satisfied. The metrics are impressive.

What is not in that description: the monthly AI bill. Running 5–10 autonomous coding agents simultaneously, at multi-step execution depth, with full RAG retrieval and MCP tool calls at each step, is an infrastructure cost that scales with every agent task, every context refresh, every tool call, every review cycle – a different category of spend from a $100/month tool subscription. At this level of usage, the monthly AI bill for a small engineering team routinely reaches $500,000 per month or more.

Investors are currently funding this. They are satisfied because the output metrics – features shipped, users served, engineering velocity – look extraordinary. The structural question being deferred: what happens to the AI bill when the application scales further and the agents start encountering the parts of the codebase that don't have clean bounded contexts, clear inputs and outputs, and predictable failure modes? The answer is that the agent execution depth increases, the retry rate goes up, the cost per feature climbs, and the productivity numbers that justified the model start to compress.

The caution here applies to operating any powerful tool without understanding its cost model at scale – AI is simply the current example. The startup that goes from "AI as a code assistant" to "10 autonomous agents in parallel" in two months has made a bet. The bet may pay off. But the cost structure of that bet is real, and the deferred question – what does this look like at ten times the current scale? – is the one that will eventually produce a difficult board conversation.

↳ see also · Chapter 5 – What Agents Actually Need – its agent requirements (bounded context, cost caps, failure definition) are exactly what prevent this runaway.

the quality problem

The Cost Nobody Puts on a Spreadsheet

The dollar cost of AI is the visible cost. There is a second cost that compounds alongside it, and it is less visible because it does not appear as a line item: the quality degradation that emerges when AI adoption outpaces the process design that should contain it.

// The Context and Scale Problem

What high-percentage AI adoption looks like six months in

Context is not shared. Each developer's AI session has different history, different loaded files, a different mental model of the system. There is no collective memory. The team's understanding of the codebase is distributed; the AI's understanding is fragmented per session.
Codebases exceed context windows. A large production system cannot be held in any model's context simultaneously. Every AI interaction is working with a partial view. The larger the system, the larger the blind spot – and the more the agent's confident output diverges from the system's actual state.
Hallucination rates multiply with complexity. A 2–4% incorrect suggestion rate that seems acceptable in isolation compounds across a team. Five developers each accepting one wrong suggestion per day is twenty incorrect pieces of code before end of sprint – many of which pass automated tests because the tests were also generated by AI, against the same incomplete context.
The review burden shifts but does not disappear. The senior engineer's role becomes less about writing code and more about detecting subtle errors in AI-generated code – a different skill, and in some ways a harder one, because the errors are less obvious and more architecturally coherent.

The solution to this is not less AI. It is process design that accounts for how AI actually works – its context limitations, its tendency to be confidently wrong, and its inability to hold shared state across sessions. The dollar cost of AI is knowable in advance. The process cost of deploying it wrong at scale is not – and it compounds in ways that a subscription renewal notice will not warn you about.

owned publicly

// I believe this

The meter is running for every individual developer, every engineering team, and every startup directing autonomous agents at a codebase. The question is not whether to pay – it is whether you know exactly what you are paying, why, and what the bill looks like at two times your current scale.

None of this is an argument against using AI. I use it constantly, and my honest assessment is that it is the most significant productivity amplifier I have worked with in more than two decades of professional engineering. The value is real.

What I am arguing for is intentionality. The companies building the infrastructure win when consumption grows – and growth is their goal, not optimisation of your bill. Your electricity provider does not nudge you toward energy efficiency. Your AI provider does not nudge you toward using fewer tokens. Budget for AI the way you budget for cloud infrastructure – with actual numbers, reviewed quarterly, with ceiling thresholds and an honest assessment of what the consumption is producing.

Understanding what AI costs is not pessimism about what AI can do. It is the difference between deploying a tool and being deployed by one. The infrastructure is real, the value is real, and the bill is also real – and it will keep growing until the process around the tool is designed to contain it.

// carry forward

The one question this chapter deliberately left open is where the capability curve goes next, and how fast. That deserves its own honest treatment – the forecasters who grade themselves, the one metric worth tracking, and the week it got personal. That is Chapter 15.