Pairs with the Moat Anatomy Canvas — a ready-to-use strategy tool. Included with a subscription, or $1.99.
In 2022, a chip company hired a single freelance developer to do something strange: build a way to run its rival's software on its own hardware, without anyone noticing. The company was AMD. The project was called ZLUDA — a drop-in layer that lets unmodified programs written for Nvidia's CUDA run on AMD GPUs. AMD funded it quietly, then cut it loose, and in February 2024 it surfaced as open-source.6 Think about what that admission contains. AMD makes excellent chips. And to compete in AI, it concluded the fastest path was to make its hardware pretend to be Nvidia's.
The story everyone tells about this race is that Nvidia builds faster chips. That AMD is the perennial runner-up with weaker silicon. Almost every word of that is wrong. AMD's silicon is good — good enough to win the world's most demanding contracts. The thing it cannot easily buy, build, or out-engineer is the twenty years of software that runs on top of Nvidia's chips. The chip was never the moat. The code was.
AMD already proved it can build the hardware
Start with the part the popular story gets backwards. In the data-center CPU market — the one Nvidia doesn't even play in — AMD's EPYC reached roughly 32–33% of the x86 server market exiting 2024, with analysts projecting it would push toward 38–39% by the end of 2025.8 That is not a distant also-ran; that is a third of the world's servers, taken from Intel one design cycle at a time. And on the GPU side, AMD's Instinct accelerators power El Capitan at Lawrence Livermore — a machine built entirely on AMD chips, both GPU and CPU, that held the world's #1 ranking on the TOP500 list from its debut in November 2024.9 When the U.S. government wanted the most powerful supercomputer it could build, it did not buy Nvidia. So when AMD says its hardware is competitive, the receipts exist.
The money is moving too. AMD's Data Center segment doubled in a year — from $6.5 billion in 2023 to a record $12.6 billion in 2024, up 94%45 — and CEO Lisa Su attributed more than $5 billion of that to Instinct GPUs alone.4 A company that could not build AI silicon does not generate that number. The hardware is not the problem. Which raises the obvious question: if AMD can match the chip, why is it still so far behind?
| AMD | Nvidia | |
|---|---|---|
| Server CPU share | ~32–33% of x86, exiting 2024 | Not in this market |
| World's #1 supercomputer | El Capitan — AMD GPU + CPU | — |
| Data-center GPU revenue | $5B+ Instinct (mgmt. claim, 2024) | Tens of billions per quarter |
| The software every model runs on | ROCm — maturing, must port to it | CUDA — ~20 years, the default |
The moat is the code, and the code compounds
CUDA is Nvidia's software platform for running general computation on its GPUs, and it has been accumulating for the better part of two decades. That length matters more than any single feature. Every AI researcher learned on it. Every deep-learning framework was first optimized for it. Every tutorial, every Stack Overflow answer, every library of hand-tuned kernels assumes it. The asset is not a piece of software — it is the gravitational field that twenty years of developers built around the software. A chip is a thing you buy once. CUDA is a thing your entire team already knows, and the cost of un-knowing it is enormous.
This is why a faster AMD chip does not automatically win a sale. The buyer is not choosing a piece of silicon; they are choosing whether to rewrite, re-validate, and re-tune the code their business already runs. The switching cost lives in the customer's own codebase, which means Nvidia doesn't have to defend it — the customer defends it for them, every time they decline to port. AMD's challenge isn't to be better. It's to be enough better to justify the migration. That is a far higher bar than a benchmark.
And the gravity keeps the chip business spinning. Nvidia's revenue went from $130.5 billion in fiscal 2025, up 114%, to $215.9 billion in fiscal 2026, up another 65%, with Data Center revenue alone hitting $62.3 billion in a single quarter.12 Each of those dollars funds more libraries, more developer tools, more reasons for the next engineer to learn CUDA and not ROCm. The moat doesn't sit still. It widens with every quarter the lead is spent.
How AMD is attacking a software lead with software
AMD understands this exactly, which is why its real campaign is not about transistors. ROCm — its open software stack — now ships PyTorch official packages as a first-class option, with upstream Linux kernel support, OpenAI Triton support, and Hugging Face integration, and AMD sits as a founding member of the PyTorch Foundation.7 That last point is the strategic one: if the framework most researchers actually write in supports your hardware natively, the developer never has to touch CUDA's lowest layers at all. AMD isn't trying to out-build CUDA feature-for-feature. It's trying to make the layer above CUDA — the one developers live in — vendor-neutral.
ZLUDA attacks from the other direction. Rather than asking developers to rewrite anything, it lets some unmodified CUDA binaries run directly on AMD GPUs.6 Pair that with AI-assisted code porting — tools that can now migrate a complete CUDA backend to ROCm without a single line written by hand10 — and the twenty-year head start starts to look less like a wall and more like a toll booth someone is quietly building a bypass around. The switching cost is exactly the thing both projects exist to drive toward zero.
“...helped our competitors build larger developer and customer ecosystems to challenge us worldwide.”3
That line is remarkable coming from the incumbent. Nvidia's own filing concedes that when policy locked it out of China, the absence handed rivals room to grow the one thing that actually matters — the ecosystem.3 The company knows precisely where its moat is, because it just watched a piece of it form on the other side.
Isn't a 'software moat' just a polite word for inevitable decline?
The fair objection is that switching costs always look unbreakable right up until they break — and the tools to break this one already exist. ZLUDA is open-source. PyTorch runs on AMD. AI can port code now. So isn't CUDA just the next dominant standard waiting for its disruption? Maybe. But notice the asymmetry. Every porting tool AMD ships is a layer of translation on top of an interface Nvidia controls and keeps moving. Building your competitive strategy on emulating a rival's API means you are forever one version behind, re-implementing their new features after they ship instead of inventing your own. ZLUDA's very existence is the tell: AMD couldn't beat the standard, so it tried to become compatible with it — and compatibility is a follower's position by definition.
The honest counter to Nvidia's confidence is subtler. Switching costs erode not when a rival matches them feature-for-feature, but when the layer customers actually touch moves somewhere neutral. If PyTorch and Triton become the real interface and CUDA becomes plumbing nobody sees, the moat doesn't fall — it gets abstracted away beneath a floor that no longer cares which chip is underneath. That is the genuine race. Not chip versus chip. Nvidia widening the lock-in through vertical integration, against an industry quietly agreeing to stop writing code that knows whose silicon it's running on.
When a customer's switching cost lives inside their own work — the code they wrote, the workflows they learned, the tools their team mastered — the incumbent gets a moat it never has to actively defend, because every customer defends it for free by choosing not to migrate. The strategic lesson cuts two ways. If you're the leader: your durable asset is rarely the product spec; it's the accumulated investment customers have made around your product, so spend your lead funding more of it. If you're the challenger: matching the product is necessary and nowhere near sufficient — you must either make switching nearly free (porting tools, compatibility layers) or move the fight up to a neutral layer the leader doesn't own. Just be honest that building on a rival's interface keeps you a step behind their roadmap, permanently.
AMD can build a chip that earns the world's top supercomputer ranking9 and still trail in AI, and that tells you everything about where the contest actually lives. Nvidia's genius was never only the GPU. It was getting an entire generation of engineers to write their work in a language it controlled, so that two decades later, even a rival with competitive silicon and a third of the server market has to begin by funding a project to speak that language back. The chips will keep trading benchmarks. The real war is over whose code the next model is written in — and the company that wins that war barely has to ship hardware at all.
Moat Anatomy Canvas
A one-page canvas that dissects a moat instead of asserting it: where the advantage comes from, how much of the market it covers, how long it would take to copy, and what keeps it from eroding. Blank to dissect your own claimed edge; filled as the worked example tracing the structure of the story's defensible advantage. Use it to tell a real moat from a head start.
Included with any subscription, or unlock this tool for $1.99. Get it → · See plans →
Sources
Where this comes from — the filings, records, and reporting behind it.
- 1Nvidia total revenue for fiscal year 2025 (ending Jan 26, 2025) was $130.5 billion, up 114% year-over-year; Data Center revenue was up 142% year-over-year.
- 2Nvidia fiscal year 2026 (ending Jan 25, 2026) record full-year revenue was $215.9 billion, up 65%; Q4 FY2026 Data Center revenue was $62.3 billion, up 75% year-over-year.
- 3Nvidia's FY2026 10-K states that export-control foreclosure from the China data-center market 'helped our competitors build larger developer and customer ecosystems to challenge us worldwide.'
- 4AMD full-year 2024 Data Center segment revenue was a record $12.6 billion, up 94% year-over-year, driven by both Instinct GPU shipments and EPYC CPU sales; AMD Instinct accelerator revenue exceeded $5 billion for 2024 (CEO Lisa Su attribution).
- 5AMD 2023 Data Center segment revenue was $6.5 billion, up 7% year-over-year, driven by Instinct GPU and 4th Gen EPYC CPU sales.
- 6AMD quietly funded the ZLUDA project — a drop-in CUDA binary-compatibility layer built on ROCm — contracting developer Andrzej Janik in 2022; the project was open-sourced in February 2024, allowing unmodified CUDA binaries to run on AMD GPUs without source-code changes.
- 7AMD's official ROCm product page documents that PyTorch official packages, upstream Linux Kernel support, MIOpen deep learning libraries, OpenAI Triton support, and Hugging Face integration are all part of the ROCm ecosystem; AMD is a founding member of the PyTorch Foundation.
- 8AMD EPYC server CPU market share reached 32–33% exiting 2024, with analysts projecting 38–39% by end of 2025; AMD's Instinct GPU revenue represented approximately 4–5% of the AI accelerator market in 2024.
- 9El Capitan at Lawrence Livermore National Laboratory is powered by AMD 4th Gen EPYC CPUs and AMD Instinct MI300A accelerators; it held the #1 spot on the TOP500 list from November 2024 through June 2026, when it was displaced by China's LineShine system.
- 10AI coding agents (e.g., Claude Code) have demonstrated the ability to port complete CUDA backends to AMD ROCm, with agentic tools handling straightforward kernels without manual rewriting, reducing the practical friction of switching GPU stacks.