Is Nvidia's lead over AMD about hardware or software?

Mostly software. AMD is competitive on silicon — its Instinct GPUs power El Capitan, which debuted as the world's #1 supercomputer on the TOP500 list,[[cite:s9]] and its EPYC CPU holds roughly a third of the x86 server market. Nvidia's durable advantage is CUDA, a roughly two-decade software ecosystem that every model and framework is written against, which creates switching costs no single chip can overcome.

Does AMD have a way to run CUDA software on its own GPUs?

Partly. AMD quietly funded ZLUDA, a drop-in CUDA binary-compatibility layer built on its ROCm stack, and open-sourced it in February 2024 — letting some unmodified CUDA binaries run on AMD GPUs. AMD has also made PyTorch official packages a first-class part of ROCm. The compatibility is real but incomplete, and depending on a competitor's interface is a fragile place to build from.

Does Nvidia really have 90% of the AI-chip market?

Not of the whole market. The widely cited 80–90% figure is an analyst estimate for the merchant data-center GPU subset, and it excludes in-house silicon like Google's TPUs, AWS Trainium, and custom ASICs. No fully audited figure for Nvidia's share of the entire AI-accelerator market has been published; AMD's Instinct GPUs were estimated at roughly 4–5% of the AI accelerator market in 2024.

AMD Is Catching Nvidia on Silicon. It's the Software It Can't Catch.

Pairs with the Moat Anatomy Canvas — a ready-to-use strategy tool. Included with a subscription, or $1.99.

In 2022, a chip company hired a single freelance developer to do something strange: build a way to run its rival's software on its own hardware, without anyone noticing. The company was AMD. The project was called ZLUDA — a drop-in layer that lets unmodified programs written for Nvidia's CUDA run on AMD GPUs. AMD funded it quietly, then cut it loose, and in February 2024 it surfaced as open-source.⁶ Think about what that admission contains. AMD makes excellent chips. And to compete in AI, it concluded the fastest path was to make its hardware pretend to be Nvidia's.

The story everyone tells about this race is that Nvidia builds faster chips. That AMD is the perennial runner-up with weaker silicon. Almost every word of that is wrong. AMD's silicon is good — good enough to win the world's most demanding contracts. The thing it cannot easily buy, build, or out-engineer is the twenty years of software that runs on top of Nvidia's chips. The chip was never the moat. The code was.

AMD already proved it can build the hardware

Start with the part the popular story gets backwards. In the data-center CPU market — the one Nvidia doesn't even play in — AMD's EPYC reached roughly 32–33% of the x86 server market exiting 2024, with analysts projecting it would push toward 38–39% by the end of 2025.⁸ That is not a distant also-ran; that is a third of the world's servers, taken from Intel one design cycle at a time. And on the GPU side, AMD's Instinct accelerators power El Capitan at Lawrence Livermore — a machine built entirely on AMD chips, both GPU and CPU, that held the world's #1 ranking on the TOP500 list from its debut in November 2024.⁹ When the U.S. government wanted the most powerful supercomputer it could build, it did not buy Nvidia. So when AMD says its hardware is competitive, the receipts exist.

The money is moving too. AMD's Data Center segment doubled in a year — from $6.5 billion in 2023 to a record $12.6 billion in 2024, up 94%⁴⁵ — and CEO Lisa Su attributed more than $5 billion of that to Instinct GPUs alone.⁴ A company that could not build AI silicon does not generate that number. The hardware is not the problem. Which raises the obvious question: if AMD can match the chip, why is it still so far behind?

	AMD	Nvidia
Server CPU share	~32–33% of x86, exiting 2024	Not in this market
World's #1 supercomputer	El Capitan — AMD GPU + CPU	—
Data-center GPU revenue	$5B+ Instinct (mgmt. claim, 2024)	Tens of billions per quarter
The software every model runs on	ROCm — maturing, must port to it	CUDA — ~20 years, the default

Where AMD is strong, and where the gap actually sits

The moat is the code, and the code compounds

CUDA is Nvidia's software platform for running general computation on its GPUs, and it has been accumulating for the better part of two decades. That length matters more than any single feature. Every AI researcher learned on it. Every deep-learning framework was first optimized for it. Every tutorial, every Stack Overflow answer, every library of hand-tuned kernels assumes it. The asset is not a piece of software — it is the gravitational field that twenty years of developers built around the software. A chip is a thing you buy once. CUDA is a thing your entire team already knows, and the cost of un-knowing it is enormous.

This is why a faster AMD chip does not automatically win a sale. The buyer is not choosing a piece of silicon; they are choosing whether to rewrite, re-validate, and re-tune the code their business already runs. The switching cost lives in the customer's own codebase, which means Nvidia doesn't have to defend it — the customer defends it for them, every time they decline to port. AMD's challenge isn't to be better. It's to be enough better to justify the migration. That is a far higher bar than a benchmark.

$215.9B

Nvidia's FY2026 revenue, up 65% in a single year — and the durable part of it is not the chip, it's that the world's AI code is written against CUDA²

And the gravity keeps the chip business spinning. Nvidia's revenue went from $130.5 billion in fiscal 2025, up 114%, to $215.9 billion in fiscal 2026, up another 65%, with Data Center revenue alone hitting $62.3 billion in a single quarter.¹² Each of those dollars funds more libraries, more developer tools, more reasons for the next engineer to learn CUDA and not ROCm. The moat doesn't sit still. It widens with every quarter the lead is spent.

How AMD is attacking a software lead with software

AMD understands this exactly, which is why its real campaign is not about transistors. ROCm — its open software stack — now ships PyTorch official packages as a first-class option, with upstream Linux kernel support, OpenAI Triton support, and Hugging Face integration, and AMD sits as a founding member of the PyTorch Foundation.⁷ That last point is the strategic one: if the framework most researchers actually write in supports your hardware natively, the developer never has to touch CUDA's lowest layers at all. AMD isn't trying to out-build CUDA feature-for-feature. It's trying to make the layer above CUDA — the one developers live in — vendor-neutral.

ZLUDA attacks from the other direction. Rather than asking developers to rewrite anything, it lets some unmodified CUDA binaries run directly on AMD GPUs.⁶ Pair that with AI-assisted code porting — tools that can now migrate a complete CUDA backend to ROCm without a single line written by hand¹⁰ — and the twenty-year head start starts to look less like a wall and more like a toll booth someone is quietly building a bypass around. The switching cost is exactly the thing both projects exist to drive toward zero.

“...helped our competitors build larger developer and customer ecosystems to challenge us worldwide.”³

NVIDIA CorporationOn losing the China data-center market to export controls — from its FY2026 annual report (Form 10-K)

That line is remarkable coming from the incumbent. Nvidia's own filing concedes that when policy locked it out of China, the absence handed rivals room to grow the one thing that actually matters — the ecosystem.³ The company knows precisely where its moat is, because it just watched a piece of it form on the other side.

Isn't a 'software moat' just a polite word for inevitable decline?

The fair objection is that switching costs always look unbreakable right up until they break — and the tools to break this one already exist. ZLUDA is open-source. PyTorch runs on AMD. AI can port code now. So isn't CUDA just the next dominant standard waiting for its disruption? Maybe. But notice the asymmetry. Every porting tool AMD ships is a layer of translation on top of an interface Nvidia controls and keeps moving. Building your competitive strategy on emulating a rival's API means you are forever one version behind, re-implementing their new features after they ship instead of inventing your own. ZLUDA's very existence is the tell: AMD couldn't beat the standard, so it tried to become compatible with it — and compatibility is a follower's position by definition.

The honest counter to Nvidia's confidence is subtler. Switching costs erode not when a rival matches them feature-for-feature, but when the layer customers actually touch moves somewhere neutral. If PyTorch and Triton become the real interface and CUDA becomes plumbing nobody sees, the moat doesn't fall — it gets abstracted away beneath a floor that no longer cares which chip is underneath. That is the genuine race. Not chip versus chip. Nvidia widening the lock-in through vertical integration, against an industry quietly agreeing to stop writing code that knows whose silicon it's running on.

The moat is in the codebase, not the catalog

When a customer's switching cost lives inside their own work — the code they wrote, the workflows they learned, the tools their team mastered — the incumbent gets a moat it never has to actively defend, because every customer defends it for free by choosing not to migrate. The strategic lesson cuts two ways. If you're the leader: your durable asset is rarely the product spec; it's the accumulated investment customers have made around your product, so spend your lead funding more of it. If you're the challenger: matching the product is necessary and nowhere near sufficient — you must either make switching nearly free (porting tools, compatibility layers) or move the fight up to a neutral layer the leader doesn't own. Just be honest that building on a rival's interface keeps you a step behind their roadmap, permanently.

AMD can build a chip that earns the world's top supercomputer ranking⁹ and still trail in AI, and that tells you everything about where the contest actually lives. Nvidia's genius was never only the GPU. It was getting an entire generation of engineers to write their work in a language it controlled, so that two decades later, even a rival with competitive silicon and a third of the server market has to begin by funding a project to speak that language back. The chips will keep trading benchmarks. The real war is over whose code the next model is written in — and the company that wins that war barely has to ship hardware at all.

When the real moat hides one layer up

Visa's invisible toll

It lends nothing and risks nothing — the network is the moat.

Read →

The Bloomberg lock-in

A network of users nobody can afford to leave.

Read →

Amazon's flywheel

How a loop becomes an un-attackable moat.

Read →

Take it with you — The Moat Anatomy

Canvas

Moat Anatomy Canvas

A one-page canvas that dissects a moat instead of asserting it: where the advantage comes from, how much of the market it covers, how long it would take to copy, and what keeps it from eroding. Blank to dissect your own claimed edge; filled as the worked example tracing the structure of the story's defensible advantage. Use it to tell a real moat from a head start.

Blank template

Included with any subscription, or unlock this tool for $1.99. Get it → · See plans →

Sources

Where this comes from — the filings, records, and reporting behind it.

1
Primary · Company recordDocumented
Nvidia total revenue for fiscal year 2025 (ending Jan 26, 2025) was $130.5 billion, up 114% year-over-year; Data Center revenue was up 142% year-over-year.
NVIDIA Newsroom / Investor Relations, NVIDIA Announces Financial Results for Fourth Quarter and Fiscal 2025 ↗ · 2025-02-26
2
Primary · Company recordDocumented
Nvidia fiscal year 2026 (ending Jan 25, 2026) record full-year revenue was $215.9 billion, up 65%; Q4 FY2026 Data Center revenue was $62.3 billion, up 75% year-over-year.
NVIDIA Newsroom / Investor Relations, NVIDIA Announces Financial Results for Fourth Quarter and Fiscal 2026 ↗ · 2026-02-25
3
Primary · SEC filingDocumented
Nvidia's FY2026 10-K states that export-control foreclosure from the China data-center market 'helped our competitors build larger developer and customer ecosystems to challenge us worldwide.'
U.S. Securities and Exchange Commission / NVIDIA Corporation, NVIDIA Annual Report on Form 10-K for FY2026 (nvda-20260125.htm) ↗ · 2026
4
Primary · Company recordDocumented
AMD full-year 2024 Data Center segment revenue was a record $12.6 billion, up 94% year-over-year, driven by both Instinct GPU shipments and EPYC CPU sales; AMD Instinct accelerator revenue exceeded $5 billion for 2024 (CEO Lisa Su attribution).
Advanced Micro Devices, Inc. — Investor Relations, AMD Reports Fourth Quarter and Full Year 2024 Financial Results ↗ · 2025-02-04
5
Primary · SEC filingDocumented
AMD 2023 Data Center segment revenue was $6.5 billion, up 7% year-over-year, driven by Instinct GPU and 4th Gen EPYC CPU sales.
U.S. Securities and Exchange Commission / Advanced Micro Devices, Inc., AMD Form 8-K — Q4 and Full Year 2023 Financial Results ↗ · 2024-01-30
6
PublishedWidely reported
AMD quietly funded the ZLUDA project — a drop-in CUDA binary-compatibility layer built on ROCm — contracting developer Andrzej Janik in 2022; the project was open-sourced in February 2024, allowing unmodified CUDA binaries to run on AMD GPUs without source-code changes.
Phoronix, AMD Quietly Funded A Drop-In CUDA Implementation Built On ROCm: It's Now Open-Source ↗ · 2024-02-12
7
Primary · Company recordDocumented
AMD's official ROCm product page documents that PyTorch official packages, upstream Linux Kernel support, MIOpen deep learning libraries, OpenAI Triton support, and Hugging Face integration are all part of the ROCm ecosystem; AMD is a founding member of the PyTorch Foundation.
Advanced Micro Devices, Inc., AMD ROCm Software — Official Product Page ↗ · 2024
8
PublishedAttributed to source
AMD EPYC server CPU market share reached 32–33% exiting 2024, with analysts projecting 38–39% by end of 2025; AMD's Instinct GPU revenue represented approximately 4–5% of the AI accelerator market in 2024.
Creative Strategies, AMD Q424 Earnings – What's Going on With AMD? ↗ · 2025-02-06
9
Primary · ArchivalDocumented
El Capitan at Lawrence Livermore National Laboratory is powered by AMD 4th Gen EPYC CPUs and AMD Instinct MI300A accelerators; it held the #1 spot on the TOP500 list from November 2024 through June 2026, when it was displaced by China's LineShine system.
Lawrence Livermore National Laboratory, El Capitan retains title as world's fastest supercomputer in latest Top500 ↗ · 2025-11-17
10
PublishedWidely reported
AI coding agents (e.g., Claude Code) have demonstrated the ability to port complete CUDA backends to AMD ROCm, with agentic tools handling straightforward kernels without manual rewriting, reducing the practical friction of switching GPU stacks.
Techstrong.ai, Claude Code Ports NVIDIA CUDA to AMD ROCm in 30 Minutes ↗ · 2026-01-26

Keep going

The Loop That Ate Retail: How Amazon's Flywheel Turns Low Prices Into an Unstoppable Machine →The $380 Million Machine Only One Company on Earth Can Build →The $30,000 Habit Wall Street Can't Quit: Inside the Bloomberg Terminal's Lock-In →

AMD Is Catching Nvidia on Silicon. It's the Software It Can't Catch.

AMD already proved it can build the hardware

The moat is the code, and the code compounds

How AMD is attacking a software lead with software

Isn't a 'software moat' just a polite word for inevitable decline?

When the real moat hides one layer up

Moat Anatomy Canvas

Sources

Moat & Competition

Explore

Start here