Pairs with the Moat Anatomy Canvas — a ready-to-use strategy tool. Included with a subscription, or $1.99.

In fiscal year 2025, Nvidia sold $115.2 billion of data-center hardware - up 142% year-over-year.1 That pace kept accelerating: by Q3 of fiscal 2026 (the quarter ending October 2025), a single quarter hit $51.2 billion at a gross margin of 73.4%.3 Those are not the numbers of a company under siege. They are the numbers of a tollbooth on a road everyone suddenly needs. And yet the most-asked question in the chip industry is whether someone can finally break it. The honest answer is that the people trying to out-engineer Nvidia are losing - and the people who might actually win aren't competing on the chip at all.

The official story is that Nvidia's moat is CUDA, the software layer developers learned on, and that whoever cracks CUDA cracks the company. That's the comforting version, because software can in principle be replaced. The real moat is wider and uglier to copy: CUDA plus nearly two decades of accumulated libraries, the framework optimizations every AI lab quietly depends on, the NVLink interconnect that lets thousands of chips behave like one, priority on TSMC's scarce advanced packaging, and the high-bandwidth memory supply lines. Pull any single thread and the lock-in survives. You have to beat all of it at once.

Why beating Nvidia on the chip keeps failing

The standard challenger play is to ship a faster GPU and let benchmarks do the rest. AMD ran exactly that play with the MI300X, and the result is instructive. In real multi-node training - the workload that builds frontier models - the MI300X delivered under 30% of its theoretical compute, while Nvidia's parts reached roughly 40%, and the older H100 still outran it by 10 to 25%.6 The gap is not silicon. AMD's chip has the raw transistors. The gap is the surrounding software and interconnect stack that turns transistors into usable throughput at scale - the exact compound moat that CUDA-alone framing misses. Where AMD genuinely wins is narrower: memory-bound single-node inference, where the MI300X's 192GB of memory let it land within 2-3% of the H100 on a standard inference benchmark.6 That is a real beachhead. It is not the war.

Multi-node trainingMemory-bound inference
What it buildsFrontier models from scratchServing an existing model
MI300X vs NvidiaUnder 30% of theoretical FLOPS; trails H100 by 10-25%Within 2-3% of H100
What decides itInterconnect + software stackRaw memory capacity
Nvidia's gripStrongestMost contestable
Where the challenger actually competes

Notice the pattern. Nvidia is least beatable exactly where the margins are richest and the workload is hardest, and most contestable where the work is becoming a commodity. That is the seam the whole market is about to pry open.

The threat isn't a rival chip - it's the market splitting in two

Here is the thesis a smart friend should be able to repeat at dinner: Nvidia's dominance is structurally durable but not unbreakable, because the AI market is quietly bifurcating. Training - building the models - stays on Nvidia, where the compound moat is real and the margins are fat. Inference - running those models a billion times a day - is sliding toward purpose-built silicon, where the work is repetitive enough to bake into a custom chip. ASIC-based AI server shipments are projected to reach about 27.8% of the market in 2026, growing roughly 44.6% year-over-year against just 16.1% for merchant GPUs.7 Nvidia isn't being displaced. It risks being stranded in the high-margin, slower-growing half while the volume half walks off to someone else's silicon.

27.8%
of AI server shipments projected to be custom ASICs in 2026, growing ~44.6% a year - nearly triple the GPU growth rate. The volume is migrating, not the prestige7

And here's the twist most coverage gets wrong: the hyperscalers building these chips aren't really building them. Google's TPU, Meta's MTIA, Microsoft's Maia, and accelerator programs from OpenAI and Anthropic are all tied to Broadcom as a design partner - one partner sitting behind a wall of branded silicon.7 They all fab at TSMC, the same foundry that makes Nvidia's parts. So the war for AI compute is, underneath the logos, partly a fight over who captures the design margin on inference - and Nvidia is not guaranteed that seat.

The competitor Nvidia is forbidden to beat

The fastest crack in the moat wasn't made by an engineer. It was made by a regulator. On April 9, 2025, the U.S. government required a license to export even Nvidia's deliberately throttled China chip, the H20 - a Hopper-based part built down to the export thresholds - and confirmed days later that the requirement was indefinite.48 The cost showed up immediately in Nvidia's own filings: a $4.5 billion charge for stranded inventory and purchase obligations, plus $2.5 billion of H20 revenue it simply could not ship that quarter.2 Nvidia had even told Washington it was building the H20 back in October 2023; it still got cut off.4 The chip was legal, the customers were waiting, and the door closed anyway.

Nvidia incurred a $4.5 billion charge from H20 excess inventory and purchase obligations after the U.S. government required a license for H20 exports to China.2
Nvidia CorporationFrom its Q1 fiscal-2026 results (Form 8-K)

This is the part of the moat Nvidia cannot defend, because the threat isn't a better product - it's a captive market it is no longer allowed to serve. Every quarter the H20 stays grounded, Chinese buyers have no Nvidia option, and a domestic champion gets a guaranteed customer base to mature against. A moat protects you from rivals you can compete with. It does nothing against a government drawing a line on the map.

Isn't a 73% margin proof the moat is fine?

The strongest objection is that all of this is theory and the cash is fact. Nvidia just printed a record $51.2 billion data-center quarter at 73.4% gross margin;3 AMD's challenger trails on the hard workloads; Intel's Gaudi accelerator largely failed to tap into AI demand - missing even a $500M revenue target and never becoming a meaningful third player;9 and the OpenAI-AMD deal - a warrant for up to 160 million AMD shares tied to gigawatts of deployment - reads as much like a hedge as a defection.5 All true, and it's why the answer to 'can anyone break Nvidia?' is not 'yes, soon.' But margin is a lagging indicator. A bifurcation doesn't announce itself in the quarter it begins; it announces itself when the fastest-growing slice of demand - inference at scale - has already moved to silicon someone else designed, and the export-walled markets have already raised a rival you're banned from fighting. The numbers that look like invincibility today are exactly the numbers a stranded incumbent posts on the way to a smaller, richer corner of its own market.

A moat protects the product. It can't protect the market.

Nvidia's compound moat - software, interconnect, packaging, memory supply - is doing its job: no rival is out-engineering it on training, the workload that matters most. But two threats route around the moat entirely. The first is commoditization: when a workload becomes repetitive enough (inference), buyers stop paying for general-purpose excellence and bake the job into cheaper custom silicon, and no amount of CUDA lock-in stops that drift. The second is regulation: a moat assumes you're allowed to compete, and export controls quietly delete that assumption for an entire geography, handing a captive market to whoever's left. The lesson for any dominant firm: audit your moat against the two attacks it was never built to stop - your best customers no longer needing your best product, and someone with a pen deciding you can't sell to them at all.

So, can anyone break Nvidia's AI-chip dominance? Not by building a faster chip - the compound moat will keep eating those for years. The break, if it comes, looks nothing like a duel. It looks like the most lucrative, hardest work staying right where it is, while the high-volume work quietly leaves for custom silicon, and an entire country builds its own champion behind a wall Nvidia is forbidden to climb. The genius of Nvidia's position was standing in the one place every AI model has to pass through. The danger is that the world is busy building a second road - and drawing borders around the first.

Take it with you — The Moat Anatomy
Canvas

Moat Anatomy Canvas

A one-page canvas that dissects a moat instead of asserting it: where the advantage comes from, how much of the market it covers, how long it would take to copy, and what keeps it from eroding. Blank to dissect your own claimed edge; filled as the worked example tracing the structure of the story's defensible advantage. Use it to tell a real moat from a head start.

Blank template

Included with any subscription, or unlock this tool for $1.99. Get it → · See plans →

Sources

Where this comes from — the filings, records, and reporting behind it.

  1. 1
    Primary · SEC filingDocumented
    Nvidia's full-year fiscal 2025 data center revenue was a record $115.2 billion, up 142% year-over-year; Q4 FY2025 data center revenue was $35.6 billion, up 93% year-over-year.
  2. 2
    Primary · SEC filingDocumented
    Nvidia incurred a $4.5 billion charge in Q1 FY2026 from H20 excess inventory and purchase obligations after the U.S. government on April 9, 2025 required a license for H20 exports to China. Nvidia was additionally unable to ship $2.5 billion of H20 revenue that quarter.
  3. 3
    Primary · SEC filingDocumented
    Nvidia's Q3 FY2026 (ended October 26, 2025) reported record revenue of $57.0 billion and record data center revenue of $51.2 billion, up 66% year-over-year. Gross margins were 73.4% (GAAP) and 73.6% (non-GAAP).
  4. 4
    Primary · SEC filingDocumented
    The H20 export license requirement was confirmed by the U.S. government as applying indefinitely as of April 14, 2025. Nvidia disclosed this was not a normal recurring cost in an SEC correspondence filing, noting it had notified the USG of H20 manufacturing plans in October 2023.
  5. 5
    Primary · SEC filingDocumented
    AMD announced a partnership with OpenAI including a warrant for up to 160 million AMD shares tied to GPU deployment milestones up to 6 gigawatts; AMD's Q2 2025 8-K disclosed MI350 Series GPU ramp as the key H2 2025 growth driver, with ROCm 7 as the new open-source AI software stack.
  6. 6
    PublishedAttributed to source
    SemiAnalysis (December 2024) found MI300X achieves under 30% of theoretical FLOPS in real multi-node training versus Nvidia's ~40%, with H100 outperforming MI300X by 10-25% in multi-node training. MI300X's primary real-world advantage is memory-bound inference, where its 192GB HBM3 scored within 2-3% of H100 on MLPerf Inference Llama 2 70B.
  7. 7
    PublishedWidely reported
    ASIC-based AI server shipments are projected to reach 27.8% of the market in 2026, with custom ASIC shipments growing 44.6% year-over-year — nearly triple the 16.1% growth rate projected for merchant GPUs. Broadcom is the design partner behind Google's TPU, Meta's MTIA, Microsoft's Maia, and OpenAI/Anthropic's Titan accelerator programs.
  8. 8
    Primary · ArchivalDocumented
    U.S. Congressional Research Service confirmed BIS required Nvidia to apply for a license to sell its H20 GPU in China, and that BIS conditionally approved both the Nvidia H20 and AMD MI308 for export on a case-by-case basis. The H20 is built on Hopper architecture (2022) with reduced performance density to comply with BIS thresholds.
  9. 9
    PublishedWidely reported
    Intel largely failed to tap into soaring demand for AI accelerators; its Gaudi family missed a $500M revenue target and never became a meaningful third player in the market