TSMC A13 Skips High-NA EUV: 2nm-Class AI Chips Without $200M Upgrade

Abhishek GautamApril 24, 20265 min read

TSMC A13 Skips High-NA EUV: 2nm-Class AI Chips Without $200M Upgrade

Quick summary

TSMC announced A13 chip for 2029 and N2U node that extract 2nm-class density from existing EUV machines, skipping ASML High-NA systems. Multi-chip stacking scales to 10+20 by 2028.

What High-NA EUV Is and Why It Costs $200 Million

Standard EUV (Extreme Ultraviolet) lithography uses light at 13.5 nanometers wavelength to expose circuit patterns onto silicon wafers. ASML's current NXE:3600D machines — the ones in every advanced fab today — use a numerical aperture (NA) of 0.33. The light is focused tightly, but at 0.33 NA, the minimum feature sizes you can reliably print are approaching a physics limit.

High-NA EUV uses a 0.55 NA lens system that focuses the light more tightly, enabling smaller features. ASML's first High-NA machine, the NXE:5000, costs approximately $200 million per unit versus $170 million for the current generation. Each machine requires specialised infrastructure, takes 18-24 months to install and qualify, and needs retraining of fab personnel. At the scale of a leading-edge fab with 50+ tools, the capex difference between High-NA and current EUV represents billions of dollars.

Intel has committed to High-NA EUV for its 14A (14 angstrom) node — a significant part of why Intel's manufacturing costs are structurally higher than TSMC's. Samsung is evaluating High-NA for SF2 and beyond.

TSMC's A13 bet is that algorithmic improvements — better computational lithography, improved resist chemistry, multi-patterning optimisations — can extract High-NA-equivalent density from existing 0.33 NA tools. This is not a new claim in the abstract; TSMC has done this before (N3E extracted more performance from the N3 equipment base). The question is whether it works at the A13 node, where the physics are considerably tighter.

The A13 Node: What It Means for AI Chip Buyers

A13 targeting 2029 production means it is the process node for AI accelerators that will enter volume production in the 2029-2030 window. The chips designed on A13 are the ones Nvidia's GPU successor to Blackwell's successor will be fabbed on. Apple's A19 or A20 chip will be on A13. Google's TPU generation after TPU 8 will be on A13.

For AI buyers, the key implication is density: more transistors per die means either more compute in the same power envelope, lower power for the same compute, or a physically smaller die that costs less per wafer. All three outcomes are positive for AI inference cost curves in the 2029-2030 timeframe.

If TSMC succeeds with A13 on existing EUV equipment, it maintains its cost advantage over Intel 14A (which requires High-NA premium capex) and over Samsung (which is still resolving yield issues at N3). TSMC's volume pricing advantage compounds at every generation where it avoids a disruptive equipment transition.

N2U: The Affordable 2nm Variant

N2U is positioned between N2 (currently in production for Apple A18 and Nvidia GPU successors) and the full A13 node. Think of it as N2 with process optimisations that reduce per-wafer cost while maintaining most of N2's density and performance.

The commercial target: AI inference chips that need good density and efficiency but do not require the absolute leading edge. Cloud providers building custom inference chips (Google TPU 8i is already on N2-class; AWS Trainium, Microsoft Maia successors) are the primary N2U market. These chips run in volume serving environments where cost-per-inference matters more than peak performance.

N2U also enables more price-accessible consumer devices. The Apple A18 is on N3E; an A18e or similar cost-optimised variant for iPhone SE-class devices would be an N2U candidate. Qualcomm's mid-range Snapdragon X series, used in sub-$1,000 Windows laptops with on-device AI, is another target.

Multi-Chip Stacking: The Architecture Shift

The more forward-looking announcement in TSMC's April 24 briefing was its multi-chip stacking roadmap:

Current (TSMC CoWoS for Nvidia Vera Rubin): 2 large compute chips + 8 HBM memory stacks per package

By 2028: 10 large compute chips + 20 HBM memory stacks per package

This is not an incremental improvement — it is a 5x increase in compute chiplets and 2.5x increase in memory stacks within a single package. The 2028 configuration would put what today requires multiple boards worth of GPUs into a single package with dramatically lower inter-chip communication latency and power overhead.

For AI training: a single 2028-generation package could contain more compute than today's 8-GPU server node. Nvidia's next-generation architecture beyond Blackwell is almost certainly designed around this stacking capability.

For AI inference: the 20 HBM stacks address the memory capacity constraint that limits long-context inference. The TPU 8i's 288 GB HBM per chip (announced last week) becomes the floor, not the ceiling, if 20 HBM stacks per package become standard by 2028.

What Developers and Architects Should Track

For cloud cost planning: TSMC maintaining cost efficiency through A13 without High-NA transition means the per-FLOPS cost curve for cloud AI compute keeps dropping through 2029-2030. Budget AI infrastructure costs on a declining trajectory, not a flat one — the 2026-2028 constraint is HBM supply (covered in SK Hynix earnings analysis), not logic silicon.

For on-device AI: N2U's cost optimisation enables sub-$150 AI chips at 2nm-class density by 2028. Qualcomm, MediaTek, and Apple Silicon moves toward affordable on-device inference accelerators become more viable. The workloads currently running on cloud inference APIs will shift to device in the 2028-2030 window.

For Nvidia watch: Every TSMC multi-chip stacking advance is a Nvidia roadmap prerequisite. Vera Rubin (current generation) is already a 2+8 multi-chip design. The 10+20 design for 2028 maps directly to Nvidia's post-Vera Rubin architecture, tentatively called Feynman.

Key Takeaways

TSMC A13 targets 2029 production: achieves 2nm-class transistor density on existing 0.33 NA EUV machines, skipping ASML's High-NA EUV ($200M per tool); maintains cost advantage vs. Intel 14A and Samsung SF2
N2U node for affordable 2nm: optimised for cost-sensitive AI inference chips and consumer devices; bridges N2 and A13 for cloud custom silicon (AWS Trainium, Microsoft Maia successors) and mid-range mobile
Multi-chip stacking roadmap: current 2+8 configuration (2 compute + 8 HBM) scales to 10+20 by 2028; 5x compute chiplet increase in single package; addresses both training throughput and inference memory capacity
HBM constraint not solved here: A13 and N2U improve logic silicon density; the memory bottleneck (SK Hynix's 20%+ HBM shortage through 2030) is a separate supply chain problem that A13 does not address
2029 implications: Nvidia post-Blackwell successors, Apple A19/A20, and Google TPU 9th generation will be A13-fabbed; sets the compute density floor for the next AI generation

For the HBM memory constraint context, read SK Hynix $27B Profit: HBM Shortage Lasts Until 2030, AI Memory at Risk. For the competing US chip independence angle, read Elon Musk's TeraFab Uses Intel 14A: SpaceX Leads High-Volume AI Chip Production. For AI chip market context, read Google TPU 8t and 8i at Cloud Next 2026: The Inference War Starts Now.

FAQ

Frequently Asked Questions

What is TSMC's A13 chip and when will it be produced?

TSMC's A13 is its next-generation process node after N2 (currently in production), announced April 24, 2026, targeting production in 2029. A13 achieves 2nm-class transistor density without requiring ASML's new High-NA EUV machines (which cost approximately $200 million each), instead extracting further gains from existing 0.33 NA EUV equipment through algorithmic improvements and advanced patterning techniques. A13 chips will be used for Nvidia's post-Blackwell GPU generations, Apple A19/A20 processors, and next-generation AI accelerators from Google, AWS, and Microsoft.

Why is TSMC skipping High-NA EUV for the A13 node?

TSMC is betting it can achieve 2nm-class density improvements through better computational lithography, improved photoresist chemistry, and multi-patterning optimisations on its existing 0.33 NA EUV tools rather than paying the $200 million per unit premium for ASML's High-NA EUV systems. This decision preserves TSMC's cost advantage over Intel (which has committed to High-NA for Intel 14A) and Samsung (evaluating High-NA for SF2). If TSMC succeeds, it avoids billions of dollars in equipment capex per fab, maintaining lower per-wafer costs and higher margins on foundry services.

What is TSMC's multi-chip stacking roadmap and why does it matter for AI?

TSMC announced it will scale from today's 2 compute chips + 8 HBM stacks per package (used in Nvidia Vera Rubin) to 10 compute chips + 20 HBM stacks per single package by 2028. This 5x increase in compute chiplets within one package reduces inter-chip communication latency and power overhead vs. spreading compute across multiple boards. For AI training, a single 2028 package would contain compute equivalent to today's multi-GPU server nodes. For inference, 20 HBM stacks per package addresses the long-context memory capacity bottleneck — the limiting factor for large language model serving today.

What is the TSMC N2U node and who is it for?

N2U (N2 Ultra or N2 Upscaled) is a cost-optimised variant of TSMC's current N2 node that reduces per-wafer cost while maintaining most of N2's density and efficiency. It is positioned between N2 (Apple A18, Nvidia successors) and the future A13 node. Primary targets are cost-sensitive AI inference chips for cloud providers (AWS Trainium, Microsoft Maia, Google TPU inference variants) where cost-per-inference matters more than peak performance, and mid-range consumer devices like Qualcomm's Snapdragon X mid-tier and Apple's iPhone SE-class chips that need on-device AI acceleration without premium pricing.

Free Weekly Briefing

The AI & Dev Briefing

One honest email a week — what actually matters in AI and software engineering. No noise, no sponsored content. Read by developers across 30+ countries.

No spam. Unsubscribe anytime.