TSMC 3nm Is Overloaded: Apple, Nvidia, Amazon Fighting for Wafers in 2026
Quick summary
TSMC 3nm node hit full capacity in March 2026 for the first time. Apple M5, Nvidia Blackwell, Amazon Trainium3, and Microsoft Maia2 all compete for the same fab slots.
Read next
- China's EUV Machine After the ASML Ban: How Close SMEE Actually GotChina's SMEE has a working DUV prototype and an active EUV program. If it succeeds, ASML export controls fail. Current status, timeline, and what it means for chip supply.
- Micron Sanand Comes Online: Inside India's First Commercial Chip Production and What It Means for AI HardwareMicron's Sanand ATMP facility in Gujarat has begun commercial production in early 2026, marking India's first modern semiconductor manufacturing output. This post unpacks what that actually means for AI ready chips, the India AI Mission, and global supply chains.
TSMC's 3nm node (N3E and N3P) hit full capacity utilization in March 2026 for the first time since the node entered mass production in late 2023. The constraint is not demand speculation — it's confirmed by multiple customers reporting allocation delays. Apple M5, Nvidia Blackwell GPUs, Amazon Trainium3, and Microsoft Maia 2 are all competing for the same fab slots.
What "3nm Full Capacity" Actually Means
TSMC's N3E process is the primary 3nm variant in volume production. N3P, with improved performance-per-watt, entered production in late 2024. When TSMC hits full utilization on a node, wafer starts are essentially fixed by reactor availability — you cannot add meaningful capacity in less than 18-24 months because adding a new cleanroom fab takes that long.
TSMC currently operates 3nm production at Fab 18 in Tainan across multiple phases. Each phase adds roughly 10,000 to 15,000 wafers per month capacity. Total 3nm capacity is estimated at 100,000-120,000 wafer starts per month across all phases as of Q1 2026. That capacity is now spoken for.
Who Is Taking Up All the Capacity
Apple is TSMC's largest customer by volume and the anchor tenant for 3nm. The M4 chip family — M4, M4 Pro, M4 Max, M4 Ultra — is manufactured on N3E. M5, expected for late 2026, is reportedly on N3P and already consuming allocation. Apple typically locks in 50-60% of a node's total wafer starts during its first two years of production.
Nvidia's Blackwell GPUs — specifically the B200 and the upcoming B300 — use TSMC CoWoS-L packaging with N4P compute dies and N3 memory controller dies. The memory controller die sits on the same node as Apple's M chips. As Nvidia scales Blackwell production to meet cloud provider demand from AWS, Azure, and Google, the 3nm allocation requirements increase.
Amazon Trainium3 and Microsoft Maia 2 are both custom AI accelerators designed on TSMC 3nm. Amazon committed to massive Trainium3 deployments for AWS after strong Trainium2 adoption by Amazon's own ML teams. Microsoft is manufacturing Maia 2 at TSMC after the first generation Maia was produced on TSMC 5nm.
The result is four major customers — Apple, Nvidia, Amazon, Microsoft — each with forecasted demand that exceeds prior quarters, converging on a node that cannot add capacity quickly.
CoWoS Is the Real Bottleneck
The less-discussed constraint is CoWoS (Chip-on-Wafer-on-Substrate) packaging, which TSMC provides alongside wafer production. AI accelerators like Blackwell, Trainium3, and Maia 2 require CoWoS-L to attach HBM memory stacks to compute dies. TSMC's CoWoS capacity has been constrained since 2023 and remains the binding constraint for AI chip production even when wafer capacity is available.
TSMC has been expanding CoWoS capacity aggressively — reportedly doubling it from 2024 to 2025, and planning another 50% increase in 2026. But each CoWoS tool takes 12-18 months to install and qualify. The expansion is happening; it's just slower than demand growth.
What This Means for Cloud GPU Availability
When TSMC can't produce more Blackwell dies, Nvidia can't ship more GPUs, and AWS, Azure, and Google Cloud can't add more B200 instances. The H100 production lines are on older nodes (N4P, 5nm family) and are not capacity-constrained, which is why H100 inventory is relatively stable. H200 production — also on an older node — is similarly unconstrained by 3nm.
The 3nm constraint primarily affects Blackwell GPU supply for the second half of 2026 and Trainium3 rollout at AWS. Developers who need Blackwell-class hardware for training runs in Q3-Q4 2026 are looking at limited availability and premium pricing.
TSMC 2nm Is Not a Near-Term Relief Valve
TSMC's 2nm node (N2) enters risk production in 2025 and volume production in late 2026 with Apple A20 as the first customer. N2 capacity in 2026 is small — perhaps 20,000-30,000 wafer starts per month initially, insufficient to absorb the overflow from N3. N2P, the improved variant, is a 2027 production target.
The practical timeline: 3nm constraints persist through at least mid-2027, when N2 reaches meaningful scale. Until then, AI accelerator supply — Blackwell and its successors — is structurally limited by TSMC's fab expansion pace.
Implications for the Hyperscaler Arms Race
Google, Amazon, and Microsoft are all designing custom silicon partly to reduce dependence on Nvidia and partly to get preferred TSMC allocation outside of Nvidia's purchasing power. That strategy requires them to be TSMC customers themselves, which means they're competing with Nvidia for the same constrained capacity. Custom silicon reduces Nvidia dependence but not TSMC dependence.
The only companies that escape this constraint are those designing chips on non-TSMC nodes — Samsung 3GAE (Gate-All-Around) or Intel 18A. Neither has proven production yields comparable to TSMC N3, which is why no major hyperscaler has committed volume AI production to those nodes.
Key Takeaways
- TSMC 3nm hit full capacity in March 2026 — first time N3E/N3P production is capacity-constrained since launch
- Four major customers competing for allocation: Apple M5 (N3P), Nvidia Blackwell (memory controller die), Amazon Trainium3, Microsoft Maia 2
- CoWoS packaging is the additional bottleneck — AI accelerators need CoWoS-L for HBM attachment, and TSMC CoWoS capacity is expanding slower than AI demand
- Blackwell GPU supply impact: limited B200 instance availability at AWS/Azure/GCP through H2 2026
- H100/H200 are not affected — those chips use older TSMC nodes (N4P, 5nm family) that are not at capacity
- TSMC 2nm won't help until 2027 — N2 volume production begins late 2026 with Apple A20, insufficient scale to offset N3 overflow
FAQ
Frequently Asked Questions
Why is TSMC 3nm at full capacity in 2026?
Apple M5, Nvidia Blackwell GPUs, Amazon Trainium3, and Microsoft Maia 2 are all manufactured on TSMC's N3E/N3P node and all scaled up simultaneously in early 2026. Combined demand from these four customers exceeds 3nm's production ceiling of approximately 100,000-120,000 wafer starts per month.
Does the TSMC 3nm shortage affect H100 or H200 GPUs?
No. H100 and H200 are manufactured on TSMC N4P (4nm family), not 3nm. Those nodes have available capacity. The 3nm constraint specifically affects Blackwell (B200, B300) GPUs and custom AI accelerators like Amazon Trainium3 and Microsoft Maia 2.
What is CoWoS and why is it also constrained?
CoWoS (Chip-on-Wafer-on-Substrate) is TSMC's advanced packaging technology that attaches HBM memory stacks directly to GPU compute dies. AI accelerators require it for high-bandwidth memory. TSMC's CoWoS capacity has been constrained since 2023 because packaging tools take 12-18 months to install and qualify, slower than the AI demand ramp.
When will TSMC 3nm capacity constraints ease?
Not before mid-2027. TSMC 2nm (N2) enters volume production in late 2026 with Apple as the first customer, but initial capacity is too small to absorb 3nm overflow. Meaningful capacity relief requires both N2 scale-up and new 3nm cleanroom phases, both with 18-24 month lead times.
How does the TSMC 3nm shortage affect cloud GPU availability for developers?
Directly. Blackwell GPU production is capped by 3nm and CoWoS capacity, which means AWS, Azure, and Google Cloud cannot scale B200 instances as fast as demand requires. Developers needing Blackwell-class compute for large training runs in Q3-Q4 2026 should join waitlists now — availability will be limited and pricing will reflect supply constraints.
Free Weekly Briefing
The AI & Dev Briefing
One honest email a week — what actually matters in AI and software engineering. No noise, no sponsored content. Read by developers across 30+ countries.
No spam. Unsubscribe anytime.
More on Semiconductors
All posts →China's EUV Machine After the ASML Ban: How Close SMEE Actually Got
China's SMEE has a working DUV prototype and an active EUV program. If it succeeds, ASML export controls fail. Current status, timeline, and what it means for chip supply.
Micron Sanand Comes Online: Inside India's First Commercial Chip Production and What It Means for AI Hardware
Micron's Sanand ATMP facility in Gujarat has begun commercial production in early 2026, marking India's first modern semiconductor manufacturing output. This post unpacks what that actually means for AI ready chips, the India AI Mission, and global supply chains.
The 2027 Tariff Cliff: How US China Chip Policy and Case by Case GPU Rules Should Shape Your AI Hardware Procurement
The United States now applies steep tariffs on Chinese electronics and is heading toward a June 2027 review cliff, while GPU exports are governed by case by case rules. This post explains what that mix means for AI hardware buyers and how to plan budgets in 2026.
Trump's 25% Chip Tariff on H200 and AMD MI325X: What Developers Pay in 2026
Trump's January 2026 Section 232 tariff put 25% on Nvidia H200 and AMD MI325X exports. US data centers are exempt. Here's what it costs developers outside the US.
Free Tool
Will AI replace your job?
4 questions. Get a personalised developer risk score based on your stack, role, and what you actually build day to day.
Check Your AI Risk Score →Written by
Software Engineer based in Delhi, India. Writes about AI models, semiconductor supply chains, and tech geopolitics — covering the intersection of infrastructure and global events. 882+ posts cited by ChatGPT, Perplexity, and Gemini. Read in 167 countries.
