"The Era of Scaling is Over": Ilya Sutskever's Interview Explained Simply

Abhishek Gautam · 7 min read

Quick summary

Ilya Sutskever said "the era of scaling is over" and it went viral. Here is what he actually meant, why it matters, what comes next in AI development, and whether he is right.

The Claim That Shook the AI Industry

In late 2025, Ilya Sutskever, co-founder of OpenAI, one of the key researchers behind the techniques that power ChatGPT, and founder of Safe Superintelligence Inc. (SSI), gave a long interview to podcaster Dwarkesh Patel. It went viral not because of drama, but because of one central claim:

"The era of scaling is over. We have returned to the era of research."

In a field where hundreds of billions of dollars are being invested based on the assumption that bigger = better, this was a remarkable thing to say. Here is what he actually meant.

First: What is the Scaling Hypothesis?

To understand Sutskever's claim, you need to understand what he is rejecting.

The scaling hypothesis is the idea that made modern AI possible. It states: if you make a neural network bigger (more parameters), train it on more data, and use more computing power, it will get better. Reliably. Predictably.

This sounds simple. The implications are enormous.

If scaling works, then AI progress is primarily an engineering and capital problem, not a science problem. You do not need new ideas. You need bigger clusters, more GPUs, more data, more money. The company with the most resources wins.

This hypothesis has been astonishingly accurate for roughly a decade:

  • GPT-2 was impressive. GPT-3 was much better. The main difference was scale: roughly 100x more parameters, trained on far more data.
  • GPT-4 was dramatically better than GPT-3. Again, primarily more scale.
  • Every major lab — OpenAI, Google DeepMind, Anthropic, Meta — has operated on this assumption. It is the foundation of the current AI investment cycle.

The scaling hypothesis is why Nvidia became a $3 trillion company. It is why data centres are being built at a rate not seen since the railroad era. It is the thesis underlying most major AI investments.
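To see what "reliably, predictably" means in practice: the empirical scaling-law papers behind this era (Kaplan et al. 2020; Hoffmann et al. 2022) found that loss falls as a smooth power law in model size and training data. Here is a minimal sketch of a Chinchilla-style scaling law, using roughly the constants Hoffmann et al. fitted; treat the numbers as illustrative, not authoritative:

```python
def predicted_loss(n_params: float, n_tokens: float) -> float:
    """Chinchilla-style loss estimate: L(N, D) = E + A/N^alpha + B/D^beta.

    Constants are approximately the Hoffmann et al. (2022) fit;
    they are here to show the shape of the curve, nothing more.
    """
    E, A, B, alpha, beta = 1.69, 406.4, 410.7, 0.34, 0.28
    return E + A / n_params**alpha + B / n_tokens**beta

# Loss keeps falling as scale grows, but each 10x jump buys less than the last.
# (20 tokens per parameter follows the rough Chinchilla-optimal rule.)
for n in [1e9, 1e10, 1e11, 1e12]:
    print(f"{n:.0e} params: predicted loss ~ {predicted_loss(n, 20 * n):.2f}")
```

Note the shape of the output: loss improves every time, but each 10x jump in scale buys a smaller absolute gain than the one before. That flattening is exactly what the next section is about.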

What Sutskever Is Saying Has Changed

Sutskever's claim is that the scaling hypothesis has run its course — not that it was wrong, but that it has been exhausted.

The low-hanging fruit of scaling has been picked. The gains from simply adding more parameters and data are diminishing: each new 10x jump in scale buys a smaller improvement than the last. The curve is flattening.

He puts it this way: current AI models have consumed virtually all the high-quality human-generated text on the internet. There is a hard limit to internet-scale training data. You cannot simply scale the data dimension further.

And while compute continues to improve (chips get faster, data centres get bigger), the returns on raw compute are also diminishing compared to the extraordinary gains seen between 2015 and 2023.

The era of predictable, capital-driven progress through scaling is over. What comes next requires new ideas.

What Are Those New Ideas?

Sutskever does not claim to have all the answers — which is partly why SSI exists. But he outlined several directions he believes are necessary:

Better Generalisation

Current models are surprisingly brittle outside their training distribution. A model trained on billions of examples can fail on variations it has not seen. Humans generalise from a small number of examples to many new situations. AI systems need learning mechanisms that generalise better, not just ones trained on more examples of the same thing.

Continual Learning

Current AI models are trained once, then frozen. They do not learn from new experiences after deployment. Humans learn continuously — every interaction updates their understanding. Building AI systems that can learn from ongoing experience without forgetting what they already know (a problem called catastrophic forgetting) is an open research problem.
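Sutskever does not prescribe a fix in the interview, but to make the problem concrete, one classic research approach is elastic weight consolidation (EWC, Kirkpatrick et al. 2017): when training on new data, penalise changes to the weights that mattered most for what was learned before. A minimal PyTorch-style sketch, assuming `fisher` (per-weight importance estimates) and `old_params` (a snapshot of the old weights) have already been computed:

```python
import torch

def ewc_penalty(model, fisher, old_params, lam=1000.0):
    """EWC penalty: discourage overwriting weights the old task relied on.

    fisher:     dict of per-parameter importance estimates (diagonal
                Fisher information), keyed by parameter name
    old_params: dict of weight snapshots taken after the old task
    lam:        how strongly old knowledge is protected
    """
    penalty = torch.zeros(())
    for name, p in model.named_parameters():
        # Quadratic cost for moving an important weight from its old value
        penalty = penalty + (fisher[name] * (p - old_params[name]) ** 2).sum()
    return (lam / 2.0) * penalty

# Training on a new task then becomes:
#   loss = new_task_loss + ewc_penalty(model, fisher, old_params)
# The model can still adapt, but pays a cost for forgetting.
```

This is only a partial answer (EWC degrades over long sequences of tasks), which is why continual learning remains an open problem rather than a solved one.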

Reasoning That Scales with Compute at Inference

Scaling during training is reaching limits. But Sutskever and others believe there is significant room to improve AI by giving models more compute at inference time, letting them "think longer" about hard problems rather than generating a response immediately. OpenAI's o1 and DeepSeek's R1 are early examples of this approach. This is a different kind of scaling: not bigger models, but smarter use of compute at the moment of answering.
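One simple, published form of inference-time scaling is self-consistency (Wang et al. 2022): sample several independent reasoning chains and take a majority vote over the final answers. To be clear, this is not how o1 or R1 work internally (those are trained to reason longer), but it illustrates the principle that spending more compute per question buys better answers. A sketch, where `generate` stands in for whatever model API you use:

```python
from collections import Counter

def answer_with_extra_compute(generate, prompt: str, n_samples: int = 16) -> str:
    """Spend more inference-time compute on one question: sample several
    independent reasoning chains, then majority-vote the final answers.

    `generate` is a placeholder; it should return the model's final
    answer as a string when sampled at nonzero temperature.
    """
    answers = [generate(prompt, temperature=0.8) for _ in range(n_samples)]
    # Doubling n_samples doubles the compute spent on this one question,
    # with no change to the model itself.
    return Counter(answers).most_common(1)[0][0]
```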

Something Like Intuition

This is the most philosophical part of Sutskever's argument. He suggests that human intelligence involves something like intuition — a form of pattern recognition that operates below conscious reasoning and enables fast, accurate judgement in complex situations. Current AI models lack this. They can reason explicitly when prompted, but do not have an equivalent to the fast, automatic recognition humans use for most decisions. Building this into AI systems, he argues, is necessary for genuine intelligence rather than sophisticated pattern matching.

Is He Right?

Sutskever's claim is contested — and the stakes are high enough that it is worth understanding both sides.

The case that he is right:

The largest models in 2025 are not dramatically better than the largest models in 2023 on many benchmarks, despite significantly more compute. The performance curves have visibly flattened for certain capabilities. Data scarcity is real — high-quality text data is not growing as fast as compute budgets. Several major labs have reportedly seen disappointing returns on their largest recent training runs.

The case that he is wrong (or too early):

Scaling sceptics have been wrong before. Previous claims that AI had hit a wall were followed by breakthroughs from scaling. Synthetic data, meaning AI-generated training data, may extend the data scaling curve. Multimodal models (image, audio, video) open new data dimensions. And the inference-time compute scaling approach (longer reasoning chains) may represent a new scaling frontier that has barely been explored.

The honest answer is: we do not know yet. Sutskever is making a prediction about where diminishing returns kick in, and that prediction may be right, premature, or wrong in ways we cannot see from the current vantage point.

What is not contested is that the easy part of scaling is over. The question is whether there is a harder version of scaling still to come, or whether genuinely new ideas are required.

Why This Matters Beyond the Research Debate

Sutskever's interview matters for reasons beyond the technical argument.

For AI companies: If scaling is over, the competitive advantages built on capital (who can spend the most on compute) weaken. A smaller, more creative lab can compete with a better idea. This is why SSI — small team, elite researchers, no commercial pressure — is a credible bet.

For investors: The thesis of "whoever spends the most wins" weakens if scaling has diminishing returns. Capital efficiency matters more. Research quality matters more. The investment calculus for AI chips, data centres, and large foundation model companies changes if Sutskever is right.

For developers: If the current generation of models is approaching its ceiling, the practical improvements in AI tools you use day-to-day will slow. The extraordinary pace of capability improvement between 2022 and 2024 may not continue at the same rate.

For everyone: The timeline to superintelligence — AI dramatically more capable than humans — extends if the scaling path is exhausted and new approaches require years of research. Or it could compress, if the new ideas Sutskever and others are working on unlock capabilities the scaling approach could not reach.

The Bottom Line

"The era of scaling is over" is a precise technical claim with large implications.

It means: the approach that drove a decade of AI breakthroughs — bigger, more data, more compute, reliably better results — has reached diminishing returns. Progress will continue, but the pace of improvement per dollar of compute has slowed, and continued advancement requires new ideas rather than more of the same.

It does not mean: AI is hitting a wall, progress will stop, or the current models are near their useful ceiling. Current models will continue improving. Inference-time scaling is a real frontier. New architectures will emerge.

Sutskever left OpenAI, raised $3 billion, and assembled one of the most talented research teams in the world to pursue those new ideas. Whether or not he is right about scaling being over, his bet is clear: the future of AI belongs to whoever figures out what comes next, not whoever builds the largest cluster.

For the full background on who Sutskever is and why he left OpenAI, read Who is Ilya Sutskever.
