substack.com

On Dwarkesh Patel's 2026 Podcast With Dario Amodei

2026-02-16 · 14:27 UTC ·Zvi Mowshowitz from Don't Worry About the Vase ·23 min read

Brief

Zvi Mowshowitz’s commentary on Dwarkesh Patel’s 2026 podcast with Dario Amodei presents a mixed picture of Anthropic’s worldview: extremely bullish on near-term model capability gains, somewhat cautious on capital allocation, and comparatively muted on alignment and existential risk. Amodei reportedly said the top-level scaling picture he has used since 2017 still holds, with seven core ingredients continuing to matter for raw capability, including compute, data, data quality/distribution, training length, and scalable objective functions. He maintained that coding progress is ahead of schedule and defended the idea that highly capable “geniuses in a data center” could emerge within a few years. At the same time, he acknowledged that verification remains a constraint and that economic diffusion is slower than raw model improvement. Zvi agrees diffusion is a real bottleneck, pushing back on Dwarkesh’s suggestion that “diffusion is cope,” and argues that enterprise adoption, procurement, and organizational change remain major friction points even if model capability is already strong.

The post is especially detailed on software engineering and AI business economics. Amodei said Anthropic has seen AI write most of its code internally and that current coding systems may yield 15%-20% speedups, versus around 5% six months before, with much larger gains expected once models can close more of the end-to-end loop. He also gave striking revenue figures for Anthropic: $100 million ARR in 2023, $1 billion in 2024, and roughly $9-$10 billion in 2025, plus several more billion in January 2026. Yet despite this growth, he argued that aggressively pre-buying all possible compute would be ruinous if demand disappoints; Zvi frames this as the core contradiction of frontier AI labs, where underbuying compute looks conservative ex ante but like underinvestment in hindsight. Amodei suggested profitability may arrive as early as 2026 or by 2028 not because Anthropic is optimizing for margins, but because overestimating demand is more dangerous than underestimating it when data-center commitments must be made in advance.

On policy and geopolitics, Amodei reiterated familiar positions: stronger export controls on advanced chips to China, greater transparency and security regulation, skepticism of naive “many AIs check each other” theories, and concern about offense-dominant AI scenarios and authoritarian misuse. He argued against both dumb state-level restrictions and a blanket 10-year moratorium on state AI laws absent a federal framework. Zvi finds some of this sensible but criticizes the interview for avoiding the hardest alignment questions and for downplaying catastrophic-risk framing relative to business and governance concerns. His meta-take is that the interview is most valuable not for brand-new claims, but for exposing a persistent tension: Amodei sounds highly confident that transformative systems are close, yet Anthropic still behaves like a company balancing ordinary commercial constraints, procurement realities, and survivable capital strategy.

Why it matters

Zvi Mowshowitz’s breakdown of Dwarkesh Patel’s February 2026 interview with Anthropic CEO Dario Amodei centers on AI capability timelines, coding automation, compute economics, and AI policy toward regulation and China.

Key details

Amodei largely reaffirmed his rapid-progress thesis: he said AI progress is tracking his long-running scaling model from 2017 within “plus or minus a year or two,” with coding advancing faster than expected, and put roughly 90% probability on achieving his “country of geniuses in a data center” within 10 years, possibly by 2028 or sooner.
On software engineering, Amodei said Anthropic has effectively seen AI write 90% of code internally and estimated current coding models deliver 15%-20% developer speedups, up from about 5% six months earlier; he also argued that going from 90% to 100% code generation matters disproportionately because it closes workflow bottlenecks.
The post highlights a tension between Anthropic’s aggressive rhetoric and more conservative capital deployment: Amodei described buying enough compute to fully match anticipated demand as potentially a “burn the boats” bet worth on the order of $5 trillion within two years, while still forecasting Anthropic compute growth above 3x annually.
Anthropic’s revenue growth, as cited by Amodei, was extraordinary: from $0 ARR to $100 million in 2023, $1 billion in 2024, and about $9-$10 billion in 2025, with “a few more billion” added in January 2026 alone; he suggested 10x annual growth may begin tapering during 2026.
On automation outside coding, Amodei pointed to OSWorld benchmark improvement from roughly 15% to 65%-70% over one year as evidence that “computer use” capabilities are climbing rapidly, arguing that many white-collar tasks can be automated as task-level competence improves even if full jobs are not yet end-to-end replaceable.

Cleaned source text

title: On Dwarkesh Patel's 2026 Podcast With Dario Amodei

author: Zvi Mowshowitz from Don't Worry About the Vase

content_type: newsletter

publication: substack.com

published: 2026-02-16T14:27:15+00:00

source_url: gmail://19c66d9a7de956d7

word_count: 5075

Some podcasts are self-recommending on the ‘yep, I’m going to be breaking this one down’ level.

͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏

Forwarded this email? Subscribe here for more

On Dwarkesh Patel's 2026 Podcast With Dario Amodei

Zvi Mowshowitz

Feb 16

READ IN APP

Some podcasts are self-recommending on the ‘yep, I’m going to be breaking this one down’ level. This was very clearly one of those. So here we go.

As usual for podcast posts, the baseline bullet points describe key points made, and then the nested statements are my commentary. Some points are dropped.

If I am quoting directly I use quote marks, otherwise assume paraphrases.

What are the main takeaways?

1. Dario mostly stands by his predictions of extremely rapid advances in AI capabilities, both in coding and in general, and in expecting the ‘geniuses in a data center’ to show up within a few years, possibly even this year.

2. Anthropic’s actions do not seem to fully reflect this optimism, but also when things are growing on a 10x per year exponential if you overextend you die, so being somewhat conservative with investment is necessary unless you are prepared to fully burn your boats.

3. Dario reiterated his stances on China, export controls, democracy, AI policy.

4. The interview downplayed catastrophic and existential risk, including relative to other risks, although it was mentioned and Dario remains concerned. There was essentially no talk about alignment at all. The dog did not bark in the nighttime.

5. Dwarkesh remains remarkably obsessed with continual learning.

Table of Contents

1. The Pace of Progress.

2. Continual Learning.

3. Does Not Compute.

4. Step Two.

5. The Quest For Sane Regulations.

6. Beating China.

The Pace of Progress

1. AI progress is going at roughly Dario’s expected pace plus or minus a year or two, except coding is going faster than expected. His top level model of scaling is the same as it was in 2017.

1. I don’t think this is a retcon, but he did previously update too aggressively on coding progress, or at least on coding diffusion.

2. Dario still believes the same seven things matter: Compute, data, data quality and distribution, length of training, an objective function that scales, and two things around normalization or conditioning.

1. I assume this is ‘matters for raw capability.’

3. Dwarkesh asks about Sutton’s perspective that we’ll get human-style learners. Dario says there’s an interesting puzzle there, but it probably doesn’t matter. LLMs are blank slates in ways humans aren’t. In-context learning will be in-between human short and long term learning. Dwarkesh asks then why all of this RL and building RL environments? Why not focus on learning on the fly?

1. Because the RL and giving it more data clearly works?

2. Whereas learning on the fly doesn’t work, even if it did what happens when the model resets every two months?

3. Dwarkesh has pushed on this many times and is doing so again.

4. Timeline time. Why does Dario think we are at ‘the end of the exponential’ rather than ten years away? Dario says his famous ‘country of genuines in a data center’ is 90% within 10 years without biting a bullet on faster. One concern is needing verification. Dwarkesh pushes that this means the models aren’t general, Dario says no we see plenty of generalization, but the world where we don’t get the geniuses is still a world where we can do all the verifiable things.

1. As always, notice the goalposts. Ten years from human-level AI is ‘long time.’

2. Dario is mostly right on generalization, in that you need verification to train in distribution but then things often work well (albeit less well) out of distribution.

3. The class of verifiable things is larger than one might think, if you include all necessary subcomponents of those tasks and then the combination of those subcomponents.

5. Dwarkesh challenges if you could automate an SWE without generalization outside verifiable domains, Dario says yes you can, you just can’t verify the whole company.

1. I’m 90% with Dario here.

6. What’s the metric of AI in SWE? Dario addresses his predictions of AI writing 90% of the lines of code in 3-6 months. He says it happened at Anthropic, and that ‘100% of today’s SWE tasks are done by the models,’ but that’s all not yet true overall, and says people were reading too much into the prediction.

1. The prediction was still clearly wrong.

2. A lot of that was Dario overestimating diffusion at this stage.

3. I do agree that the prediction was ‘less wrong,’ or more right, than those who predicted a lack of big things for AI coding, who thoughts things would not escalate quickly.

4. Dario could have reliably looked great if he’d made a less bold prediction. There’s rarely reputational alpha in going way beyond others. If everyone else says 5 years, and you think 3-6 months, you can say 2 years and then if it happens in 3-6 months you still look wicked smart. Whereas the super fast predictions don’t sound credible and can end up wrong. Predicting 3-6 months here only happens if you’re committed to a kind of epistemic honesty.

5. I agree with Dario that going from 90% of code to 100% of code written by AI is a big productivity unlock, Dario’s prediction on this has already been confirmed by events. This is standard Bottleneck Theory.

7. “Even when that happens, it doesn’t mean software engineers are out of a job. There are new higher-level things they can do, where they can manage. Then further down the spectrum, there’s 90% less demand for SWEs, which I think will happen but this is a spectrum.”

1. It would take quite a lot of improved productivity to reduce demand by 90%.

2. I’d go so far as to say that if we reduce SWE demand by 90%, then we have what one likes to call ‘much bigger problems.’

8. Anthropic went from zero ARR to $100 million in 2023, to $1 billion in 2024, to $9-$10 billion in 2025, and added a few more billion in January 2026. He guesses the 10x per year starts to level off some time in 2026, although he’s trying to speed it up further. Adoption is fast, but not infinitely fast.

1. Dario’s predictions on speed of automating coding were unique, in that all the revenue predictions for OpenAI and Anthropic have consistently come in too low, and I think the projections are intentional lowballs to ensure they beat the projections and because the normies would never believe the real number.

9. Dwarkesh pulls out the self-identified hot take that ‘diffusion is cope’ used to justify when models can’t do something. Hiring humans is much more of a hassle than onboarding an AI. Dario says you still have to do a lot of selling in several stages, the procurement processes are often shortcutted but still take time, and even geniuses in a datacenter will not be ‘infinitely’ compelling as a product.

1. I’ve basically never disagreed with a Dwarkesh take as much as I do here.

2. Yes, of course diffusion is a huge barrier.

3. The fact that if the humans knew to set things up, and how to set things up, that the cost of deployment and diffusion would be low? True, but completely irrelevant.

4. The main barrier to Claude Code is not that it’s hard to install, it’s that it’s hard to get people to take the plunge and install it, as Dario notes.

5. In practice, very obviously, even the best of us miss out on a lot of what LLMs can do for us, and most people barely scratch the surface at best.

6. A simple intuition pump: If diffusion is cope, what do you expect to happen if there was an ‘AI pause’ starting right now, and no new frontier models were ever created?

7. Dwarkesh sort of tries to backtrack on what he said as purely asserting that we’re not currently at AGI, but that’s an entirely different claim?

10. Dario says we’re not at AGI, and that if we did have a ‘country of geniuses in a datacenter’ then everyone would know this.

1. I think it’s possible that we might not know, in the sense that they might be sufficiently both capable and misaligned to disguise this fact, in which case we would be pretty much what we technically call ‘toast.’

2. I also think it is very possible in the future that an AI lab might get the geniuses and then disguise this fact from the rest of us, and not release the geniuses directly, for various reasons.

3. Barring those scenarios? Yes, we would know.

Continual Learning

It’s a Dwarkesh Patel AI podcast, so it’s time for continual learning in two senses.

11. Dwarkesh thinks Dario’s prediction for today, from three years ago, of “We should expect systems which, if you talk to them for the course of an hour, it’s hard to tell them apart from a generally well-educated human” was basically accurate. Dwarkesh however is spiritually unsatisfied because that system can’t automated large parts of white-collar work. Dario points out OSWorld scores are already at 65%-70% up from 15% a year ago, and computer use will improve.

1. I think it is very easy to tell, but I think the ‘spirit of the question’ is not so off, in the sense that on most topics I can have ‘at least as good’ a conversation with the LLM for an hour as with the well-educated human.

2. Can such a system automate large parts of white-collar work? Yes. Very obviously yes, if we think in terms of tasks rather than full jobs. If you gave us ten years (as an intuition pump) to adapt to existing systems, then I would predict a majority of current white-collar digital job tasks get automated.

3. The main current barrier to the next wave of practical task automation is that computer use is still not so good (as Dario says), but that will get fixed.

12. Dwarkesh asks about the job of video editor. He says they need six months of experience to understand the trade-offs and preferences and tastes necessary for the job and asks when AI systems will have that. Dario says the ‘country of geniuses in a datacenter’ can do that.

1. I bet that if you took Claude Opus 4.6 and Claude Code, and you gave it the same amount of human attention to improving its understanding of trade-offs, preferences and taste over six months that a new video editor would have, and a similar amount of time training video editing skills, that you could get this to the point where it could do most of the job tasks.

2. You’d have to be building up copious notes and understandings of the preferences and considerations, and you’d need for now some amount of continual human supervision and input, but yeah, sure, why not.

3. Except that by the time you were done you’d use Opus 5.1, but same idea.

13. Dwarkesh says he still has to have humans do various text-to-text tasks, and LLMs have proved unable to do them, for example on ‘identify what the best clips would be in this transcript’ they can only do a 7/10 job.

1. If you see the LLMs already doing a 7/10 job, the logical conclusion is that this will be 9/10 reasonably soon especially if you devote effort to it.