Anthropic is pulling its best model out of its own subscription
TL;DR [show]
Anthropic shipped Fable 5, its most powerful public model, then announced that on June 23 it leaves the Pro, Max, and Team plans and reverts to metered usage credits. The piece reads the access cliff as the symptom of a structural split: a pure-play lab earns only the token, so as frontier compute gets more expensive it has to push its best model up-market toward API and enterprise and away from flat-rate consumer subscriptions. The integrated platforms (Google, Microsoft, Apple) monetize you elsewhere and can give near-frontier away while grounding it in the data they already hold on you. An operator's read on why the company with the best model may lose the consumer, and why Gemini is the sharpest case.

I pay Anthropic for the Max plan, the top consumer tier, and for the next twelve days I have the best AI model in the world sitting inside it. On June 23 that stops. The model reverts to metered usage credits, billed by the token, and I have to decide whether to keep paying by the sip for the thing that was supposed to come with the subscription. I have already decided. I probably won't.
Watch the sequence, because the sequence is the story. On June 4 Anthropic published an essay warning that AI is approaching the point where it improves itself, that the technology is getting dangerous enough to be worth slowing down. Five days later it shipped Fable 5, the public version of Mythos, the model class it had previously called too dangerous to release at all. And folded into the launch was a quiet line about plans and capacity: enjoy the most powerful model we have ever sold, for two weeks, and then it goes behind a meter. Warn the world on Thursday. Ship the dangerous one on Tuesday. Bill your own subscribers for it the Tuesday after. There is a coherence to that week, but it is not the coherence the press release was going for.
The trade coverage read Fable as a flex, Anthropic putting its most capable system in public hands. The flex is real. The footnote is the actual news. A company shipped the best model it has ever made and, in the same breath, told the people already paying it every month that they cannot keep that model at the price they are paying. That is not a launch. That is a repricing, and the direction it prices in is away from me.
Anthropic frames the cliff as temporary. Capacity is tight, the model is expensive to serve, and they intend to fold it back into the subscriptions as soon as they can run it for everyone without the whole thing falling over. I believe them. It probably is temporary. But two things survive the caveat, and they are the two that matter. The first is that when you are capacity-constrained you reveal your priorities by who you cut, and Anthropic chose to cut its flat-rate consumer subscribers and keep serving the people who pay by the token through the API and the enterprise contracts. That is a priority order, stated under pressure, which is when priority orders are honest. The second is that switching is sticky in one direction. Tell your best-paying users to go metered or go without for two weeks, and some of them go without, and some of those try the alternative, and some of those do not come back. I am going to be in that last group, and I am not a marginal user. I am the user who pays for the top tier and reads the launch essays.
There is a version of this I would write off as a bad fortnight for a good company. The reason I am not is that the cliff is not a stumble. It is the cost structure talking, and the cost structure is not going to change its mind.
A pure-play lab makes its money on the token. That is the whole business. When I run a prompt, the only thing Anthropic gets for it is what it can charge for that prompt, and Fable is priced at ten dollars per million input tokens and fifty per million out, double the model it replaces. Every call has to clear margin on its own, because there is no second business underneath it making the call worth running at a loss. And the cost of the underlying token is going up, not down, because the frontier now runs on compute that the labs cannot get enough of. I wrote two weeks ago about Anthropic renting an entire data center from a direct rival because it could not get the megawatts any other way, and a few days before that about energy being the real constraint on the whole enterprise. This is the same story arriving at my invoice. When the input to your best product gets scarcer and dearer, and the only way you earn on that product is the per-unit price, you do the rational thing. You move it toward the customers who will pay the per-unit price without flinching, which means developers and enterprises, and you move it away from the customer who expected it bundled into a flat monthly fee, which means me. Fable going usage-credit-only is not Anthropic failing to love its consumers. It is Anthropic doing arithmetic.
Now stand the integrated platforms next to that, because the contrast is the whole point. Google does not make its money on the token either. It makes its money on the ad, the data, the Cloud contract that the model pulls through, the Android and Workspace seats it keeps sticky. For Google, a frontier-class model handed to a consumer is not a unit sold at a loss. It is a customer kept, a session logged, a reason to stay inside the tent where the actual money is made. So Google can do the thing Anthropic structurally cannot, which is give near-frontier intelligence away at the point of use and recover the cost somewhere the user never sees. The pure-play sells the token. The platform sells you, and gives you the token to keep you in the store.
And it gets worse for the pure-play, because the platform's model is not only cheaper to the user, it is better to the user in the one way that is hardest to copy. Gemini sits on top of Gmail, Search, Maps, Calendar, Android, Photos, Drive, the entire record of what you do all day. That context makes its answers more useful per token than a model that knows nothing about you until you paste it in, and Anthropic has no path to that context. It does not run your inbox. It is not in your phone. It can build the smartest model in the world and it will still be answering a stranger's questions, while Gemini answers yours. Cheaper because it is subsidized, and more personal because Google already has the data. That is the double advantage, and it is not a product gap a better benchmark closes.
This is not only a Google story, which is why it is a structural one and not a stock tip. Microsoft is running the same play from the other side, building agents directly into Windows and bundling small models into the operating system that most of the world already runs, so the personal assistant arrives as a feature of the machine you bought rather than a subscription you shop for. Apple is positioned to do it on the phone. The pattern is the same in every case: the company that monetizes you somewhere else can afford to make the frontier feel free, and the company whose only revenue is the frontier cannot. Gemini is just the cleanest example because Google's data on the average person is the widest, and the value of a personal agent is mostly the personal part.
So here is where it lands, and it is close to the opposite of the headline. The consensus read on Fable is that Anthropic is winning, because Anthropic shipped the most powerful public model and powerful models are how you win. The operator read is that the more powerful and more expensive that model gets, the more Anthropic is forced to hand the consumer to whoever is willing to lose money on the token. Quality going up does not pull the consumer toward the best model. It pushes the best model toward the people who pay per call, and leaves the everyday user to the platforms that treat the model as bait. The frontier stops being a thing you buy. You either build on it through the API and pay by the token, or you use it free inside a platform that is monetizing your attention and your data, in which case you are not the customer. You are the payment.
I will keep spending money on AI every month. That part is not in question. What this fortnight clarified is who I will probably be spending it with, and it is not the company that makes the best model. It is the company that already has my email.
—TJ