The Model Is No Longer Important - Behind the Doubao 2.1 Release, ByteDance Is Fighting an Entry War

AI Doubao ByteDance Seed 2.1 Agent Entry War

发布于 2026-07-02 10:54:25 5 次浏览

The Model Is No Longer Important - Behind the Doubao 2.1 Release, ByteDance Is Fighting an Entry War

180 trillion.

That is the daily token call volume of the Doubao large model in June. A year ago, this number was 17 trillion, and at its initial launch two years ago, it was only 120 billion. 1,500 times in two years.

Then at ByteDance's FORCE conference on June 23, they released Seed 2.1. The hottest discussion at the event wasn't "what are the model benchmarks", but "TRAE is connected, Coze is connected, Doubao office mode is connected".

I paused.

What Is the Inflection Point

Tan Dai, President of Volcano Engine, repeatedly mentioned a term at the conference: inflection point.

The gist — model capabilities only truly enter production workflows after crossing a certain threshold. Below this point, you're playing with demos; above it, you deliver working drafts.

He said the first global model to cross the inflection point in Coding and Agent domains was Claude Opus 4.6.

(Well, he didn't mention 4.7. To be honest, 4.7's SWE-bench Pro jumped from 53.4% to 64.3%, which is indeed a leap, but Terminal-Bench 2.0 only reached 69.4%, still below GPT-5.4's 75.1%.)

Then he said Seed 2.1 Pro also crossed it.

Evidence? A few benchmarks are indeed convincing: SciCode surpasses Opus 4.7 and GPT-5.5, MCP-Atlas tool calling tests also surpass, Terminal Bench 2.1 is roughly on par. Plus the chip RTL case — 18 hours, 9 iterations, 1303 lines of code, passed the entire simulation, test, and synthesis inspection pipeline.

I admit these numbers are persuasive. But I care more about another signal.

1/4 Price, Not Buying a Model

Seed 2.1 Pro pricing: 6 RMB per million input tokens, 30 RMB per million output tokens, 1.2 RMB for cache hits.

The total usage cost of the Claude Opus 4.6 to 4.8 series is roughly 4 times that.

What does a 4x price difference mean?

It means that for an enterprise scenario consuming 1 billion tokens daily, the monthly cost with Opus could be in the millions of RMB, while with Seed 2.1 Pro it can be compressed to 200,000-300,000 RMB. The savings are not trivial — enough to hire a 3-5 person AI application team.

But price has never been a moat. DeepSeek V4 Pro has already permanently reduced API prices to 1/4 of the original, and MiniMax M3 followed with a 5-10% cut. By the first half of 2026, the large model market will have little left to bleed from price wars.

What really makes me think ByteDance's move is deep is that the same model base simultaneously connects to four product lines.

Four Entrances, One Engine

Let's map out Seed 2.1 Pro's "distribution network":

Entrance	Target Audience	What It Sells
Volcano Engine Ark API	Developers/Enterprises	Model calls
TRAE / TRAE WORK	AI programming users	Coding Agent
Doubao "Office Task" Mode	General office workers	PPT/Data analysis/PRD
Coze	Agent builders	Low-code orchestration

The same underlying model, four forms.

This reminds me of an old industry — telecom operators.

In the 3G era, China Mobile, China Unicom, and China Telecom competed on frequency bands and base station counts. In the 4G era, they competed on user numbers and plan prices. In the 5G era? They compete on how many operator apps you have on your phone — payments, video, cloud storage, IoT — each entrance collecting toll fees.

ByteDance is doing almost exactly the same.

The model is the base station, the API is the frequency band, and TRAE/Doubao/Coze are the operator's entrance apps. It doesn't matter which entrance you use — the underlying token consumption settles at ByteDance.

180 trillion daily tokens, 49.5% public cloud MaaS market share — this is not a coincidence, it's a pipeline effect.

Three Things Others Don't Have

Why can ByteDance fight the entry war while others cannot? I break down three key points.

First, C-side scale. Doubao's DAU exceeds 100 million, monthly active 157 million surpassing DeepSeek. This user base generates massive real interaction data, which feeds back into model training, creating a flywheel. No other model manufacturer has a C-side entry of this magnitude.

Second, developer toolchain. After TRAE upgraded from a pure AI coding IDE to an Agent workspace, it added Agent Mode for non-developers — product managers, operations, data analysts can all use it. This opens another door beyond the developer community. Claude Code is powerful, but Anthropic doesn't have a localized IDE like TRAE.

Third, Agent building platform. Coze is one of the largest Agent low-code platforms in China. Coze Space's retention rate jumps directly by 10 percentage points after a model upgrade — when the model gets stronger, the platform benefits immediately. This is a channel for "instant monetization of model improvements".

Three channels, three flywheels. Stronger model → more entrance users → users generate more data → data feeds back to the model.

Cold Water Time

But I have to mention a few uncomfortable facts.

Seed 2.1 Pro claims to have "crossed the inflection point", but the "inflection point" itself is a marketing concept. There is no universally recognized threshold standard. Tan Dai says 4.6 crossed it, and 2.1 crossed it too — believe it or not, it's up to you.

In the actual test, those beautiful cases — 3D houses, PPT, data analysis — I've tried similar tasks. Models can produce first drafts, but detail consistency is not reliable. Data charts don't match numbers, industry data in PPTs requires manual verification, and code is still engineering distance from production systems.

The Quantum Bit article itself honestly said: "The most suitable position is the first productivity assistant — first finish 70% of the rough work, then let humans make the final 30% decisions."

The 70/30 ratio might still be optimistic.

Also, the Turbo version claims "performance comparable to Pro, price halved", but the official announcement lacks detailed benchmark comparisons. If you're considering switching to Turbo, it's recommended to first run your own business benchmarks before deciding.

So, Who Wins the Entry War?

I don't know.

But I know one thing: when ByteDance, Alibaba, and Tencent all start competing for "AI entrances" instead of "AI models", the model itself is no longer the endgame.

Alibaba has the penetration path through e-commerce and DingTalk, Tencent has the embedding capability through WeChat and its social matrix, ByteDance has the distribution network of Doubao + TRAE + Coze. All three are stuffing models into high-frequency scenarios, making users open, call, and generate new feedback every day.

Model capabilities will be caught up, prices will flatten. What is truly hard to replicate is the entrance that users open every day.

So back to the title — the model is no longer important.

Not because it's not important, but because it's becoming like water, electricity, and gas. Nobody cares which water plant the tap water comes from, only whether water comes out when you turn the faucet.

Seed 2.1 Pro is just another faucet built by ByteDance. Is the water itself good? Not bad. Are the faucets numerous enough? Possibly the most in China.

As for whether you'll drink this water —

That depends on whether your business is willing to let ByteDance's pipeline pass by your doorstep.

The Model Is No Longer Important - Behind the Doubao 2.1 Release, ByteDance Is Fighting an Entry War

What Is the Inflection Point

1/4 Price, Not Buying a Model

Four Entrances, One Engine

Three Things Others Don't Have

Cold Water Time

So, Who Wins the Entry War?

评论