OpenAI 内部报告：DeepSeek 刷屏后的一年|工作流|大模型|openai|chatbot|deepseek

分享至

这份报告，目前是 OpenAI 内部论坛 OpenAI Forum 的第一条

OpenAI Forum 截图

本文观点，均来自 OpenAI Global Affairs（国际事务部）

具体来说，是专门做情报分析的组
OpenAI's Intelligence & Investigations Team

同时...这个内容，也留了公开链接，可以自行访问
https://openaiglobalaffairs.substack.com/p/deepseek-at-1

Take Aways

OpenAI 的核心判断

• 美国在模型能力上继续领先，尤其是科学和复杂推理
• 中国在部署广度和价格竞争力上取得实质进展
• 算力是中国的真正瓶颈

OpenAI 点名的中国公司

三家公司，在报告中被点名：DeepSeek、智谱、阿里
其中，重点提到了智谱，以及其上周港股 IPO 的事件，OpenAI 认为这是一个标志：

北京，愿意把模型公司，变成国家基础设施

2026 年的三个关键问题

• 美国模型能否扩大实际有用性的领先优势
• 中国能否建设足够算力
• 中国能否大规模部署

一个值得注意的判断

OpenAI 在文末说：

以上判断随时可能变化，甚至可能出现像 DeepSeek R1 首次发布那样的地震级冲击如果发生的话，可能在硬件侧

相互印证
路透社于25年12月发布「中国如何建立自己的曼哈顿计划，来对抗西方 AI 芯片」，OpenAI 援引了这份报道

紧随其后，智谱联合华为发布了 GLM-Image，从数据预处理到大规模预训练，全程跑在昇腾 Atlas 800T A2

挺有意思的

下面，让我们先看原文
然后，我会结合里面的链接，基于原始信息进行拆解

原文 The DeepSeek moment at 1

/ DeepSeek 刷屏后的一年

A year after DeepSeek-R1 jolted the AI race, the US/China outlook is complex and challenging to forecast due to many variables at play. Here's what we know today, subject to change – or even another seismic shock like DeepSeek's first release brought. (Which, if it happens, could be on the hardware side.)

DeepSeek-R1 发布一年后，中美 AI 竞争格局变得复杂，难以预测——变量太多。以下是目前掌握的情况，随时可能变化，甚至可能再次出现像 DeepSeek 首发那样的地震级冲击（如果发生，可能会在硬件侧）

The US continues to lead on model capabilities, and US models also have maintained a meaningful lead in science and more complex reasoning. But China has made real progress in deploying models widely and cheaply.

美国在模型能力上仍保持领先，在科学和复杂推理任务上也维持着明显优势。但中国在模型的广泛部署和低成本运营上取得了实质性进展

On benchmarks, Chinese systems made up ground quickly in 2024, but despite headlines following DeepSeek-R1's release a year ago, China's progress in 2025 was uneven as US export controls limited access to the computing power needed to train and run frontier models.

从基准测试看，中国系统在 2024 年快速追赶。但尽管 DeepSeek-R1 发布后引发大量报道，2025 年中国的进展并不均衡——美国出口管制限制了训练和运行前沿模型所需的算力

What has changed more decisively is depth and deployability: China now has a broad field of near-frontier models, many of them open-weight and aggressively priced, making them easier to deploy across industries and government systems.

真正发生明显变化的是深度和可部署性：中国现在拥有大量接近前沿水平的模型，其中许多是开源权重（Open-weight）且定价激进，更容易在各行业和政府系统中部署

Usage data reflects that shift. On OpenRouter, Chinese open-source models grew from 1.2% to nearly 30% of weekly activity at their 2025 peak. Inside China, state-backed firms such as Z.ai (formerly Zhipu AI) are rolling large models into public-sector workflows, and Z.ai's IPO last week in Hong Kong signals Beijing's willingness to turn model builders into national infrastructure.

使用数据反映了这一转变。在 OpenRouter 上，中国开源模型的周活跃占比从 1.2% 增长到 2025 年峰值时的近 30%。在中国国内，Z.ai（原智谱 AI）等国资背景公司正在将大模型接入公共部门工作流；Z.ai 上周在香港 IPO，表明北京愿意将模型厂商打造成国家基础设施

Yet Chinese constraints are real. Training delays, outages, and delayed next-generation releases point to a persistent bottleneck: compute. Demand for advanced US chips from Chinese AI companies remains high, underscoring how difficult it is to scale frontier models without reliable access to cutting-edge hardware. Meanwhile, export controls on US chips, and Chinese government views on imports of US chips continue to evolve.

但中国的瓶颈确实存在。训练延迟、服务中断、下一代模型发布推迟——都指向一个持续性问题：算力。中国 AI 公司对美国先进芯片的需求依然很高，说明没有可靠的尖端硬件供应，前沿模型的规模化极为困难。与此同时，美国对芯片的出口管制，以及中国政府对进口美国芯片的态度，都在持续演变

Beijing's strategy has solidified around three ideas: 1) build a domestic AI stack; 2) push it everywhere at home (including as a key pillar of the People's Liberation Army's military modernization); and 3) export it abroad. Open-source releases from companies like Alibaba, updated in weeks rather than months, are designed to make Chinese models the default for local developers. At the same time, China is packaging models, cloud infrastructure, and hardware for overseas partners, testing "sovereign AI" stacks from Southeast Asia to the Middle East.

北京的战略已围绕三个思路成型：

1. 构建国产 AI 技术栈；
2. 在国内全面推广（包括作为解放军军事现代化的关键支柱）；
3. 向海外输出。阿里巴巴等公司的开源模型更新周期已从数月缩短到数周，目的是让中国模型成为本土开发者的默认选择。

同时，中国正在将模型、云基础设施和硬件打包输出给海外合作伙伴，在东南亚到中东测试「主权 AI」技术栈

The real contest in 2026 will turn on three questions.

2026 年的真正竞争将围绕三个问题展开：

1. Can US models extend their lead in real-world usefulness? The 2026 "gap" question increasingly turns on which AI ecosystem converts models into economically meaningful work. Western platforms such as OpenAI's Codex still lead in deployed agentic coding and workflow tools, even as Chinese models continue closing the gap.
2. 美国模型能否扩大在实际应用中的领先优势？ 2026 年的「差距」问题越来越取决于哪个 AI 生态能将模型转化为有经济价值的工作。OpenAI Codex 等西方平台在已部署的智能体编程和工作流工具上仍保持领先，尽管中国模型持续缩小差距

2. Can China build enough compute? Export controls have historically limited China's access to advanced chips and memory as US hyperscalers have continued to expand data-center capacity. Scaling fast enough to close that gap remains a challenge. At the same time, the CCP is reportedly limiting the use of Western chips in new data centers and subsidizing Chinese chips, doubling down on self-reliance for a full AI stack.
3. 中国能否建设足够的算力？ 出口管制长期限制了中国获取先进芯片和存储器的渠道，而美国超大规模云厂商持续扩张数据中心容量。快速扩张以弥合差距仍是挑战。同时，据报道中国正在限制新建数据中心使用西方芯片，并补贴国产芯片，加倍押注全栈自主可控

3. Can China deploy at scale? Much of Beijing's AI+ agenda and related incentives aim to accelerate diffusion by pushing AI into priority sectors through procurement, standards, and subsidies. Results will hinge on whether AI+ drives productivity advances in manufacturing and other key sectors, while creating a product ecosystem comparable to that in the US. Internationally, China is bundling infrastructure, cloud platforms, models, and domestic hardware to foster reliance on their stacks. – OpenAI's Intelligence & Investigations Team
4. 中国能否实现规模化部署？ 北京「AI+」议程及相关激励措施的目标是通过采购、标准和补贴将 AI 推入优先行业，加速扩散。成效将取决于「AI+」能否推动制造业等关键领域的生产力提升，同时构建出与美国相当的产品生态。在国际上，中国正在捆绑基础设施、云平台、模型和国产硬件，培育对其技术栈的依赖

—— OpenAI 情报与调查团队

更深度的挖掘

下面这些内容，是我基于 OpenAI 所表达的，以及它提供的参考链接，挖掘而来

OpenAI 的判断

先说 OpenAI 承认的部分

中国在部署广度和价格竞争力上取得了实质进展。中国现在有一批接近前沿的模型，很多是开源的，定价激进，更容易在各行业和政府系统中部署

再说 OpenAI 坚持的部分

美国模型在科学和复杂推理上仍然保持领先。原文用的词是 meaningful lead，不是一点点

中间有一句话值得注意：

despite headlines following DeepSeek-R1's release a year ago, China's progress in 2025 was uneven

尽管去年 DeepSeek-R1 发布后标题党满天飞，但中国 2025 年的进展并不均匀

OpenAI 认为，算力是真正的瓶颈。训练延迟、服务中断、下一代模型发布推迟，都指向同一个问题：compute

数据支撑

Stanford AI Index 2025 的数据
2023 年底，中美模型在 MMLU 上的差距是 17.5 个百分点
2024 年底，这个差距缩小到 0.3 个百分点

Stanford AI Index 中美差距数据(2023-2024)

其他几个 benchmark 也在收窄：

• MMMU 从 13.5 降到 8.1
• MATH 从 24.3 降到 1.6
• HumanEval 从 31.6 降到 3.7

开源和闭源的差距同样在缩小。2024 年初，闭源模型在 Chatbot Arena 上领先开源 8.04%，到 2025 年 2 月只剩 1.70%

OpenRouter 的使用数据
中国开源模型从 1.2% 增长到近 30% 的周活跃度峰值
DeepSeek 是 token 量最大的贡献者，14.37 万亿 tokens

OpenRouter 数据

但细分场景要分开看
编程任务上，Anthropic 的 Claude 系列占了超过 60% 的份额。中国模型在这个高价值场景上，份额还不高
角色扮演是另一回事。超过一半的开源模型使用量都是角色扮演，这个场景对模型限制少，开源有天然优势

北京的战略

OpenAI 总结了北京的三层战略

第一层：建立国产 AI 栈
从芯片到模型的全链条自主。据报道，中国正在限制新建数据中心使用西方芯片，同时补贴国产芯片

第二层：国内全面推广
这就是「AI+」政策。2027 年智能终端和 AI agent 使用率要超过 70%，2030 年超过 90%
OpenAI 特别提到，AI 是解放军军事现代化的关键支柱
智谱上周港股 IPO 被单独拎出来说，OpenAI 的判断是：这标志着北京愿意把模型公司变成国家基础设施

第三层：出口海外
中国正在向海外合作伙伴打包输出模型、云基础设施和硬件，在东南亚和中东测试「主权 AI」方案
阿里等公司的开源模型更新周期从几个月压缩到几周，目标是让中国模型成为本地开发者的默认选择

2026 年的三个问题

OpenAI 认为 2026 年的竞争取决于三个问题

第一：美国模型能否扩大实际有用性的领先优势？
benchmark 差距在缩小，但关键是哪个 AI 生态系统能把模型转化成有经济价值的工作
OpenAI 的 Codex 在 agentic coding 和工作流工具上仍然领先。中国模型在追赶，但还没追上

第二：中国能否建设足够算力？
出口管制限制了高端芯片获取，美国超大规模数据中心产能还在扩张
但同时，中国在限制新数据中心使用西方芯片，加大国产芯片补贴，在全栈自主上加倍下注

第三：中国能否大规模部署？
AI+ 政策通过采购、标准和补贴推动 AI 进入优先行业
结果取决于 AI+ 能否在制造业等关键领域带来生产力提升，同时建立起可以和美国比肩的产品生态
在国际上，中国正在打包基础设施、云平台、模型和国产硬件，培养对中国技术栈的依赖

几点观察

OpenAI 这份报告，比预想中克制

没有一边倒唱衰中国，承认了部署和价格上的进展

报告也没有盲目乐观，指出了算力瓶颈和 2025 年进展不均匀的事实

智谱被点名两次：
一次是 IPO，一次是「国家基础设施」的定性

DeepSeek 被当作中国 AI 的标志性事件
整篇报告就是以 DeepSeek-R1 一周年为时间锚点

特别声明：以上内容(如有图片或视频亦包括在内)为自媒体平台“网易号”用户上传并发布，本平台仅提供信息存储服务。

Notice: The content above (including the pictures and videos if any) is uploaded and posted by a user of NetEase Hao, which is a social media platform and only provides information storage services.