Daily /2026-06-07 / Weekly AI Roundup: Claude Limits Doubled, SpaceX IPO, Microsoft Model Data Contradiction

Weekly AI Roundup: Claude Limits Doubled, SpaceX IPO, Microsoft Model Data Contradiction

Source mp.weixin.qq.com Glean’d 2026-06-07 13:12 Read 7 min

AI summary

A roundup of 10 major AI and tech news items from the first week of June 2026. MiniMax M3 was released, beating GPT-5.5 on coding benchmarks at $0.6/M tokens, though independent verification is pending. DeepSeek raised ~$7.4B in its first external funding round, while Unitree completed its IPO review in a record 73 days. Kimi Work, Coze 3.0, and Qwen3.7-Plus all launched new Agent capabilities. Doubao announced subscription plans. ChatGPT surpassed 1 billion monthly active users. Anthropic doubled Claude Cowork's usage limits, secretly filed for an IPO, and published a report stating Claude writes 80% of its own code. NVIDIA unveiled the ARM-based RTX Spark at Computex. SpaceX is set to IPO on June 12, with Google disclosed paying $920M/month for compute. Microsoft's MAI-Thinking-1 faced backlash after its claimed 'clean data' was revealed to include Common Crawl, and GitHub Copilot's switch to metered billing caused developer bills to spike.

Original · 7 min

mp.weixin.qq.com ↗

§ 1

SWE-bench Pro 59%, input $0.6/million tokens. On June 1st, MiniMax released its flagship model, MiniMax M3, claiming it is the first domestically produced open-source model to integrate cutting-edge programming, million-token context, and native multimodal capabilities into one. Architecturally, it introduces MiniMax Sparse Attention (MSA), compressing per-token computation in million-token contexts to 1/20th of the previous generation, with prefill speeds over 9x faster and decoding speeds over 15x faster. The model natively supports text, image, and video input, and can directly control desktops to complete automated tasks.

MiniMax M3 在编程基准 SWE-bench Pro 上得分为 59%，输入价格仅为 0.6 美元/百万 token。6 月 1 日，MiniMax 发布了旗舰模型 MiniMax M3，并宣称这是首个将前沿编程能力、百万 token 上下文以及原生多模态三项能力集于一身的国产开源模型。在架构上，模型引入了 MSA（MiniMax Sparse Attention，MiniMax 稀疏注意力机制），将百万上下文下每个 token 的计算量压缩到上一代的二十分之一，预填充速度提升了 9 倍以上，解码速度更是提升了 15 倍以上。模型原生支持文本、图片和视频输入，并且能直接操控电脑桌面来完成自动化任务。

§ 2

In programming benchmarks, MiniMax M3's SWE-bench Pro score of 59% slightly exceeds GPT-5.5's 58.6%, and its 83.5 on the BrowseComp Agent benchmark surpasses Claude Opus 4.7's 79.3. However, a significant gap remains compared to Claude Opus 4.8's SWE-bench Pro score of 69.2%. Output pricing is $2.4/million tokens, with a concurrent 50% promotional discount. All scores are self-reported by MiniMax, achieved with agent scaffolding on their own infrastructure, and independent verification is not yet available. Open-source weights and a technical report are expected to be uploaded to HuggingFace within 10 days.

在编程跑分上，MiniMax M3 的 SWE-bench Pro 得分为 59%，略高于 GPT-5.5 的 58.6%；其 BrowseComp Agent 基准测试得分为 83.5，超过了 Claude Opus 4.7 的 79.3。不过，与 Claude Opus 4.8 在 SWE-bench Pro 上 69.2% 的成绩相比，差距仍然明显。模型输出定价为 2.4 美元/百万 token，发布同期还有五折的促销折扣。需要指出的是，这些跑分全是 MiniMax 自测的结果，在其自家基础设施上配合 Agent 脚手架完成，独立验证目前尚未出炉。开源权重和技术报告预计会在 10 天内上线至 HuggingFace 平台。

§ 3

According to multiple media reports on June 3rd, DeepSeek is advancing its first-ever external funding round, sized at approximately 50 billion RMB (about $7.4 billion), with a post-investment valuation between 350 billion to 400 billion RMB (about $52-59 billion). Founder Liang Wenfeng personally subscribed to about 20 billion RMB ($2.8 billion), accounting for nearly 40% of this round's total. Tencent invested approximately 10 billion RMB, and CATL about 5 billion RMB. The National AI Industry Investment Fund also participated. DeepSeek thus abandons its previous convention of not accepting external capital, where all prior funding came from Liang Wenfeng's quantitative hedge fund, High-Flyer.

据多家媒体 6 月 3 日报道，DeepSeek 正在推进公司创立以来的首轮外部融资，规模约为 500 亿元人民币（约 74 亿美元），投后估值在 3500 亿至 4000 亿元人民币之间（约 520 亿至 590 亿美元）。创始人梁文锋个人认购了约 200 亿元（约 28 亿美元），占本轮融资总额的近四成。此外，腾讯出资约 100 亿元，宁德时代出资约 50 亿元，国家人工智能产业投资基金也参与了此轮融资。这意味着 DeepSeek 告别了此前一直不接受外部资本的惯例，其过去的运营资金全部来源于梁文锋旗下的量化对冲基金幻方。

§ 4

On June 1st, Unitree Technology's IPO on the STAR Market was approved by the Shanghai Stock Exchange Listing Committee. From acceptance on March 20th to approval took only 73 days, setting a new 2026 speed record. Unitree plans to raise 4.202 billion RMB, and based on a public float ratio of no less than 10%, its estimated overall valuation is around 42 billion RMB. In 2025, its humanoid robot shipments exceeded 5,500 units, with a number one global market share of 32.4%, and a core component self-research and production rate over 90%. However, in Q1 2026, its year-over-year revenue growth rate dropped from 333% in the previous year to 68%, and adjusted net profit declined 53% year-over-year, showing a significant deceleration. If successful, Unitree will become the 'First Humanoid Robot Stock' on the A-share market.

6 月 1 日，宇树科技的科创板 IPO 申请通过了上交所上市委的审核。从 3 月 20 日被受理到上会，整个过程仅用了 73 天，刷新了 2026 年最快过会纪录。宇树计划募资 42.02 亿元，以不低于 10% 的公开发行比例估算，其整体估值约为 420 亿元。它在 2025 年的人形机器人出货量超过了 5500 台，全球市占率达 32.4%，位居第一，且核心部组件的自研自产率超过 90%。但值得注意的是，2026 年第一季度，其营收同比增速已从前一年的 333% 大幅回落至 68%，扣非净利润同比下降了 53%，增长势头明显放缓。如果最终顺利上市，宇树将成为 A 股市场上的“人形机器人第一股”。

§ 5

On June 3rd, Moonshot AI launched the Kimi Work Beta program, positioning it as a general-purpose local agent for knowledge workers. Unlike the previously mentioned Kimi WebBridge which controls browsers, Kimi Work processes tasks locally. In complex scenarios, it can autonomously split a task into up to 300 sub-agents for parallel execution, suitable for cross-document organization, batch data analysis, and multi-step workflows. The same day, ByteDance's Coze 3.0 went online, supporting flexible collaboration among multiple users and agents, and independent project management. Its biggest change is the ability to directly integrate with local agent platforms like Claude Code and Codex CLI, meaning Coze is no longer limited to pure cloud orchestration and can mix local and cloud usage.

6 月 3 日，月之暗面启动了 Kimi Work 的 Beta 内测，其定位是面向知识工作者的通用型本地 Agent。与此前介绍的用于操控浏览器的 Kimi WebBridge 不同，Kimi Work 直接在用户本地处理任务。在复杂场景下，它能自主地将任务拆分为最多 300 个子 Agent 并行执行，非常适合跨文档整理、批量数据分析以及多步骤工作流。同一天，字节跳动旗下的扣子 Coze 3.0 也正式上线，它支持多人多 Agent 灵活协作和项目独立管理。这款产品最大的变化是能直接接入 Claude Code、Codex CLI 等本地 Agent 平台，这意味着 Coze 不再局限于纯粹的云端编排，本地和云端的算力现在可以混用了。

§ 6

On June 2nd, Alibaba Cloud's Tongyi team launched Qwen3.7-Plus on the Bailian platform. Building on the previous Qwen3.7-Max, the new model adds image and video understanding, deep reasoning, and tool-use capabilities. It has a built-in Agentic RL mechanism that allows it to continuously optimize its own strategy based on execution feedback. This week, the three companies are converging on the Agent theme: Moonshot AI focuses on local task processing, ByteDance on cross-platform orchestration, and Alibaba on multimodal reasoning.

6 月 2 日，阿里云的通义团队在百炼平台上推出了 Qwen3.7-Plus。该模型在上期旗舰闭源模型 Qwen3.7-Max 的基础上，新增了图像和视频理解、深度推理以及工具调用能力。它还内置了 Agentic RL 机制，能够通过执行反馈，持续地优化自身的策略。本周，这三家公司不约而同地在 Agent 赛道上发力：月之暗面主打本地任务处理，字节跳动主打跨平台编排，而阿里则主打多模态推理。

§ 7

Doubao is going to charge. On June 3rd, an official announcement revealed four subscription tiers: Free, Standard at 68 RMB/month, Enhanced at 200 RMB/month, and Professional at 500 RMB/month. Annual subscriptions are 0/688/2048/5088 RMB respectively. The Professional plan targets software development, data analysis, financial research, and other professional scenarios, offering dedicated computing resources, API calls, and priority response during peak times. The basic free tier retains daily chat, Q&A, copywriting, translation, and image generation, promising to remain permanently free, unlimited, and with no speed throttling. The official launch is expected in late June.

豆包要开始收费了。6 月 3 日，官方发布公告，计划推出四档订阅方案：免费版、标准版 68 元/月、加强版 200 元/月、专业版 500 元/月，相对应的年付价格分别为 0元、688元、2048元和5088元。专业版主要面向软件开发、数据分析、金融研究等专业使用场景，提供专属算力、API 调用以及高峰期优先响应等服务。基础免费版则保留了日常聊天、问答、文案撰写、翻译和图片生成等功能，并承诺该版本将永久免费、不限使用次数且不会降低服务速度。新付费方案预计在 6 月下旬正式上线。

§ 8

As of March 2026, Doubao's daily token call volume surpassed 120 trillion, a full 1000 times the volume since its launch in May 2024. In an online poll, 76% of users chose 'Will not use it anymore,' with only 8% indicating they would pay. Looking at competitors, Kimi membership is priced at 49 RMB/month, Zhipu Qingyan at 49 RMB/month, and Baidu Wenxin Yiyan at 30 RMB/month. Doubao's Standard tier starting price of 68 RMB/month already surpasses its peers. For now, Qianwen and Yuanbao have chosen to hold firm and remain free, but they too will inevitably face the same computing cost bill sooner or later.

截至 2026 年 3 月，豆包的日均 token 调用量已经突破了 120 万亿，这个数字是 2024 年 5 月上线初期的整整一千倍。在一项网络投票中，76% 的用户选择了“不用了”，而仅有 8% 的用户表示愿意付费。对比同类竞品，Kimi 会员的价格为 49 元/月，智谱清言也是 49 元/月，百度文心一言则是 30 元/月。因此，豆包标准版 68 元/月的起步价，已经高出了同行。目前，千问和元宝都选择了按兵不动，维持免费策略，但它们迟早也要面对同一张高昂的算力账单。

§ 9

Three years, 1 billion monthly active users. Third-party data shows that ChatGPT surpassed 1 billion global MAUs in May 2026. It took only three years from launch to reach this milestone, faster than all previous applications including TikTok, YouTube, and Google Maps. Year-over-year growth was 62%, with paying subscribers exceeding 50 million. In the same week, OpenAI launched GPT-5.5 Instant, replacing Canvas with native 'Writing Blocks' and 'Code Blocks'. Plus and Pro users can leverage conversation history and saved memories to achieve personalized responses across sessions.

三年，10 亿月活用户。第三方数据显示，ChatGPT 在 2026 年 5 月突破了 10 亿全球月活跃用户的大关。从产品上线到达成这一里程碑，它只用了三年时间，这个速度超过了 TikTok、YouTube、Google Maps 等此前所有知名的应用。其用户数同比增长了 62%，付费订阅用户也超过了 5000 万人。同周，OpenAI 还上线了 GPT-5.5 Instant，该模型用原生的“写作块”和“代码块”功能取代了过去的 Canvas。Plus 和 Pro 用户现在可以利用历史对话和已保存的记忆，实现跨会话的个性化响应。

§ 10

On June 3rd, Codex launched the preview of its Sites feature. Available to Business and Enterprise users, it can directly generate and host interactive websites and applications. On the same day, six new job-role plugins were added, expanding Codex from a coding tool to non-technical roles like creative production, sales, and customer service, covering 62 applications and 110 skills. The ChatGPT memory system was upgraded to a 'dreaming' mechanism, which automatically updates outdated memories—for example, correcting 'You are going to Singapore next month' to 'You have already been to Singapore' after a trip. Lockdown mode was also opened to all individual users, which, when enabled, restricts web browsing and deep research to reduce the risk of data leakage from prompt injection.

6 月 3 日，Codex 推出了 Sites 功能的预览版。该功能面向 Business 和 Enterprise 用户开放，可以直接生成并托管交互式网站和应用。就在同一天，Codex 还新增了 6 个岗位插件，将其能力从编程工具拓展到创意制作、销售、客服等非技术岗位，总计覆盖了 62 个应用和 110 项技能。此外，ChatGPT 的记忆（memory）系统升级为了“dreaming”机制，能够自动更新过时的记忆，比如在旅行结束后，它会自动把“你下个月要去新加坡”这条记忆修正为“你已经去过新加坡”。Lockdown 模式也向所有个人用户开放了，启用后将限制网页浏览和深度研究等功能，以降低因提示词注入而导致数据外泄的风险。

§ 11

On June 5th, Anthropic announced that Claude Cowork usage limits would be doubled for all paid plans, lasting until July 5th. This marks the third round of capacity expansion, following the permanent doubling of Claude Code's five-hour limit in early May and a temporary 50% weekly limit increase in mid-May. It's backed by the computing power contract with SpaceX's Colossus 1 data center, giving access to over 220,000 NVIDIA GPUs. On June 3rd, Claude Code changed the trigger word for dynamic workflows from 'workflow' to 'ultracode'. Previously, many users accidentally triggered agent orchestration scripts when mentioning 'workflow' in normal conversation. After the change, only the explicit use of 'ultracode' or /effort ultracode will initiate multi-agent coordination mode.

6 月 5 日，Anthropic 宣布将 Claude Cowork 的使用限额翻倍，有效期为 7 月 5 日之前，适用于所有付费套餐。这已经是继 5 月初 Claude Code 五小时限额永久翻倍、5 月中旬周限额临时提升 50% 之后的第三轮扩容了。这一举措依靠的是与 SpaceX 签下的 Colossus 1 数据中心算力合同，该合同为 Anthropic 带来了超过 22 万块英伟达 GPU。同样是 6 月 3 日，Claude Code 将其动态工作流的触发词从“workflow”改为了“ultracode”。在此之前，不少用户在普通对话中一提到“workflow”这个词，就会误触发 Agent 编排脚本。改名之后，只有明确使用“ultracode”或 /effort ultracode 指令，才会启动多 Agent 协调模式。

§ 12

In the same week, Nous Research released the public preview of Hermes Desktop, supporting Windows, macOS, and Linux. The Hermes Agent, nicknamed 'Hermès' by the community, previously only had a command-line version. It already has a stable user base within the Chinese AI developer community. The desktop launch now allows those unfamiliar with the terminal to use it directly.

同一周，Nous Research 也发布了 Hermes Desktop 的公开预览版，支持 Windows、macOS 和 Linux 三大平台。Hermes Agent，也就是社区用户昵称的“爱马仕”，此前只有命令行版本，并且在中文本地 AI 开发者社区中已经积累了一批稳定的用户。这次桌面端版本的上线，意味着那些不熟悉终端操作的人也能直接上手使用了。

§ 13

Jensen Huang and Microsoft CEO Satya Nadella shared the stage in Taipei, announcing the reinvention of the Windows PC using ARM. The RTX Spark is a 'superchip,' integrating a 20-core ARM CPU and a Blackwell GPU with 6144 CUDA cores connected via NVLink C2C, supporting up to 128GB of LPDDR5X unified memory, reaching 1 petaflop of AI computing power. NVIDIA claims it can locally run 120-billion parameter models, agents with million-token context, 4K AI video generation, and render 3D scenes exceeding 90GB. It launches in the fall, with the Microsoft Surface Ultra among the first 8 laptops. Over 30 OEMs will launch RTX Spark devices.

黄仁勋和微软 CEO 萨提亚·纳德拉在台北共同登台，宣布要用 ARM 架构重新发明 Windows PC。RTX Spark 是一款“超级芯片”，它在一块芯片上集成了 20 核 ARM CPU 和拥有 6144 个 CUDA 核心的 Blackwell GPU，两者通过 NVLink C2C 总线互连，最高可配备 128GB 的 LPDDR5X 统一内存，AI 算力达到了 1 petaflop（千万亿次浮点运算每秒）。英伟达宣称，这款芯片可以在本地运行 1200 亿参数的模型、处理百万 token 上下文的 Agent 任务、生成 4K AI 视频，并渲染超过 90GB 的 3D 场景。该产品将于秋季上市，微软的 Surface Ultra 将位列首批推出的 8 款笔记本之中，并且有超过 30 家 OEM 厂商会推出搭载 RTX Spark 的设备。

§ 14

The next-generation AI superchip platform, Vera Rubin, was announced to be in full production, claiming a 10x increase in large-scale agent throughput over Grace Blackwell. The open-source Nemotron 3 Ultra, featuring 550 billion parameters, is designed specifically for always-on agents, claiming a 5x inference speed improvement and 30% cost reduction over comparable closed-source models, and is already adapted to mainstream agent platforms. The DGX Station for Windows was unveiled as 'the world's most powerful desktop AI supercomputer.' The DLSS 4.5 Ray Reconstruction feature will roll out to all RTX graphics cards in August, with support added for 11 new games. Jensen Huang teased a 'surprise product that no one has been told about' coming later in the year.

英伟达宣布，其下一代 AI 超级芯片平台 Vera Rubin 已全面投产。据称，相比上一代 Grace Blackwell，该平台在处理大规模 Agent 工作负载时的吞吐量提升了 10 倍。同时，公司还开源了拥有 5500 亿参数的 Nemotron 3 Ultra 模型。该模型专为全天候运行的 Agent 设计，与同级别的闭源模型相比，其推理速度提升了 5 倍，成本降低了 30%，并已适配了主流的 Agent 平台。此外，英伟达还发布了 DGX Station for Windows，并称其为“全球最强的桌面 AI 超算”。DLSS 4.5 的光线重建功能将于 8 月推送至全系 RTX 显卡，同时会新增 11 款游戏的支持。黄仁勋最后还预告，下半年将有一个“还没告诉过任何人的惊喜产品”。

§ 15

$135 per share, a $75 billion fundraising target, a $1.77 trillion valuation. SpaceX will begin trading on the Nasdaq under the ticker SPCX on June 12th. Goldman Sachs is the lead underwriter, joined by Morgan Stanley, Bank of America, Citigroup, and JPMorgan. If successful, this will be the largest IPO in history, surpassing Saudi Aramco's record $29.4 billion raised in 2019. The roadshow began on June 4th, several days earlier than originally planned. A 30% retail allocation will be distributed directly through Robinhood, Fidelity, and Schwab—an unprecedentedly high retail share for an IPO of this scale. Musk will still hold over 82% of voting rights after the offering.

每股 135 美元，计划融资 750 亿美元，估值高达 1.77 万亿美元。SpaceX 将于 6 月 12 日在纳斯达克挂牌交易，股票代码为 SPCX。高盛将担任主承销商，摩根士丹利、美国银行、花旗银行和摩根大通也联合参与承销。如果此次 IPO 顺利完成，它将成为全球有史以来最大规模的 IPO，超过沙特阿美在 2019 年创下的 294 亿美元融资纪录。路演活动已于 6 月 4 日启动，比原计划提前了几天。值得一提的是，本次将有 30% 的零售配额通过 Robinhood、Fidelity 和 Schwab 等平台直接向散户分发。在如此规模的 IPO 中，这么高的散户配售比例是前所未有的。马斯克在上市后仍将持有超过 82% 的投票权。

§ 16

A week before the IPO (June 5th), SpaceX disclosed a new computing contract to the SEC. Starting October 2026, Google will pay SpaceX $920 million monthly for the contract lasting until June 2029, totaling about $30 billion. Google will rent approximately 110,000 NVIDIA GPUs along with associated CPUs and storage. A Google Cloud spokesperson stated this is to meet the higher-than-expected customer demand for the Gemini Enterprise and Agent platforms. The compute power comes from SpaceX's two Colossus supercomputing centers in Memphis, originally xAI facilities. Adding this to the previous Anthropic contract, SpaceX's total disclosed compute contracts now exceed $70 billion.

就在 IPO 前一周（6 月 5 日），SpaceX 向美国证券交易委员会（SEC）披露了一份新的算力合同。根据合同，谷歌将从 2026 年 10 月起，每月向 SpaceX 支付 9.2 亿美元，合同将持续到 2029 年 6 月，总价值约 300 亿美元。谷歌将租用大约 11 万块英伟达 GPU 以及配套的 CPU 和存储设备。谷歌云的一位发言人表示，此举是为了满足 Gemini Enterprise 和 Agent 平台远超预期的客户需求。这些算力源自 SpaceX 位于孟菲斯的两座 Colossus 超算中心（原属于 xAI 的设施）。加上此前与 Anthropic 签订的合同，SpaceX 目前已公开的算力合同总金额已超过 700 亿美元。

§ 17

On June 1st, Anthropic confidentially filed its S-1 registration statement with the SEC, targeting a post-money valuation of approximately $965 billion. Morgan Stanley and Goldman Sachs are jointly leading the offering. Reportedly, its annualized revenue has surpassed $47 billion, a more than 4x increase from about $9 billion at the end of 2025. If SpaceX and OpenAI also go public this year, 2026 will see three trillion-dollar AI companies entering the public markets simultaneously.

6 月 1 日，Anthropic 向美国证券交易委员会（SEC）秘密提交了 S-1 招股说明书，投后估值目标约为 9650 亿美元。摩根士丹利和高盛将联合牵头此次上市。据报道，其年化营收已突破 470 亿美元，相比 2025 年底约 90 亿美元的水平，增长了超过 4 倍。如果 SpaceX 和 OpenAI 也都在今年上市，那么 2026 年将史无前例地有三家万亿级的 AI 公司同时登陆公开市场。

§ 18

Three days later (June 4th), the Anthropic Institute published a report titled 'When AI Builds Itself.' The report shows that Claude currently writes over 80% of Anthropic's production code. Internal engineering output is growing 8x per quarter, and the internal Mythos Preview model achieved a 52x speedup on an ML optimization benchmark, compared to just 3x for the previous generation. Co-founder Jack Clark and Institute head Marina Favaro call in the report for a coordinated global pause on frontier AI development by major AI companies, to buy time for alignment research and social institutions. They stress this isn't a unilateral pause, needing simultaneous execution and verifiable rule constraints across multiple countries, including the US and China.

三天后（6 月 4 日），Anthropic Institute 发布了一份名为《当 AI 自行构建》（When AI Builds Itself）的报告。报告显示，Claude 目前撰写了 Anthropic 超过 80% 的生产代码。公司内部的工程产出每个季度增长 8 倍。其内部模型 Mythos Preview 在一项 ML 优化基准测试中实现了 52 倍的加速，而上一代模型仅能实现 3 倍加速。Anthropic 联合创始人 Jack Clark 与研究院负责人 Marina Favaro 在报告中共同呼吁，全球主要的 AI 公司应协调一致，暂停前沿 AI 的开发，以便为对齐研究和社会制度的调整争取时间。他们强调，这并非单边暂停，而是需要美国、中国等多国同时执行，并接受可验证的具体规则约束。

§ 19

Critics are calling this 'fear marketing by a $965 billion company.' In the same week, OpenAI published a rebuttal, arguing that it should be governments, not private companies, who decide on such a pause.

对于这一呼吁，批评者称这是“一家估值 9650 亿美元的公司，在进行恐惧营销”。同一周，OpenAI 也发文反驳，认为是否暂停 AI 开发的决策权应该属于各国政府，而不是由私人企业来定夺。

§ 20

On June 2nd, at Build 2026, Microsoft unveiled its first self-developed reasoning model, MAI-Thinking-1. It features 35 billion active parameters in a roughly 1 trillion total parameter sparse Mixture-of-Experts (MoE) architecture and supports a 256k context window. Microsoft AI head Mustafa Suleyman emphasized at the launch that the model was 'trained entirely on enterprise-grade, clean, commercially licensed data with no distillation from third-party models.' Yet three days later, media uncovered that the paper accompanying the model explicitly states its training data includes Common Crawl—an open web crawl library that provides no copyright assurances—totaling 24.2 billion pages after filtering. Django co-founder Simon Willison was the first to spot this contradiction. As of June 5th, Microsoft had not issued a public response.

6 月 2 日，在 Build 2026 大会上，微软发布了其首款自研推理模型 MAI-Thinking-1。该模型拥有 350 亿激活参数，采用总参数量约 1 万亿的稀疏 MoE 架构，并支持 256k 的上下文窗口。微软 AI 负责人 Mustafa Suleyman 在发布会上强调，该模型“完全基于企业级、干净且拥有商业授权的数据进行训练，并且没有蒸馏过任何第三方模型”。然而仅仅三天后，就有媒体发现该模型随附的技术论文明确记载，其训练数据来源包含 Common Crawl（一个不提供任何版权许可保证的开放网页爬取库），经过滤后仍然有高达 242 亿个网页。Django 框架的联合创始人 Simon Willison 第一个发现了这个矛盾之处。截至 6 月 5 日，微软方面尚未对此作出公开回应。

§ 21

On June 1st, GitHub Copilot transitioned entirely to a pay-as-you-go billing model. One AI Credit equals $0.01. The Pro plan includes a $10 monthly credit allowance; Pro+ includes $39; and Business includes $19 per user. Code completions and edit suggestions remain unlimited, but chat, Agent mode, and code review now consume credits based on token usage. The developer community reacted furiously. Multiple people reported their monthly bills jumping from $29 to $750 or even $3,000. A single Agent coding session could burn through $30 to $40. By default, there is no spending cap; users must manually enable a hard budget limit in their billing dashboard. GitHub provided transitional promotional credits for Business and Enterprise users for June through August (an additional $30 and $70 per month, respectively), but individual developers received no such buffer.

6 月 1 日，GitHub Copilot 全面转向了按量计费的收费模式。其中，1 个 AI Credit（AI 积分）等于 0.01 美元。Pro 版套餐每月包含 10 美元的用额度，Pro+ 版每月包含 39 美元，Business 版则每月包含每位用户 19 美元。代码补全和编辑建议功能依然是无限使用的，但是聊天、Agent 模式以及代码审查等功能，现在都需要根据 token 消耗量来扣除积分。此举在开发者社区引发了激烈反应，许多人反馈自己的月账单从原来固定的 29 美元，飙升到了 750 美元甚至 3000 美元之高。一次 Agent 编程会话就可能烧掉 30 到 40 美元。最关键的是，默认设置下消费根本没有上限，用户必须自己手动在计费面板中开启硬性预算封顶功能。GitHub 为 Business 和 Enterprise 用户在 6 至 8 月提供了过渡期促销额度，每月分别额外赠送 30 美元和 70 美元，但个人开发者没有任何缓冲措施。

Open source ↗