日刊 · 2026-06-07 — Glean

Sun, Jun 7, 2026 4picks

13:12

Weekly AI Roundup: Claude Limits Doubled, SpaceX IPO, Microsoft Model Data Contradiction

2026年6月第一周AI快讯：Claude限额翻倍、SpaceX IPO、微软模型数据翻车

A roundup of 10 major AI and tech news items from the first week of June 2026. MiniMax M3 was released, beating GPT-5.5 on coding benchmarks at $0.6/M tokens, though independent verification is pending. DeepSeek raised ~$7.4B in its first external funding round, while Unitree completed its IPO review in a record 73 days. Kimi Work, Coze 3.0, and Qwen3.7-Plus all launched new Agent capabilities. Doubao announced subscription plans. ChatGPT surpassed 1 billion monthly active users. Anthropic doubled Claude Cowork's usage limits, secretly filed for an IPO, and published a report stating Claude writes 80% of its own code. NVIDIA unveiled the ARM-based RTX Spark at Computex. SpaceX is set to IPO on June 12, with Google disclosed paying $920M/month for compute. Microsoft's MAI-Thinking-1 faced backlash after its claimed 'clean data' was revealed to include Common Crawl, and GitHub Copilot's switch to metered billing caused developer bills to spike.

mp.weixin.qq.com · 7 min · AI Engineering · AI Industry · Cost Optimization · Developer Tools · LLM

06:01

Lessons from Building Claude Code: Prompt Caching Is Everything

构建 Claude Code 的教训：提示缓存就是一切

Anthropic engineer shares hard-won lessons from optimizing prompt caching in Claude Code. Prompt caching relies on strict prefix matching, so the order of static vs dynamic content is critical — static system prompts, tools, and context must come first. The post reveals counterintuitive pitfalls: don't update the system prompt mid-conversation (pass updates via messages instead), never switch models or modify tool sets mid-session (it invalidates the entire cache), and when compacting context, reuse the parent conversation's prefix to avoid paying full price for tokens. Practical patterns include using tools like EnterPlanMode to model state transitions, deferring tool loading, and running alerts on cache hit rate. A must-read for anyone building long-running agentic products.

x.com · 8 min · Agents · LLM · Performance

06:01

8 proven tips for crafting a CLAUDE.md that truly understands your project

让Claude Code更懂你：写好CLAUDE.md的8条实战经验

This article distills 8 practical tips for optimizing CLAUDE.md to make Claude Code better aligned with your project: keep it under 200 lines to avoid information overload; maintain a 'do not introduce' list; define actionable coding rules (e.g., use named exports, ban any type); treat CLAUDE.md as a router to other docs, not a library; localize configs for sensitive modules; enforce key rules via hooks; use a MEMORY.md file for cross-session memory; and predefine work style preferences. These insights come from real-world use, backed by concrete examples and contrast cases, targeting engineers who use AI coding assistants.

x.com · 5 min · Agents · AI · LLM

06:01

Why Your AI Agent Is Drowning in Tools (And How Code Mode Saves It)

为什么你的AI代理被工具淹没（以及代码模式如何拯救它）

When an AI agent integrates many MCP tools, it risks context bloat and tool hallucination — 50+ tools can eat 5–7% of the context window. Traditional remedies like agent-side filtering and MCP-side reduction have trade-offs. Code mode lets the LLM search and execute tools via code, slashing token usage, enabling complex control flow, but adding debugging and infrastructure overhead. Cloudflare and Anthropic examples show that the real lesson is to keep a reasonable toolset driven by use cases, not magic numbers.

engineering.leanix.net · 7 min · Agents · Cloudflare · LLM

A few picks a day.

Weekly AI Roundup: Claude Limits Doubled, SpaceX IPO, Microsoft Model Data Contradiction

Lessons from Building Claude Code: Prompt Caching Is Everything

8 proven tips for crafting a CLAUDE.md that truly understands your project

Why Your AI Agent Is Drowning in Tools (And How Code Mode Saves It)