Glean 拾遗
日刊Daily /2026-05-26 / OpenAI Agents SDK 新增原生沙箱与模型原生执行框架 The next evolution of the Agents SDK: native sandboxing and harness

OpenAI Agents SDK 新增原生沙箱与模型原生执行框架 The next evolution of the Agents SDK: native sandboxing and harness

原文Source openai.com 推荐By 收录Glean'd 2026-05-26 06:42 阅读Read 1 min
原文 openai.com ↗ openai.com ↗
§ 1

We’re introducing new capabilities to the Agents SDK that give developers standardized infrastructure that is easy to get started with and is built correctly for OpenAI models: a model-native harness that lets agents work across files and tools on a computer, plus native sandbox execution for running that work safely. For example, developers can give an agent a controlled workspace, explicit instructions, and the tools it needs to inspect evidence:

Agents SDK 新增两大能力:一是贴近模型原生运作方式的执行框架,让 agent 能跨文件与工具协同工作;二是原生沙箱执行,为这些工作提供一个安全的计算机运行环境。例如,开发者可以给 agent 分配一个受控工作区、明确指令以及检查证据所需的工具。

§ 2

Developers need more than the best models to build useful agents—they need systems that support how agents inspect files, run commands, write code, and keep working across many steps. The systems that exist today come with tradeoffs as teams move from prototypes to production. Model-agnostic frameworks are flexible but do not fully utilize frontier models capabilities; model-provider SDKs can be closer to the model but often lack enough visibility into the harness; and managed agent APIs can simplify deployment but constrain where agents run and how they access sensitive data. Here’s what some of the customers who tested the new SDK with us had to say: “The updated Agents SDK made it production-viable for us to automate a critical clinical records workflow that previous approaches couldn’t handle reliably enough. For us, the difference was not just extracting the right metadata, but correctly understanding the boundaries of each encounter in long, complex records. As a result, we can more quickly understand what's happening for each patient in a given visit, helping members with their care needs and improving their experience with us.”— Rachael Burns, Staff Engineer & AI Tech Lead, Oscar Health

构建有用的 agent 不仅需要好模型,还需要能支撑文件检查、命令执行、写代码和多步延续工作的系统。从原型到生产,常见的方案各有取舍:模型无关框架灵活但没用好前沿模型能力;模型供应商 SDK 更贴近模型但缺乏对框架的可观性;托管 API 简化部署却限制了运行位置与数据访问。早期测试客户 Oscar Health 表示,新版 SDK 使其能可靠地自动化关键临床记录流程,准确理解长文档中的每次就诊边界。

§ 3

With today’s release, the Agents SDK harness becomes more capable for agents that work with documents, files, and systems. It now has configurable memory, sandbox-aware orchestration, Codex-like filesystem tools, and standardized integrations with primitives that are becoming common in frontier agent systems. The harness also helps developers unlock more of a frontier model’s capability by aligning execution with the way those models perform best. That keeps agents closer to the model’s natural operating pattern, improving reliability and performance on complex tasks—particularly when work is long-running or coordinated across a diverse set of tools and systems. In addition, we realize each product is unique and rarely fits neatly into a mold. We designed Agents SDK to support this diversity. Developers get a harness that’s turnkey yet flexible—making it easy to adapt it to their own stack—including tool use, memory, and sandbox environment.

新版执行框架为处理文档、文件和系统的 agent 提供了可配置内存、沙箱感知编排、类 Codex 文件系统工具,以及对前沿 agent 系统通用原语的标准化集成。通过让执行方式与模型的最佳运作模式对齐,框架能提升复杂任务(尤其是长运行或多工具协同)的可靠性和性能。同时,SDK 保持开箱即用但高度可定制,方便开发者适配自己的工具、内存和沙箱环境。

§ 4

The updated Agents SDK supports sandbox execution natively, so agents can run in controlled computer environments with the files, tools, and dependencies they need for a task. Many useful agents need a workspace where they can read and write files, install dependencies, run code, and use tools safely. Native sandbox support gives developers that execution layer out of the box, instead of forcing them to piece it together themselves. Developers can bring their own sandbox or use built-in support for Blaxel, Cloudflare, Daytona, E2B, Modal, Runloop, and Vercel. To make those environments portable across providers, the SDK also introduces a Manifest abstraction for describing the agent’s workspace. Developers can mount local files, define output directories, and bring in data from storage providers including AWS S3, Google Cloud Storage, Azure Blob Storage, and Cloudflare R2. This gives developers a consistent way to shape the agent’s environment from local prototype to production deployment. It also gives the model a predictable workspace: where to find inputs, where to write outputs, and how to keep work organized across a long-running task.

原生沙箱执行让 agent 在带有必要文件、工具和依赖的受控计算机环境中运行。开发者无需自己拼装执行层,可直接使用或接入 Blaxel、Cloudflare、Daytona、E2B、Modal、Runloop、Vercel 等提供商。通过 Manifest 抽象描述工作区,开发者可以挂载本地文件、指定输出目录,并从 AWS S3 等存储引入数据,从而在本地原型到生产部署之间保持环境一致,模型也能在可预测的工作区中保持长任务的组织性。

§ 5

Agent systems should be designed assuming prompt-injection and exfiltration attempts. Separating harness and compute helps keep credentials out of environments where model-generated code executes. It also enables durable execution. When the agent’s state is externalized, losing a sandbox container does not mean losing the run. With built-in snapshotting and rehydration, the Agents SDK can restore the agent’s state in a fresh container and continue from the last checkpoint if the original environment fails or expires. Finally, it makes agents more scalable. Agent runs can use one sandbox or many, invoke sandboxes only when needed, route subagents to isolated environments, and parallelize work across containers for faster execution.

Agent 系统设计应假设存在提示注入和数据窃取风险。将执行框架与计算环境分离,可防止凭证进入模型生成代码的执行空间。分离还带来持久性:通过外置 agent 状态,即便沙箱容器失效,也能借快照与状态恢复在新容器中从上次检查点继续。此外,还提升了伸缩性——可按需启停沙箱、将子 agent 路由到隔离环境、跨容器并行加速。

§ 6

These new Agents SDK capabilities are generally available to all customers via the API and use standard API pricing, based on tokens and tool use. As we continue to develop the Agents SDK, we’ll keep expanding what developers can build with it, making it easier to bring more capable agents into production with less custom infrastructure, while preserving the flexibility and control developers need to fit agents into their own environments. The new harness and sandbox capabilities are launching first in Python, with TypeScript support planned for a future release. We’re also working to bring additional agent capabilities, including code mode and subagents, to both Python and TypeScript. In addition, we want to help bring the broader agent ecosystem together over time, with support for more sandbox providers, more integrations, and more ways for developers to plug the SDK into the tools and systems they already use.

新能力已通过 API 向所有客户开放,按 token 和工具调用标准计费。后续将持续扩展 SDK 功能,让开发者用更少定制基础设施即可将更强 agent 推向生产,并保持灵活性和控制力。当前功能率先在 Python 中推出,TypeScript 支持计划在未来发布;代码模式和子 agent 等能力也将陆续登陆两种语言。未来还将纳入更多沙箱提供商、集成和工具连接方式,推动 agent 生态融合。

打开原文 ↗ Open source ↗