Sandbox

@agentick/sandbox provides a sandboxed execution layer for agents — types, a <Sandbox> component, pre-built tools, and edit utilities. It's the foundation that provider adapters (@agentick/sandbox-local, @agentick/sandbox-docker) build on.

The model gets Shell, ReadFile, WriteFile, and EditFile tools that are scoped to their nearest <Sandbox> in the component tree. Multiple sandboxes in the same tree work naturally.

Installation

bash

pnpm add @agentick/sandbox

Quick Start

tsx

import { Sandbox, Shell, ReadFile, WriteFile, EditFile } from "@agentick/sandbox";
import { localProvider } from "@agentick/sandbox-local";

function CodingAgent() {
  return (
    <Sandbox provider={localProvider()} workspace="/tmp/project">
      <System>You are a coding assistant with sandbox access.</System>
      <Shell />
      <ReadFile />
      <WriteFile />
      <EditFile />
      <Timeline />
    </Sandbox>
  );
}

The <Sandbox> component initializes the provider, runs optional setup, and provides the sandbox handle to all children via React Context.

The `<Sandbox>` Component

tsx

<Sandbox
  provider={localProvider()}
  workspace="/tmp/project"
  mounts={[{ host: "./src", sandbox: "/app/src", mode: "rw" }]}
  allow={{ fs: true, net: false }}
  env={{ NODE_ENV: "development", API_KEY: () => process.env.API_KEY! }}
  limits={{ memory: 512_000_000 }}
  setup={async (sb) => await sb.exec("npm install")}
  persist={true}
>
  {children}
</Sandbox>

Prop	Type	Default	Description
`provider`	`SandboxProvider`	required	Provider adapter
`workspace`	`string \| true`	`true`	Path or auto temp dir
`mounts`	`Mount[]`	—	Host-sandbox path mappings
`allow`	`Permissions`	—	Advisory permissions
`env`	`Record<string, string \| (() => string)>`	—	Environment variables
`limits`	`ResourceLimits`	—	Resource constraints
`setup`	`(sandbox) => Promise<void>`	—	Post-creation callback
`persist`	`boolean`	`false`	Persist state in snapshots

Internally, <Sandbox> uses useData for async initialization and useOnUnmount for cleanup. Env functions are resolved at creation time.

Pre-built Tools

Four tools ship with the package. Each uses use() + useSandbox() for tree-scoped context injection.

Tool	Tool Name	Description
`<Shell />`	`shell`	Execute a shell command
`<ReadFile />`	`read_file`	Read file contents
`<WriteFile />`	`write_file`	Write content to a file
`<EditFile />`	`edit_file`	Apply surgical edits

Include all four or pick specific ones:

tsx

// Full toolkit
<Sandbox provider={provider}>
  <Shell />
  <ReadFile />
  <WriteFile />
  <EditFile />
  <MyAgent />
</Sandbox>

// Read-only
<Sandbox provider={provider}>
  <Shell />
  <ReadFile />
  <MyAgent />
</Sandbox>

Tree Scoping

Tools access their nearest <Sandbox> provider via React Context. Two sandboxes in the same tree scope their tools independently:

tsx

<Sandbox provider={localProvider()}>
  <Shell />          {/* local sandbox */}
  <ReadFile />
  <MyLocalAgent />
</Sandbox>

<Sandbox provider={dockerProvider()}>
  <Shell />          {/* docker sandbox */}
  <WriteFile />
  <MyDockerAgent />
</Sandbox>

This works because each tool uses use: () => ({ sandbox: useSandbox() }), which captures the value from its position in the tree.

Custom Tools

Build your own sandbox-backed tools with useSandbox():

tsx

import { createTool } from "agentick";
import { useSandbox } from "@agentick/sandbox";
import { z } from "zod";

const GrepTool = createTool({
  name: "grep",
  description: "Search files for a pattern",
  input: z.object({
    pattern: z.string(),
    path: z.string().default("."),
  }),
  use: () => ({ sandbox: useSandbox() }),
  handler: async ({ pattern, path }, deps) => {
    const result = await deps!.sandbox.exec(`grep -rn "${pattern}" ${path}`);
    return [{ type: "text", text: result.stdout || "No matches found." }];
  },
});

Edit Utilities

The EditFile tool and underlying applyEdits function support 5 editing modes with 3-level whitespace-tolerant matching — designed for LLM-generated edits.

Edit Modes

Mode	Fields	Description
Replace	`old`, `new`	Find `old`, replace with `new`
Delete	`old`, `delete: true`	Find `old`, remove it (auto-consumes trailing newline for complete lines)
Insert before/after	`old`, `insert`, `content`	Insert `content` relative to anchor `old`
Insert start/end	`insert`, `content`	Prepend/append `content` to file
Range	`from`, `to`, `content`	Replace everything from `from` through `to` (inclusive)

Mode is detected by field presence (precedence: range > insert > delete > replace).

typescript

// Replace
{ old: "return 1;", new: "return 2;" }

// Rename all occurrences
{ old: "oldName", new: "newName", all: true }

// Delete
{ old: "// TODO: remove\n", delete: true }

// Insert after anchor
{ old: "import { foo } from 'foo';", insert: "after", content: "import { bar } from 'bar';" }

// Append to end of file
{ insert: "end", content: "export default main;" }

// Range replacement
{ from: "function old() {", to: "} // old", content: "function new() {\n  return 42;\n}" }

Matching Strategy

Each anchor (old, from, to) is matched using 3 strategies in order:

Exact byte match — the string appears verbatim
Line-normalized — trailing whitespace stripped from both sides
Indent-adjusted — leading whitespace baseline stripped, replacement re-indented

Models don't need to perfectly reproduce whitespace — the tool recovers from trailing spaces and indentation mismatches.

The `Edit` Interface

typescript

interface Edit {
  old?: string; // Text to find (replace, delete, insert before/after)
  new?: string; // Replacement text (replace mode)
  all?: boolean; // Apply to all occurrences (default: false)
  delete?: boolean; // Delete matched text (sugar for new: "")
  insert?: "before" | "after" | "start" | "end"; // Insert mode
  content?: string; // Content to insert or range replacement
  from?: string; // Range start boundary (inclusive)
  to?: string; // Range end boundary (inclusive)
}

`applyEdits(source, edits)` — Pure Transform

No I/O. All edits resolve against the original source, validate for overlaps, and apply atomically.

typescript

import { applyEdits } from "@agentick/sandbox";

const result = applyEdits(source, [
  { old: "return 1;", new: "return 2;" },
  { old: "debugLog()", delete: true },
  { insert: "end", content: "\nexport default main;" },
]);

result.content; // Transformed source
result.applied; // Number of edits applied
result.changes; // [{ line, removed, added }, ...]

`editFile(path, edits)` — Atomic File I/O

Reads, applies edits, writes atomically (temp + rename).

typescript

import { editFile } from "@agentick/sandbox";

await editFile("/path/to/file.ts", [{ old: "const x = 1;", new: "const x = 42;" }]);

Provider adapters use applyEdits to implement their Sandbox.editFile() method.

Implementing a Provider

Provider adapters implement SandboxProvider:

typescript

import type { SandboxProvider, SandboxHandle } from "@agentick/sandbox";
import { applyEdits } from "@agentick/sandbox";

export function myProvider(): SandboxProvider {
  return {
    name: "my-provider",
    async create(options) {
      // Set up sandbox environment from options...
      return {
        id: crypto.randomUUID(),
        workspacePath: "/sandbox/workspace",
        async exec(command, opts) {
          /* ... */
        },
        async readFile(path) {
          /* ... */
        },
        async writeFile(path, content) {
          /* ... */
        },
        async editFile(path, edits) {
          const source = await this.readFile(path);
          const result = applyEdits(source, edits);
          if (result.applied > 0) await this.writeFile(path, result.content);
          return result;
        },
        async destroy() {
          /* ... */
        },
      };
    },
  };
}

The SandboxCreateOptions passed to create() include workspace, mounts, permissions, env, and limits. Optional restore() and destroy() methods on the provider support snapshot-based persistence.

useSandbox()

Access the nearest Sandbox from the component tree. Throws if no <Sandbox> is above the caller.

tsx

import { useSandbox } from "@agentick/sandbox";

// In a custom component or tool's use() hook
const sandbox = useSandbox();
const result = await sandbox.exec("ls -la");

Testing

Import test utilities from the /testing subpath:

typescript

import { createMockSandbox, createMockProvider } from "@agentick/sandbox/testing";

const sandbox = createMockSandbox({
  exec: vi.fn().mockResolvedValue({ stdout: "hello", stderr: "", exitCode: 0 }),
});

const provider = createMockProvider({
  create: vi.fn().mockResolvedValue(sandbox),
});

Both return objects with vi.fn() stubs and sensible defaults. Override any method via the options parameter.

Providers

Package	Strategy	Description
`@agentick/sandbox-local`	`seatbelt` (macOS), `bwrap`/`unshare` (Linux)	Host machine with OS-level sandbox. Safe by default — denies reads to home dirs, volumes, keychains.
`@agentick/sandbox-docker`	Docker containers	Full container isolation via Docker Engine API. One container per sandbox, commands via `docker exec`.

See each provider's README for platform requirements and security model details.

Sandbox ​

Installation ​

Quick Start ​

The <Sandbox> Component ​

Pre-built Tools ​

Tree Scoping ​

Custom Tools ​

Edit Utilities ​

Edit Modes ​

Matching Strategy ​

The Edit Interface ​

applyEdits(source, edits) — Pure Transform ​

editFile(path, edits) — Atomic File I/O ​

Implementing a Provider ​

useSandbox() ​

Testing ​

Providers ​

Sandbox

Installation

Quick Start

The `<Sandbox>` Component

Pre-built Tools

Tree Scoping

Custom Tools

Edit Utilities

Edit Modes

Matching Strategy

The `Edit` Interface

`applyEdits(source, edits)` — Pure Transform

`editFile(path, edits)` — Atomic File I/O

Implementing a Provider

useSandbox()

Testing

Providers