Learn / Agent Framework

A production framework for AI agents.

The Agent Framework is an 11-part operating model for taking AI agents from demo to production. It groups the work into Build, Run, and Govern so teams can design the agent, operate it safely, and prove what happened when it acts.

Start with Build Download PDF Checklist

Hardening Layer

From framework to delivery system.

The source framework names the right production concerns. This adaptation keeps the structure, then turns it into a path an engineering team can work through in order.

Inspired by Sandeep Mehta and the 11factor.ai framework.

01-04BuildThe structural choices that make an agent portable, grounded, useful, and maintainable before it ever reaches production traffic.4 factors 05-08RunThe operating controls that keep agentic systems responsive, affordable, recoverable, and reviewable while real users depend on them.4 factors 09-11GovernThe quality, safety, and audit practices that make an agent trustworthy enough to improve rather than merely impress.3 factors

Build

Design and Development. The structural choices that make an agent portable, grounded, useful, and maintainable before it ever reaches production traffic.

4 factors

01build

Model Serving Layer

Put every model provider behind a stable serving contract so the product can route, fail over, compare, and upgrade models without rewriting feature code.

Read factor

02build

Context Management

Design the full context pipeline: instructions, task state, retrieved evidence, user history, tool results, compression, ordering, and expiry.

Read factor

03build

Memory Management

Give the agent deliberate short-term and long-term memory with retention rules, retrieval policy, ownership, and stale-memory controls.

Read factor

04build

Integrations and MCP

Expose external tools, APIs, files, and data sources through a consistent integration layer that is discoverable, permissioned, and observable.

Read factor

Run

Production Operations. The operating controls that keep agentic systems responsive, affordable, recoverable, and reviewable while real users depend on them.

4 factors

05run

Orchestration

Coordinate models, tools, workflows, agents, queues, retries, checkpoints, and recovery paths for complex tasks.

Read factor

06run

Human in the Loop

Design approval, review, escalation, and override paths for decisions where autonomy is too risky or too ambiguous.

Read factor

07run

Rate Limits and Latency

Keep agents responsive under provider limits, slow tools, concurrent users, long context, retries, and multi-step workflows.

Read factor

08run

Cost Control

Track and shape LLM spend by task, model, tenant, user, feature, token type, retry path, and business value.

Read factor

Govern

Risk and Quality. The quality, safety, and audit practices that make an agent trustworthy enough to improve rather than merely impress.

3 factors

09govern

Evaluation and Observability

Measure quality and expose system behaviour through evals, traces, labels, dashboards, alerts, and regression checks.

Read factor

10govern

Safety and Guardrails

Constrain agent behaviour with input handling, output validation, permissions, sandboxing, policy checks, and circuit breakers.

Read factor

11govern

Reproducibility and Audit

Capture enough state to replay, explain, review, and defend important agent decisions after the fact.

Read factor

Use the framework as a production checklist.

Work through the factors in order, then loop back whenever a new model, tool, workflow, or risk tier changes the system.

Read the first factor