
Stripe's AI Agents Write All the Code. Here's How.

March 2, 2026 · 13 min read · 2,565 words
Agentic Engineering · Stripe · Coding Agents · Architecture · MCP · CI/CD
IndyDevDan video thumbnail: I Studied Stripe's AI Agents... Vibe Coding Is Already Dead
Image: Screenshot from YouTube.

Key insights

  • Stripe ships over 1,300 pull requests per week with zero human-written code, powered by six architectural layers working together
  • Blueprints combine deterministic code with agent flexibility, letting engineers control which steps need precision and which benefit from AI creativity
  • A centralized tool shed manages nearly 500 MCP tools, solving the discoverability problem that breaks most agent setups
Source: YouTube
Published March 2, 2026
Host: IndyDevDan (Dan)

This article is a summary of I Studied Stripe's AI Agents... Vibe Coding Is Already Dead. Watch the video →



In Brief

IndyDevDan breaks down Stripe's recently published blog post on Minions, their internal coding agents that reportedly ship over 1,300 pull requests per week with no human-written code. Dan walks through six architectural components that make this possible, from warm development sandboxes that spin up in seconds to a centralized tool shed managing nearly 500 MCP (Model Context Protocol) tools. The analysis highlights what separates production-grade agentic engineering from casual AI-assisted coding.

  • 1,300+ AI-written PRs per week
  • ~500 MCP tools in the tool shed
  • 3M+ tests run on every push

Agentic engineering vs. vibe coding

Dan opens with a sharp distinction between two approaches to working with AI coding tools. Agentic engineering means knowing your system so well that you can predict what will happen without watching every step. Vibe coding, by contrast, means prompting an AI and hoping for the best (2:10).

The difference matters because Stripe processes $1.9 trillion in annual payment volume (0:28). Their codebase contains hundreds of millions of lines of code, much of it in an uncommon Ruby stack with homegrown libraries that Large Language Models (LLMs) have never seen in their training data (1:15). Getting AI agents to work reliably in this environment requires deliberate architecture, not luck.


The six components of Stripe's agentic layer

Dan identifies six architectural layers that together form what Stripe calls their agentic platform (3:07). Each layer solves a specific problem that breaks most agent setups.

1. API layer: three ways in

Engineers interact with Minions through three interfaces: a Slack integration for quick tasks, a command-line interface (CLI) for terminal workflows, and a web interface for more involved work (3:07). Multiple entry points mean agents fit into existing workflows rather than forcing engineers to adopt a new tool.

2. Warm dev box pool

Every Minion run gets its own isolated development environment, a sandboxed copy of Stripe's codebase running on AWS EC2 instances. These dev boxes spin up in roughly 10 seconds (11:15) because Stripe maintains a pool of pre-warmed instances ready to go.

Engineers typically run half a dozen dev boxes simultaneously (16:26), each handling a different task. This is what Dan calls "out-loop" operation: fully unattended agents working in parallel sandboxes (20:38). The alternative, "in-loop" operation, is the familiar pattern of sitting at a desk prompting back and forth (20:03). Stripe's architecture supports both, but the real throughput comes from running agents out-loop.
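The pool mechanics can be sketched in a few lines. This is a hypothetical illustration (Stripe has not published implementation details, and the `WarmPool` name and its factory callback are invented here): hand out a pre-initialized environment immediately, and provision its replacement in the background so the pool stays warm.

```python
import queue
import threading

class WarmPool:
    """Keep pre-initialized dev boxes ready so acquisition is near-instant."""

    def __init__(self, provision, size=6):
        self._provision = provision      # slow factory, e.g. boot an EC2 instance
        self._ready = queue.Queue()
        for _ in range(size):
            self._ready.put(provision())  # pay the cold-start cost up front

    def acquire(self):
        """Hand out a warm box immediately; replenish in the background."""
        box = self._ready.get()
        threading.Thread(
            target=lambda: self._ready.put(self._provision()),
            daemon=True,
        ).start()
        return box

# Usage: each agent run grabs its own sandbox without waiting for a cold boot.
pool = WarmPool(provision=lambda: {"status": "ready"}, size=3)
box = pool.acquire()
print(box["status"])  # -> ready
```

The design choice is the same one Stripe's 10-second spin-up implies: the expensive work happens before anyone asks for a box, so the agent's critical path only pays for a queue pop.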

3. Agent harness (forked from Goose)

The agent harness is the runtime that executes each Minion. Stripe forked it from Goose, Block's open-source coding agent (12:22). The harness handles the low-level mechanics: reading files, running commands, managing context, and communicating with the LLM.

By building on an open-source foundation, Stripe avoided reinventing the basic agent loop. They focused their engineering effort on the layers above the harness, specifically blueprints and the tool shed, where their unique requirements live.

4. Blueprint engine

A blueprint is a workflow defined in code that directs a Minion run (23:39). Blueprints combine deterministic steps (steps that always execute the same way) with agent-driven steps where the LLM decides what to do. This is the key architectural choice: not everything needs to be agentic.

For example, a blueprint might deterministically check out the right branch and load specific context files, then hand control to the agent for the creative work of writing code, then deterministically run the test suite. The engineer controls which steps need precision and which benefit from flexibility.

Explained simply: Think of a blueprint like a recipe with two types of instructions. Some steps are exact, like "Preheat the oven to 200°C," and always happen the same way. Other steps require judgment, like "Season to taste," where the chef (the AI) decides what to do. Stripe's blueprints work the same way: some steps are fixed code that runs identically every time, while others let the AI make decisions. Unlike a recipe, though, the blueprint also controls which tools the AI can access and how many attempts it gets.

1. Design the blueprint: An engineer defines a workflow that mixes deterministic code steps with agent-driven steps. The blueprint specifies which context files to load, which tools to enable, and how many CI cycles the agent gets.

2. Spin up a dev box: The platform pulls a pre-warmed EC2 instance from the pool. The full Stripe codebase is available within seconds.

3. Execute the blueprint: The agent harness runs through the blueprint. Deterministic steps execute as written. Agent steps let the LLM read code, write changes, and use tools.

4. Run CI validation: The agent pushes code, triggering Stripe's test suite. Over 3 million tests run to validate the changes (14:29). The agent gets at most 2 rounds of CI feedback to fix any failures (14:50).

5. Open a pull request: If CI passes, the Minion opens a pull request on GitHub. A human engineer reviews and approves before merge.
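The lifecycle above can be sketched as data: an ordered list of steps, each flagged as deterministic code or an agent-driven decision. Everything here is a hypothetical illustration (the `Step` and `execute` names, and the toy step bodies, are invented; Stripe's blueprint engine is not public):

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class Step:
    name: str
    run: Callable[[dict], dict]
    deterministic: bool  # True: fixed code; False: the agent/LLM decides

def execute(blueprint, state):
    """Run each step in order, threading a shared state dict through."""
    for step in blueprint:
        state = step.run(state)
    return state

# Hypothetical blueprint: fixed setup and validation bracket the creative step.
blueprint = [
    Step("checkout branch", lambda s: {**s, "branch": "minion/task-1"}, True),
    Step("load scoped rules", lambda s: {**s, "rules": ["payments.md"]}, True),
    Step("write code", lambda s: {**s, "diff": "agent-generated change"}, False),
    Step("run tests", lambda s: {**s, "ci_passed": True}, True),
]

result = execute(blueprint, {})
print(result["ci_passed"])  # -> True
```

The point of the structure is visible in the list itself: only one step is marked non-deterministic, so the engineer knows exactly where the run can surprise them.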

5. Scoped rule files

Stripe uses conditional context loading to give each Minion run only the rules it needs (3:38). Rather than dumping the entire company's coding standards into every prompt, rule files are scoped to specific parts of the codebase. A Minion working on payments infrastructure loads different rules than one working on the dashboard.

This keeps prompts focused and reduces noise, which is critical when the codebase has hundreds of millions of lines and the LLM has never seen Stripe's homegrown libraries during training.
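Conditional context loading reduces to a small lookup. A minimal sketch, with invented path prefixes and rule-file names (Stripe's actual scoping scheme is not described in detail):

```python
# Hypothetical mapping from codebase areas to rule files; only the rules
# matching the task's target path are loaded into the prompt.
SCOPED_RULES = {
    "payments/": ["payments-conventions.md", "idempotency-rules.md"],
    "dashboard/": ["frontend-style.md"],
}

def rules_for(path):
    """Return only the rule files scoped to the area being edited."""
    matched = []
    for prefix, rules in SCOPED_RULES.items():
        if path.startswith(prefix):
            matched.extend(rules)
    return matched

print(rules_for("payments/ledger.rb"))  # -> ['payments-conventions.md', 'idempotency-rules.md']
print(rules_for("dashboard/app.tsx"))   # -> ['frontend-style.md']
```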

6. Tool shed

The tool shed is Stripe's centralized internal MCP (Model Context Protocol, a standard for connecting AI agents to external tools and data sources) server that manages nearly 500 tools (29:14). Dan describes it as a "meta-tool": instead of loading hundreds of tool definitions into every prompt, the agent queries the tool shed to discover and load only what it needs for the current task (29:21).

Explained simply: Think of the tool shed like a library catalog. A student doesn't carry every book in the library to their desk. They look up which books are relevant, grab those, and leave the rest on the shelves. The tool shed works the same way for AI agents: it indexes all available tools and serves only the ones the agent needs right now. Unlike a library catalog, though, the tool shed can also enforce access controls, deciding which agents are allowed to use which tools.
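The meta-tool pattern can be sketched as a searchable catalog with an access filter. All catalog entries and the `discover` helper are hypothetical (Stripe has not published the tool shed's interface); the sketch only shows the shape of the idea:

```python
# Hypothetical tool catalog: the agent searches it instead of loading
# all ~500 definitions into every prompt.
CATALOG = [
    {"name": "run_migration", "tags": {"database", "schema"}},
    {"name": "query_logs", "tags": {"observability", "debugging"}},
    {"name": "open_pr", "tags": {"git", "review"}},
]

def discover(task_tags, allowed=None):
    """Return only the tools relevant to the task (and permitted for this agent)."""
    return [
        t["name"] for t in CATALOG
        if t["tags"] & task_tags and (allowed is None or t["name"] in allowed)
    ]

print(discover({"database"}))            # -> ['run_migration']
print(discover({"git"}, allowed=set()))  # -> [] (access control denies everything)
```

In a real MCP setup the catalog itself is exposed as a tool, so the agent's first call is "what tools exist for this task" rather than a prompt pre-stuffed with every definition.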


What practitioners can learn

Dan rates Stripe's agentic layer 8 out of 10 (32:32). Two specific critiques point to areas where the architecture could evolve.

The CI feedback limit

Minions get at most 2 rounds of CI feedback (33:02). If the agent can't fix failing tests in two attempts, the run is abandoned. Dan argues this is too limiting. More CI rounds would let agents solve harder problems, especially in a codebase with 3 million tests where failures can cascade in unexpected ways.

The counterargument is cost: each CI run against Stripe's full test suite consumes significant compute resources. Stripe likely chose 2 rounds as a balance between agent capability and infrastructure cost.
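The trade-off lives in a single parameter of the control loop. A toy sketch (the function and its callbacks are invented for illustration, not Stripe's code): the round budget caps how many times the expensive CI suite runs per task.

```python
def run_with_ci_budget(attempt_fix, run_ci, max_rounds=2):
    """Let the agent react to CI failures, but cap the number of CI runs.

    attempt_fix(feedback) -> candidate change; run_ci(change) -> (passed, feedback).
    Returns the passing change, or None if the run is abandoned.
    """
    feedback = None
    for _ in range(max_rounds):
        change = attempt_fix(feedback)
        passed, feedback = run_ci(change)
        if passed:
            return change
    return None

# Toy example: the "agent" succeeds only after reading one round of feedback.
def agent(feedback):
    return "fixed" if feedback else "first try"

result = run_with_ci_budget(agent, lambda c: (c == "fixed", "test X failed"))
print(result)  # -> fixed
```

Raising `max_rounds` directly trades infrastructure cost for the chance to solve harder tasks, which is exactly the tension Dan's critique points at.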

The human review bottleneck

Stripe's current system still requires a human engineer to review and approve every pull request before merge (34:02). Dan pushes toward what he calls Zero Touch Engineering (ZTE): a future where sufficiently validated changes go from prompt to production with no human review (34:29).

For a company processing $1.9 trillion in payments, removing human review is a significant trust threshold. But Dan's point is directional: as CI validation and automated testing improve, the human review step becomes less about catching bugs and more about organizational comfort.


Applying these patterns to your own work

You don't need Stripe's scale to benefit from their architecture. Several components translate directly to smaller teams and individual developers.

1. Start with scoped context files: Create rule files (like CLAUDE.md or .cursorrules) that give your AI coding tools project-specific knowledge. Focus on conventions, patterns, and constraints that the LLM wouldn't know from public training data.

2. Design simple blueprints: Structure your agent workflows as a mix of deterministic and agentic steps. For example: deterministically load context and check out a branch, let the agent write code, then deterministically run tests. Most coding agent tools support this through configuration files or scripts.

3. Add CI as a feedback loop: Connect your agent's output to automated tests. Even a small test suite gives the agent something to work against. The agent writes code, tests fail, the agent reads the failure, and tries again.

4. Run agents in parallel: If your tasks are independent, run multiple agent sessions simultaneously. Each works in its own branch or directory. This is the "out-loop" pattern that drives Stripe's throughput.
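The out-loop dispatch pattern is a few lines with standard concurrency primitives. A minimal sketch, where `run_agent` is a stand-in for launching a real unattended session in its own branch or sandbox:

```python
from concurrent.futures import ThreadPoolExecutor

def run_agent(task):
    """Stand-in for an unattended agent run in its own branch/sandbox."""
    return {"task": task, "branch": f"agent/{task}", "status": "pr-opened"}

# Dispatch independent tasks in parallel and collect results afterwards,
# instead of babysitting one interactive session at a time.
tasks = ["fix-flaky-test", "bump-deps", "add-logging"]
with ThreadPoolExecutor(max_workers=len(tasks)) as executor:
    results = list(executor.map(run_agent, tasks))

print([r["branch"] for r in results])
# -> ['agent/fix-flaky-test', 'agent/bump-deps', 'agent/add-logging']
```

The review step then happens in a batch: the engineer looks at three finished pull requests rather than supervising three live sessions.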


Common pitfalls: What goes wrong with coding agents

Based on the patterns Dan describes, these are the most frequent mistakes when setting up AI coding agents.

  • Dumping everything into one prompt? Stripe uses scoped rule files and a tool shed specifically to avoid overloading the agent with irrelevant context. Start with the minimum context the agent needs for the current task
  • Skipping CI entirely? Without automated tests, there is no feedback loop. The agent has no way to know whether its code works. Even basic linting and type checking give the agent something to correct against
  • Running agents only in-loop? Sitting at the desk watching every agent step limits throughput to one task at a time. Design your workflow so agents can run unattended in sandboxed environments
  • Using a generic prompt for every task? Stripe's blueprint engine exists because different tasks need different workflows. A migration task needs different tools and rules than a bug fix
  • Expecting agents to handle unfamiliar code without help? Stripe's codebase uses homegrown Ruby libraries that LLMs have never seen. They solved this with scoped rule files that teach the agent about internal conventions. If your project uses unusual patterns, document them for the agent
Remember: Stripe didn't build this overnight. They started with an open-source foundation (Goose), added layers incrementally, and iterated based on what worked. The architecture is a result, not a starting point.


Practical implications

For individual developers

Start with context files. A well-written CLAUDE.md or project-specific rules file is the single most impactful improvement for AI coding. It costs nothing, takes 30 minutes to write, and immediately improves every agent interaction by giving the LLM knowledge about your project's conventions.

For teams adopting coding agents

Invest in your CI pipeline before scaling agent usage. Stripe's 3 million tests are what make 1,300 weekly agent-written pull requests viable. Without strong automated validation, more agent output just means more bugs to review manually. Start with the test coverage you have and expand it as agent usage grows.

For engineering leaders evaluating agent platforms

Look for the blueprint pattern. Platforms that let you mix deterministic and agentic steps give you control over quality-critical paths while still benefiting from AI flexibility. Avoid platforms that are purely agentic with no way to enforce specific steps.

Test yourself

  1. Architecture trade-off: Stripe limits Minions to 2 rounds of CI feedback. If you were designing this system, how would you decide the right number of rounds, and what factors would you weigh?
  2. Transfer: Stripe's tool shed manages nearly 500 MCP tools through a central catalog. How would you apply this pattern to a non-coding domain, for example a customer support system with dozens of integrations?
  3. Trade-off: Blueprints mix deterministic and agentic steps. For a database migration task, which steps would you make deterministic and which would you leave to the agent?
  4. Behavior: Dan argues that Zero Touch Engineering (prompt to production, no human review) is the future. How might removing human review change the way engineers think about test coverage and code quality?
  5. Architecture: Stripe pre-warms development environments so they spin up in 10 seconds. What would break in their architecture if this took 5 minutes instead?

Glossary

Agent harness: The runtime that executes a coding agent. It handles the basic loop: read files, run commands, talk to the LLM, and apply changes. Stripe's harness is forked from Goose.

Agentic engineering: Dan's term for designing AI coding systems with enough structure that you can predict outcomes without watching every step. The opposite of hoping a prompt will work.

Blueprint: A workflow defined in code that mixes deterministic steps with agent-driven steps. Think of it like a recipe where some steps say "add exactly 200g flour" and others say "season to taste."

CI (Continuous Integration): An automated system that runs tests every time code is pushed. If tests fail, the code doesn't merge. Stripe runs over 3 million tests on every push.

CLI (Command-Line Interface): A text-based way to interact with software by typing commands in a terminal. One of three ways Stripe engineers can trigger a Minion run.

Dev box: A sandboxed development environment where an agent can safely read, write, and test code without affecting the main codebase. Stripe keeps a pool of these pre-warmed and ready.

Deterministic step: A step in a blueprint that always executes the same way, regardless of what the AI decides. Checking out a git branch or running a test suite are deterministic steps.

EC2: Amazon's cloud computing service (Elastic Compute Cloud). Stripe uses EC2 instances as sandboxed environments for each Minion run.

Goose: An open-source coding agent built by Block (the company behind Square and Cash App). Stripe forked Goose as the foundation for their agent harness.

In-loop: A working mode where the developer sits at the desk and interacts with the agent step by step. Gives full control but limits throughput to one task at a time.

LLM (Large Language Model): An AI model trained on massive amounts of text that can understand and generate code and natural language. The "brain" behind coding agents like Minions.

MCP (Model Context Protocol): A standard for connecting AI agents to external tools and data sources. Stripe's tool shed is an MCP server that manages nearly 500 tools.

Minions: Stripe's internal name for their AI coding agents. Each Minion handles a single task end-to-end: read code, write changes, run tests, open a pull request.

Out-loop: A working mode where agents run fully unattended in parallel sandboxes. The engineer dispatches tasks and reviews results later. This is where the throughput gains come from.

PR (Pull Request): A request to merge code changes into the main codebase. Other engineers review the changes before approving. Stripe's Minions open PRs automatically after CI passes.

Scoped rule files: Context files that contain coding standards and conventions for specific parts of a codebase. Only loaded when the agent works in that area, keeping prompts focused.

Tool shed: Stripe's centralized MCP server that indexes and serves nearly 500 tools. Agents query it to discover which tools are relevant to their current task instead of loading all tools at once.

Vibe coding: Dan's term for the practice of prompting an AI and hoping for good results without understanding the underlying system. The opposite of agentic engineering.

Warm pool: A collection of pre-initialized development environments kept ready for immediate use. Like having rental cars already running in the parking lot instead of making customers wait for the engine to start.

ZTE (Zero Touch Engineering): A future vision where sufficiently validated code goes from prompt to production with no human review step. Not yet implemented at Stripe.

Sources and resources