How Claude Code's Creator Ships 30 PRs a Day

Key insights
- Claude Code started as a chatbot experiment; it became agentic the moment it received the bash tool, and that shift redefined how Anthropic builds software
- Cherny ships 20-30 pull requests per day with zero handwritten lines, yet reports roughly 10x fewer bugs than manual coding would produce
- The printing press analogy suggests software engineers are not disappearing but transforming from scribes into authors, as the cost of building drops and output scales dramatically
- Agentic search using basic tools like glob and grep outperformed RAG-based retrieval, reinforcing the principle of letting the model drive its own workflow
This article is a summary of Building Claude Code with Boris Cherny. Watch the video →
In Brief
Boris Cherny, the creator and head of Claude Code at Anthropic, tells Gergely Orosz on The Pragmatic Engineer podcast that he ships 20-30 pull requests per day without writing a single line of code by hand. Cherny describes a workflow built on parallel AI agents, plan-first iteration, and layered code review, arguing that the role of software engineers is not shrinking but shifting. He compares the current moment to the invention of the printing press: scribes did not disappear, they became writers and authors, while the market for written material expanded by orders of magnitude.
From chatbot to coding agent
Claude Code did not start as a product. According to Cherny, it began in late 2024 as a simple terminal chatbot he built to learn the Anthropic API (25:28). The tool was not agentic. It was just a text interface that called the model.
The turning point came when Cherny gave it a single tool: bash. He asked the chatbot what music he was listening to, and it wrote an AppleScript program to query his music player and returned the answer. He describes this as his "aha moment," because the model did not need to be boxed into a predefined interface. Given the right tools, it figured out how to accomplish the task on its own (25:28).
This insight, that the model should drive the workflow rather than be a component inside a larger program, shaped Claude Code's architecture. Cherny describes it as a corollary of the bitter lesson in machine learning: stop trying to constrain the model and let it do its thing (26:49).
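The architecture Cherny describes can be sketched as a loop: the model either answers or asks to run a shell command, and the program executes the command and feeds the output back. The sketch below is illustrative only; `call_model` and the reply shapes are hypothetical stand-ins, not the Anthropic SDK.

```python
import subprocess

def run_agent(task, call_model, max_steps=10):
    """Minimal agentic loop with a single bash tool (illustrative sketch).

    call_model is a hypothetical stand-in for any chat API that returns
    either {"type": "answer", "text": ...} or
    {"type": "tool", "command": ...}. The model drives the workflow; the
    harness only executes what it asks for.
    """
    history = [{"role": "user", "content": task}]
    for _ in range(max_steps):
        reply = call_model(history)  # model decides the next action
        if reply["type"] == "answer":
            return reply["text"]     # model chose to answer directly
        # The model asked to run a shell command: execute it and feed the
        # output back as a tool message for the next turn.
        result = subprocess.run(
            reply["command"], shell=True,
            capture_output=True, text=True, timeout=30,
        )
        history.append({"role": "assistant", "content": str(reply)})
        history.append({"role": "tool", "content": result.stdout + result.stderr})
    return None  # gave up after max_steps
```

The key design point is that the harness has no task-specific logic: everything beyond "run this command" lives in the model.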
The tool spread quickly inside Anthropic. Internal adoption went vertical. According to Cherny, nearly 100% of technical employees at Anthropic now use Claude Code daily, and adoption among non-technical staff (sales, finance, data scientists) is growing rapidly (30:15). CEO Dario Amodei reportedly asked whether employees were being forced to use it. Cherny says they were not: people voted with their feet.
The zero-handwritten-code workflow
A rejected PR and a new way of working
Cherny's first pull request at Anthropic was rejected, not because the code was bad, but because he wrote it by hand (19:52). His ramp buddy, Adam Wolf, directed him to Clyde, an early predecessor to Claude Code. It took Cherny half a day to learn the tool, but it then produced a working pull request on the first try.
The switch to fully AI-generated code happened when Opus 4.5 arrived. Cherny describes it as instant. He stopped opening his IDE and eventually uninstalled it entirely (31:25). During a coding vacation in Europe in December, he shipped 10-20 pull requests per day with Opus 4.5 writing 100% of the code in every one. He did not edit a single line manually (32:38).
The bug rate tells a striking story: Opus introduced roughly 2 bugs over the course of a month, compared to an estimated 20 or more if the code had been handwritten (32:47).
Five tabs and a phone
Cherny's daily workflow involves five terminal tabs, each with a separate checkout of the repository (33:52). He round-robins between them, starting each Claude Code session in plan mode (a feature where the AI creates a plan before writing code). While one agent works on its plan, he moves to the next tab and starts a second agent. When a plan comes back, he reviews it, iterates, and then lets the agent implement it.
He also uses the Claude desktop app and the iOS app for overflow. Cherny estimates that roughly a third of his code is now written from his phone (36:19), something he says he never would have predicted six months ago.
Code review in layers
With this volume of output, the obvious question is quality control. Cherny describes a multi-layered review system:
| Layer | Mechanism | Purpose |
|---|---|---|
| Self-testing | Claude runs tests locally and launches itself in a subprocess to verify end-to-end | Catches basic breakage before a PR is created |
| CI code review | Claude Code (via the Agent SDK) reviews every pull request in CI | Catches roughly 80% of bugs (43:02) |
| Best-of-N passes | Multiple parallel agents review and deduplicate false positives | Compensates for LLM non-determinism |
| Human review | An engineer does a second pass on every change | Required before anything reaches production |
Cherny emphasizes that there must always be a human in the loop approving changes, especially for an enterprise product where security and privacy are critical (43:14). He also notes that Claude is excellent at writing lint rules. When he spots a lintable pattern in a colleague's pull request, he tags Claude directly on that PR and asks it to write a lint rule, automating the pattern for the future (45:22).
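The best-of-N layer can be illustrated with a simple voting scheme: run the same review several times and keep only findings that recur, since a finding reported by a single non-deterministic pass is more likely to be a false positive. This is a sketch of the idea, not the team's implementation; `review_fn` is a hypothetical stand-in for one LLM review pass.

```python
from collections import Counter

def best_of_n_review(review_fn, diff, n=3, min_votes=2):
    """Run the same review n times and keep findings that recur.

    review_fn stands in for one LLM review pass over a diff and returns a
    list of finding strings. Findings seen in fewer than min_votes passes
    are dropped as likely false positives.
    """
    votes = Counter()
    for _ in range(n):
        # Normalize so trivially different phrasings of the same finding
        # count as one vote (a real system would cluster semantically).
        for finding in set(review_fn(diff)):
            votes[finding.strip().lower()] += 1
    return [f for f, count in votes.items() if count >= min_votes]
```

The same pattern generalizes: any noisy LLM judgment becomes more reliable when sampled several times and filtered by agreement.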
Why RAG lost to grep
One of the more surprising technical decisions is that Claude Code abandoned retrieval-augmented generation (RAG, a technique where the AI searches stored documents before answering) for code search. Cherny says the team tried a local vector database written in TypeScript that lived on the user's machine (50:47). It worked reasonably well, but had significant downsides: the index drifted out of sync with local code changes, and permissioning (controlling who can access the indexed data) created security concerns.
They tried multiple approaches and found that "agentic search" outperformed everything (51:52). In practice, agentic search means the model uses glob (file pattern matching) and grep (text search) to find what it needs. Cherny describes this in blunt terms: "agentic search is a fancy word for glob and grep."
The idea was partly inspired by his experience at Instagram, where the development tooling was unreliable and engineers routinely used text search instead of click-to-definition to find function definitions (52:09). The same approach works for the model.
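In tool form, "glob and grep" really is about this small. The sketch below shows the two primitives an agent needs to search a codebase with no pre-built index to drift out of sync; the function names and return shapes are illustrative assumptions, not Claude Code's actual tool definitions.

```python
import re
from pathlib import Path

def glob_tool(pattern, root="."):
    """File pattern matching, e.g. '**/*.py' -> sorted list of file paths."""
    return sorted(str(p) for p in Path(root).glob(pattern) if p.is_file())

def grep_tool(pattern, paths):
    """Text search: return (path, line_number, line) for each regex match."""
    regex = re.compile(pattern)
    hits = []
    for path in paths:
        try:
            for i, line in enumerate(Path(path).read_text().splitlines(), 1):
                if regex.search(line):
                    hits.append((path, i, line.strip()))
        except (UnicodeDecodeError, OSError):
            continue  # skip binary or unreadable files
    return hits
```

Because the model reads live files on every query, results can never be stale, and there is no index to secure or permission separately.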
Agent swarms and 200-agent weekends
The episode reveals how Claude Code's own features are increasingly built by swarms of AI agents. Cherny describes how a team member set up an early version of agent swarms (multiple AI agents working together on a shared task) over a weekend to build the plugins feature (1:04:44).
The swarm was given a single instruction: build plugins. It created a specification, set up a task board with roughly 100 tasks, spawned around 200 agents, and implemented the feature. That is roughly the version of plugins that shipped to users (1:05:08).
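The task-board-plus-workers shape described above can be sketched with a shared queue: tasks go on the board, and N workers pull from it until it is empty. Here `work_fn` is a hypothetical stand-in for one agent completing one task; a real swarm would run LLM sessions, not threads.

```python
import queue
import threading

def run_swarm(tasks, work_fn, num_agents=8):
    """Minimal agent-swarm sketch: a shared task board and N workers.

    work_fn(task) stands in for a single agent completing one task.
    Workers pull tasks until the board is empty, so faster agents simply
    pick up more work.
    """
    board = queue.Queue()
    for t in tasks:
        board.put(t)
    results, lock = [], threading.Lock()

    def agent():
        while True:
            try:
                task = board.get_nowait()
            except queue.Empty:
                return  # board is empty: this agent is done
            outcome = work_fn(task)
            with lock:
                results.append(outcome)

    workers = [threading.Thread(target=agent) for _ in range(num_agents)]
    for w in workers:
        w.start()
    for w in workers:
        w.join()
    return results
```

The coordination problem (a specification broken into ~100 tasks, picked up by ~200 agents) reduces to exactly this pattern, plus the harder work of writing good task descriptions.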
Similarly, Claude Cowork (a visual interface for non-engineers to use Claude Code) was reportedly built in about 10 days (1:07:50). The product emerged from observing that non-engineers, including finance teams, data scientists, and even someone monitoring tomato plants with a webcam, were already using Claude Code despite it not being designed for them.
Opposing perspectives
The experience problem
Cherny's workflow is deeply informed by seven years at Meta, where he led code quality across Instagram, Facebook, WhatsApp, and Messenger (11:22). His research there found that code quality contributes a double-digit percentage to engineering productivity (18:32). This background gives him an unusually strong foundation for evaluating AI-generated code. Whether the same results would hold for engineers without that depth of experience is an open question.
The scale of the claim
Anthropic is a company of hundreds, not tens of thousands. Its codebase and development culture are optimized for rapid iteration at a startup. The claim that ~80% of code is AI-written (31:02) may not translate directly to larger organizations with legacy codebases, regulatory constraints, or different risk profiles.
The grief of mastery
Orosz raises a point that many engineers will recognize: the sense of loss when a hard-won skill becomes less valuable (1:22:28). Cherny acknowledges this openly, noting that he fell in love with the art of programming and even wrote the first O'Reilly book on TypeScript. But he frames coding as ultimately a means to an end, not an end in itself.
How to interpret these claims
The episode presents a compelling picture, but several factors deserve consideration before generalizing from Cherny's experience.
Self-reported metrics
The statistics (20-30 PRs/day, 2 bugs/month, 80% AI-written code) come from Cherny himself, describing his own workflow and his own product. These are not independently verified figures. They are not peer-reviewed. They represent the experience of someone who is deeply invested in the success of Claude Code and has unusual expertise in making it work well.
Survivorship in tooling choices
The decision to abandon RAG in favor of agentic search worked for Claude Code's specific use case. But RAG remains effective for many retrieval tasks, and the conclusion that "the model should just use grep" may not generalize to codebases with different structures, sizes, or indexing needs.
The Anthropic environment
Anthropic's engineering culture, with flat titles (everyone is "member of technical staff"), no mandatory PRDs (Product Requirement Documents, formal specifications of what to build), and a heavy prototyping culture, is atypical. The team reportedly built 15-20 interactive prototypes of a single feature in a day and a half (1:00:00). This speed is enabled by a combination of small teams, high model access, and a culture that treats code as disposable. Organizations with different structures may see different results.
What strong evidence would look like
Independent benchmarks comparing AI-assisted versus manual coding across multiple organizations, codebases, and experience levels would strengthen the case. Time-series data on bug rates, code review throughput, and engineer satisfaction from teams outside Anthropic would help distinguish product-specific results from general trends.
The printing press analogy
The most evocative claim in the episode is Cherny's comparison to the printing press (1:25:48). Before the printing press, less than 1% of Europe's population was literate. Scribes trained for years, employed by royalty who were often illiterate themselves.
After the printing press arrived, the cost of printed material dropped roughly 100x over 30-50 years. The quantity of printed material increased roughly 10,000x over the following century (1:26:54). Global literacy eventually climbed to around 70%, though that took another 200-300 years.
Cherny's argument is that software engineers today are the scribes. They are a small class with a specialized skill, employed by business owners who cannot write code themselves, much like illiterate kings employed scribes. The printing press did not eliminate the need for people who work with text. It eliminated the scribe role and created a vastly larger market for writers, authors, editors, and publishers.
Orosz adds a sharp observation: business owners who bring a whiteboard sketch to their engineering team and say "this should be easy" are the modern equivalent of illiterate kings dictating letters to scribes (1:28:37). If those business owners can suddenly "write" their own software, the relationship changes fundamentally.
Practical implications
For working engineers
Cherny identifies several skill shifts. Strong opinions about code style, languages, and frameworks matter less when the model can rewrite code in any language on demand (1:32:27). What matters more: being methodical and hypothesis-driven, being curious enough to work across disciplines (product, design, business), and being comfortable with rapid context-switching across parallel agents.
He describes 2026 as "the year of the generalist" and, somewhat provocatively, "the year of ADHD," because the work has shifted from deep focus on a single task to managing multiple AI agents simultaneously (1:34:08).
For engineering teams
The Claude Code team operates without a mandatory ticketing system, without PRDs, and with everyone on the team writing code, including the engineering manager, designers, data scientists, and the finance person (58:21). Whether this scales beyond a small, highly capable team at an AI lab is uncertain, but the direction is clear: the boundaries between engineering, product, and design are blurring.
For the industry
Cherny's claim that the cost of building software has dropped dramatically echoes what other industry voices have been saying. But the printing press analogy adds a longer time horizon. If the analogy holds, the transition will take years or decades, not months. The immediate effect is a drop in the cost of production. The long-term effect is an expansion of who can build software and what gets built, in ways that are currently impossible to predict.
Glossary
| Term | Definition |
|---|---|
| Agentic AI | AI that can use tools, run commands, and take actions on its own, rather than just generating text in a chat window. |
| RAG (Retrieval-Augmented Generation) | A technique where the AI searches through stored documents before answering, to give more accurate responses. Claude Code tried this for code search and abandoned it. |
| Agentic search | When the AI searches code files itself using basic tools like glob (file pattern matching) and grep (text search), rather than relying on a pre-built index. |
| Plan mode | A Claude Code feature where the AI creates a plan and gets user approval before making any code changes. |
| Work tree (git) | A way to have multiple copies of a codebase checked out at the same time, each on a different branch. Used for running parallel agents without conflicts. |
| Best-of-N | Running the same task multiple times and picking the best result, to compensate for the fact that AI models give slightly different outputs each time. |
| Swiss cheese model | A safety approach using multiple overlapping layers of protection, like slices of Swiss cheese stacked so the holes do not line up. Used in Claude Code's security architecture. |
| Prompt injection | When malicious instructions hidden in content that the AI reads try to trick it into doing something harmful. |
| Uncorrelated context windows | When multiple AI agents each start with their own fresh conversation history, so they do not share biases or assumptions from a parent session. |
| Test-time compute | Using more computing power when the AI generates its answer (not during training), for example by running multiple agents in parallel. |
| Agent SDK | A programming interface that lets developers build automated systems using Claude as the core AI model. Used to power Claude Code's CI review. |
| The bitter lesson | A concept from machine learning researcher Rich Sutton, arguing that methods relying on raw computation consistently outperform those that try to encode human knowledge. |
Sources and resources
Want to go deeper? Watch the full video on YouTube →