The Maturity Ladder - Agentic Engineering Acceleration

Flexibility is the skill — The goal isn’t to always operate at Level 4. It’s to match the level to the stakes. A throwaway script doesn’t need interactive code reviews. A payments refactor does.

The four levels

Level 1: Vibe coding

Let AI go wild. No version control, no review, no guardrails. You accept whatever it produces and move on.

When it’s appropriate: Prototypes, throwaway scripts, learning experiments, anything you’ll delete tomorrow.

When it’s not: Anything that ships to users, touches production data, or lives longer than a week.
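The "when it's not" conditions above can be read as a simple gate. The sketch below is one possible encoding, not a rule from this guide: the function name and parameters are hypothetical, and "longer than a week" is assumed to mean more than 7 days.

```python
def vibe_coding_is_appropriate(
    ships_to_users: bool,
    touches_production_data: bool,
    expected_lifetime_days: int,
) -> bool:
    """Return True only when none of the "when it's not" conditions apply."""
    if ships_to_users or touches_production_data:
        return False
    # Assumption: "lives longer than a week" means more than 7 days.
    return expected_lifetime_days <= 7
```

A throwaway script you will delete tomorrow passes this gate; anything that fails it belongs on a higher rung of the ladder.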

Level 2: Agentic coding with discipline

You’re using AI to write code, but with basic engineering hygiene:

Git for version control — commit before and after delegating
File-level understanding of what changed and why
Basic security awareness (no secrets in code, no open permissions)
Sufficient test coverage to catch regressions

This is where most engineers should start. It’s fast and the safety floor is reasonable.
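As a rough illustration of "commit before and after delegating", the sketch below wraps a delegated step in two checkpoint commits. The helper names and the `run_agent` callable are hypothetical stand-ins for however you hand work to the AI, and the commit message format is an assumption. Only standard `git add` and `git commit` options are used.

```python
import subprocess
from typing import Callable, List


def checkpoint_commands(message: str) -> List[List[str]]:
    """Git commands that stage everything and record a checkpoint commit."""
    return [
        ["git", "add", "--all"],
        # --allow-empty records the checkpoint even when nothing changed.
        ["git", "commit", "--allow-empty", "-m", message],
    ]


def run_checkpoint(repo_dir: str, message: str) -> None:
    """Run the checkpoint commands inside the given repository."""
    for argv in checkpoint_commands(message):
        subprocess.run(argv, cwd=repo_dir, check=True)


def delegate(repo_dir: str, task_name: str, run_agent: Callable[[], None]) -> None:
    """Commit, let the agent work, then commit again so the diff is isolated."""
    run_checkpoint(repo_dir, f"checkpoint: before {task_name}")
    run_agent()  # hypothetical: whatever triggers the AI to edit files
    run_checkpoint(repo_dir, f"agent: {task_name}")
```

With the agent's work isolated in its own commit, `git diff --stat HEAD~1 HEAD` lists exactly which files it touched, which supports the file-level understanding this level asks for.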

Level 3: Agentic software engineering

The AI is doing more, but you’ve built guardrails around it:

Pre-commit hooks enforce formatting, linting, and secret scanning
CI pipelines catch what local checks miss
Function-level understanding — you know what every function does, even if you didn’t write it
Thorough testing — unit, integration, edge cases
One-shot AI code reviews — ask Claude to review the diff before you merge
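To make the secret-scanning guardrail concrete, here is a minimal sketch of the kind of check a pre-commit hook might run. The rule names and patterns are illustrative assumptions and far from exhaustive; a dedicated scanner would normally do this job in practice.

```python
import re
from typing import List, Tuple

# Illustrative patterns only; real scanners ship far larger rule sets.
SECRET_PATTERNS = {
    "aws-access-key-id": re.compile(r"AKIA[0-9A-Z]{16}"),
    "private-key-header": re.compile(r"-----BEGIN (?:[A-Z]+ )?PRIVATE KEY-----"),
    "hardcoded-credential": re.compile(
        r"(?i)(?:password|secret|api_key|token)\s*=\s*['\"][^'\"]{8,}['\"]"
    ),
}


def find_secrets(text: str) -> List[Tuple[int, str]]:
    """Return (line_number, rule_name) for every line that matches a pattern."""
    findings = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        for rule_name, pattern in SECRET_PATTERNS.items():
            if pattern.search(line):
                findings.append((line_number, rule_name))
    return findings
```

A hook could pass the output of `git diff --cached` to `find_secrets` and exit with a non-zero status when the list is not empty, which blocks the commit.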

Level 4: High-quality software engineering

Maximum quality. You understand every line, and you use AI to deepen that understanding:

Line-by-line understanding of all changes
Self-reflection loops — “Are you sure about this approach? What about X?” Forces the model to catch its own mistakes before you have to
Interactive AI code reviews — back-and-forth dialogue about design decisions, not just a thumbs-up
Advanced research — use Claude to explore alternatives, read docs, compare approaches before committing to one
AI-powered quality control — background testing sessions, Playwright for web apps, automated regression checks
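The self-reflection loop from the list above can be sketched as a small driver. Everything here is an assumption for illustration: `ask` is a hypothetical callable that sends one message to a chat session that keeps its context and returns the reply, and the challenge wording simply reuses the example question quoted above.

```python
from typing import Callable, List


def reflect(ask: Callable[[str], str], task: str, alternatives: List[str]) -> List[str]:
    """Send the task, then one challenge per alternative; return all replies in order."""
    replies = [ask(task)]
    for alternative in alternatives:
        # Reuses the example question from the text, with X filled in.
        challenge = f"Are you sure about this approach? What about {alternative}?"
        replies.append(ask(challenge))
    return replies
```

The loop only collects the replies. Reading them and deciding whether the original approach survives the challenge is still your job at this level.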

At a glance

Level 1: Vibe coding

Quality: Low · Speed: Lightning · Testing: Bare minimum · Reviews: None

Level 2: Agentic coding with discipline

Quality: Okay · Speed: Faster · Testing: Sufficient · Reviews: Brief

Level 3: Agentic software engineering

Quality: Medium · Speed: Fast · Testing: Thorough · Reviews: One-shot

Level 4: High-quality SE

Quality: High · Speed: Decent · Testing: Rock solid · Reviews: Interactive

More tokens ≠ more slop — A common misconception: spending more tokens means more AI-generated slop. That’s only true if the tokens go toward generating more code. When tokens go toward research, self-reflection, and deeper understanding, quality goes up — not down. Level 4 uses more tokens than Level 1. But the tokens buy verification, not volume.
