The four levels
Vibe coding
Let AI go wild. No version control, no review, no guardrails. You accept whatever it produces and move on.When it’s appropriate: Prototypes, throwaway scripts, learning experiments, anything you’ll delete tomorrow.When it’s not: Anything that ships to users, touches production data, or lives longer than a week.
Agentic coding with discipline
You’re using AI to write code, but with basic engineering hygiene:
- Git for version control — commit before and after delegating
- File-level understanding of what changed and why
- Basic security awareness (no secrets in code, no open permissions)
- Sufficient test coverage to catch regressions
Agentic software engineering
The AI is doing more, but you’ve built guardrails around it:
- Pre-commit hooks enforce formatting, linting, and secret scanning
- CI pipelines catch what local checks miss
- Function-level understanding — you know what every function does, even if you didn’t write it
- Thorough testing — unit, integration, edge cases
- One-shot AI code reviews — ask Claude to review the diff before you merge
High-quality software engineering
Maximum quality. You understand every line, and you use AI to deepen that understanding:
- Line-by-line understanding of all changes
- Self-reflection loops — “Are you sure about this approach? What about X?” Forces the model to catch its own mistakes before you have to
- Interactive AI code reviews — back-and-forth dialogue about design decisions, not just a thumbs-up
- Advanced research — use Claude to explore alternatives, read docs, compare approaches before committing to one
- AI-powered quality control — background testing sessions, Playwright for web apps, automated regression checks
At a glance
Level 1: Vibe coding
Quality: Low · Speed: LightningTesting: Bare minimum · Reviews: None
Level 2: Agentic coding with discipline
Quality: Okay · Speed: FasterTesting: Sufficient · Reviews: Brief
Level 3: Agentic software engineering
Quality: Medium · Speed: FastTesting: Thorough · Reviews: One-shot
Level 4: High-quality SE
Quality: High · Speed: DecentTesting: Rock solid · Reviews: Interactive
More tokens ≠ more slop — A common misconception: spending more tokens means more AI-generated slop. That’s only true if the tokens go toward generating more code. When tokens go toward research, self-reflection, and deeper understanding, quality goes up — not down. Level 4 uses more tokens than Level 1. But the tokens buy verification, not volume.
← Prev: Agentic Engineering · Next: Robots-First →