Professional Agentic Product Engineering

Chapter

Climb the eight tiers — to spend effort only where your work needs it

This guide is one ladder: eight tiers, simple → hard. A tier is a level of skill and a level of where you're applying effort.

As you climb, the work shifts from wording the prompt to engineering the system around the model — prompts shrink, the system carries the intelligence.

You don't need all eight. Climb until the agent is as reliable as the work demands, then stop.

The trick is knowing what pushes you to the next tier — it's always a specific pain, not ambition. When you feel the pain in the right column, you're ready for the next tier.

Tier What you work on Level it applies at What pushes you up
T1 — Professional Prompting The single request One message The agent keeps doing almost the right thing — vague asks get literal, wrong results
T2 — Shaping & Slicing The task before you start One task Big asks go sideways; it edits the wrong things or tries to do everything in one pass
T3 — Context Management The project the agent sees The repo You re-explain the same conventions every session; it can't see your DB/browser/docs
T4 — Loop Until Done A "done" the agent can check itself The task, automated You can't trust the output without reading every line; "done" means nothing concrete
T5 — Checkpointing & Hardening Safe, revertible runs The session A long run goes wrong and you lose good work; nothing to roll back to
T6 — Orchestration Many subagents, long horizons Multi-step / multi-agent One agent is too slow or floods its context; the build is too big for one pass
T7 — Fleet Ops Where & how runs execute Your machines Runs die when your laptop sleeps; parallel agents collide; you want to drive from your phone
T8 — Agent Execution Layer Agents as async workers Your org / production The team needs it: agents must pick up tickets and open PRs without anyone babysitting a terminal

Read the right column as a diagnostic. Stuck re-typing the same context every session? That's the Tier 3 pain — go engineer CLAUDE.md and Skills.

Losing work on long runs? Tier 5 — commit on every green. Each tier exists to kill a specific failure of the one below it:

Prompts drift → T2 plans them. Plans are forgotten across sessions → T3 makes context durable. Context still can't prove correctness → T4 verifies. Long runs lose good work → T5 checkpoints. One agent is too slow → T6 orchestrates many. Runs die when your laptop sleeps → T7 runs a fleet. Humans still babysit terminals → T8 makes agents async workers.

The destination, if you go all the way, is professional agentic product engineering: agents running in a loop against a clear, testable standard, inside a real repo you own.

Which tier do I need?

Match the job to a target tier and stop there — climbing higher than the work demands is wasted effort.

What you're building Aim for Why
A landing page, throwaway script, or one-off for yourself Tier 1–2 — or skip the agent and use a one-shot builder (Lovable/v0/Bolt) Low stakes, short-lived. A clear prompt is enough; you don't need tests, git discipline, or a loop
A feature in a side project you intend to keep Tier 1–4 Now correctness matters over time — you want a verify loop and checkpoints so it doesn't rot
Production code in a shared team repo Tier 1–5, reaching into 6–8 as scale demands Multiple people, real consequences. Context engineering, git hygiene, and review become non-negotiable; orchestration and a hosted execution layer follow when one agent or one machine isn't enough

So for that simple landing page: Tier 1–2 is the answer — and a one-shot builder may serve you better than Claude Code (see Agents vs one-shot builders).