Skip to content

Summary: Planning: breaking a goal into steps

Planning is the turn from reacting one move at a time to deciding the shape of the whole job first. The agent breaks a goal into an ordered set of sub-tasks before it starts, so a large task does not have to be improvised step by step. That breakdown is the whole idea; everything else (when to replan, how big each step should be) follows from it. This summary is the scan-in-five-minutes version of the full lesson.

  • Until now the agent reacted: each loop, it looked at the current state, picked one move, and acted. That works until the task is too big to wing. Ask it to plan a multi-day trip under a budget, and pure reaction books a hotel before checking flight dates, or loses the thread halfway through.
  • A purely reactive agent fails on long tasks in three ways: wrong order (it cannot see dependencies), repeated or skipped work (it has no map of what is done), and drift off the goal (nothing holds the overall shape in view).
  • Planning fixes all three by giving the agent a map. Before diving in, it lays out the steps, then has something to follow, to check progress against, and to stay pointed at the goal.
  • Planning is decomposition: taking one large goal and breaking it into an ordered list of smaller sub-tasks, each achievable in a step or a few. The order captures the dependencies (you cannot build the itinerary before you know the dates). The agent produces this breakdown first, as its own step, then executes it.
  • Producing a plan is a reasoning move, the same skill as a model thinking step by step: ask it for the intermediate steps first, not the final answer. The plan is usually written in a structured, executable form (a numbered list or machine-readable steps), not loose prose, so each step can be followed, checked off, and run one at a time.
  • Plan-then-execute builds the whole plan upfront and runs it; it works when the task is predictable. Replanning revises the remaining plan as results come in: the agent executes a step, observes the result, and adjusts when reality contradicts the plan. It is the same self-correction instinct as retrying a weak retrieval, lifted to the level of the whole task. Most capable agents sit between the two.
  • Grain size matters. Steps that are too big hide their complexity (the goal in disguise); steps that are too small bury the strategy in a keystroke transcript. The useful grain is one clear intention per step, where reading the list back tells you the strategy at a glance.
  • Plan only when the task earns it. Multi-step, interdependent goals benefit; single-step tasks just pay the overhead of an extra reasoning step. Match planning to the task, like every other capability in this track.

Before this lesson, an agent “figuring out a complex task” was a black box. Now you can see the move inside it: the agent decomposes the goal into an ordered set of sub-tasks, then executes them, replanning when a result contradicts the plan. When you meet an agent that handles a big, multi-step job, you can ask the sharper question: what is its plan, is the grain right, and does it replan when reality pushes back? And you can see why planning is layered on top of the loop, not a replacement for it: each step is still one loop iteration with a tool call. Planning just decides what those iterations should be and in what order.