Build a Basic AI Agent from Scratch: Long Task Planning

08 Jun 2026

52 minute read Artificial Intelligence

In the previous part of the Build A Basic AI Agent From Scratch series, we added the essential tools to our agent to allow it to work autonomously for us. We gave it the ability to find files, read and write files, run bash commands and get content from the web. We got a very capable agent with just these tools.

What happens when the agent runs long and complex tasks?

The current agent works very well, but we want it to get a lot of work done, and that requires staying on task for long stretches of time. Right now, if we give our agent a long and complex task, we will find that it does not think long term and stops working after making very little progress.

This is to be expected because the LLM is trained to behave conversationally. It expects to go back and forth on a question-and-answer basis. This is fine for a simple chatbot, but our agent needs to be able to receive a request and work on it for a long time before returning a result.

Long task planning

The next ability we will give to our agent is the ability to plan for long and complex tasks.

The abilities our agent needs are:

Understand the goal of the task

Plan how to tackle the task beforehand

Break the task into concrete steps

Keep track of pending, in progress and completed tasks

If something goes wrong with the current plan, rethink the approach

Check that everything planned is actually done before stopping

To give our agent these abilities, we will rely on what we added in the last part: tools. We will also explain to the model how to use long task planning in its system prompt.
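To make the system prompt idea concrete, here is a minimal sketch of how planning instructions could be appended to an existing prompt. The wording of the instructions, the `PLANNING_INSTRUCTIONS` constant and the `build_system_prompt` helper are all assumptions for illustration, not the prompt used in this series.

```python
# Hypothetical prompt fragment: the wording is an assumption, written to
# mirror the list of abilities above. Adapt it to your own agent.
PLANNING_INSTRUCTIONS = """\
When you receive a long or complex task:
1. Restate the goal of the task in your own words.
2. Write a plan to the scratchpad before taking any other action.
3. Break the plan into concrete to-do items.
4. Keep one item in progress at a time and update its status as you go.
5. If a step fails, rethink the plan instead of repeating the same action.
6. Before you stop, check that every to-do item is done or cancelled.
"""


def build_system_prompt(base_prompt: str) -> str:
    """Append the planning instructions to an existing system prompt."""
    return base_prompt.rstrip() + "\n\n" + PLANNING_INSTRUCTIONS


prompt = build_system_prompt("You are a helpful agent.\n")
```

Keeping the planning rules in their own constant makes it easy to iterate on the wording without touching the rest of the prompt.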

New tool: Scratchpad

This is a very simple but powerful tool. We are just giving the model a place to write its thoughts and read them again at a later time.

The main benefit of this tool is that it forces the model to think through the goal and plan the whole approach before it starts working on the task.

The tool saves the scratchpad content into memory instead of a file or database, which is fine because we don't want to share the scratchpad content between sessions.

Here's the Python implementation:

class Scratchpad:
    """Read and write from an in-memory scratchpad"""

    def __init__(self):
        self._content = ""

    def read(self) -> str:
        if self._content == "":
            return "(empty)"
        return self._content

    def write(self, content: str) -> str:
        self._content = str(content).strip()
        return self._content


scratchpad = Scratchpad()


def read_scratchpad():
    """Read the contents of the scratchpad"""
    return scratchpad.read()


def write_scratchpad(content: str):
    """
    Write into the scratchpad.
    The previous content will be overwritten.
    """
    scratchpad.write(content)
    return "Successfully written content into scratchpad"

> You can find and clone this code in this blog series' GitHub repo: https://github.com/rogiia/basic-agent-harness
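To see how the scratchpad behaves when the model calls it, here is a small usage sketch. It includes a compact stand-in for the two tool functions so it runs on its own. The `TOOLS` registry and the `call_tool` dispatcher are assumptions about how a harness might route tool calls by name, not necessarily how the repo does it, and the example plan text is made up.

```python
# Compact stand-in for the scratchpad tools, so this block is self-contained.
_content = ""


def write_scratchpad(content: str) -> str:
    global _content
    _content = str(content).strip()
    return "Successfully written content into scratchpad"


def read_scratchpad() -> str:
    return _content if _content else "(empty)"


# Hypothetical registry: maps the tool name requested by the model to a function.
TOOLS = {
    "read_scratchpad": read_scratchpad,
    "write_scratchpad": write_scratchpad,
}


def call_tool(name: str, arguments: dict) -> str:
    """Run a tool by name and return its result as text for the model."""
    if name not in TOOLS:
        return f"Unknown tool: {name}"
    return TOOLS[name](**arguments)


before = call_tool("read_scratchpad", {})
call_tool("write_scratchpad", {"content": "Goal: migrate tests\nPlan:\n1. List test files\n"})
after_first = call_tool("read_scratchpad", {})

# A second write replaces the plan instead of appending to it.
call_tool("write_scratchpad", {"content": "Plan revised: convert one file first"})
after_second = call_tool("read_scratchpad", {})
```

Because every write overwrites the previous content, the model has to rewrite the whole plan whenever it changes its approach, which keeps the scratchpad short and current.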

New tool: To-do list

A to-do list allows the agent to decompose the work into tasks and keep track of them to know what's left to do (pending), what it's working on currently (in progress) and what is already done (done).

This tool also enforces some good practices: it doesn't allow multiple tasks to be in progress at the same time, it doesn't allow invalid task statuses and it doesn't allow repeated tasks.

Just like the scratchpad, this tool keeps the to-do list in memory instead of in a file or database. This is also fine because we don't want to share the to-do list between agent sessions.

RETRY_LIMIT = 3


class ToDoList:
    """Helper class to hold a to-do list in memory"""

    statuses = ["pending", "in_progress", "done", "cancelled", "failed"]

    def __init__(self):
        self._items = []

    def read(self, include_completed=False):
        """Read the to-do list"""
        if include_completed:
            return [item.copy() for item in self._items]
        else:
            return [item.copy() for item in self._items
                    if item["status"] != "done" and item["status"] != "cancelled"]

    def append(self, id, content, status):
        if status not in ToDoList.statuses:
            raise Exception(f"Invalid status {status}. "
                            "Valid to-do statuses: pending, in_progress, done, "
                            "cancelled, failed")
        if self.contains(id):
            raise Exception(f"To do item {id} already exists!")
        new_item = {"id": id, "content": content, "status": status, "retries": 0}
        self._items.append(new_item)
        return new_item.copy()

    def contains(self, id) -> bool:
        """Check if the to do list contains an item with a specific id"""
        for item in self._items:
            if item["id"] == id:
                return True
        return False

    def update(self, id, content, status):
        if status is not None and status not in ToDoList.statuses:
            raise Exception(f"Invalid status {status}. "
                            "Valid to-do statuses: pending, in_progress, done, "
                            "cancelled, failed")
        idx = 0
        while idx < len(self._items):
            if self._items[idx]["id"] == id:
                if content is not None:
                    self._items[idx]["content"] = content
                if status is not None:
                    prev_status...
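The listing above is cut off before it shows how the "only one task in progress" rule is enforced, so the following is a separate, minimal sketch of one way such a check could look. It is not the author's implementation: the helper names `check_single_in_progress` and `all_finished` are hypothetical, and the items are plain dictionaries with the same keys that `append` creates.

```python
def check_single_in_progress(items, item_id, new_status):
    """Raise if setting item_id to in_progress would leave two items in progress."""
    if new_status != "in_progress":
        return
    for item in items:
        if item["status"] == "in_progress" and item["id"] != item_id:
            raise Exception(
                f"To-do item {item['id']} is already in progress. "
                "Finish or cancel it before starting another one."
            )


def all_finished(items):
    """True when every item is done or cancelled, so the agent may stop."""
    return all(item["status"] in ("done", "cancelled") for item in items)


items = [
    {"id": 1, "content": "List test files", "status": "in_progress", "retries": 0},
    {"id": 2, "content": "Convert one file", "status": "pending", "retries": 0},
]

# Starting item 2 while item 1 is still in progress is rejected.
try:
    check_single_in_progress(items, 2, "in_progress")
    rejected = False
except Exception:
    rejected = True
```

Returning the error text to the model, rather than silently fixing the list, gives it a chance to close the current task before it moves on. A check like `all_finished` is one way to back the "check that everything planned is actually done before stopping" ability from the list of goals.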
