Busting performance issues, AI edition

The Hub of Heliopolis - Busting performance issues, AI edition

Coding agents are fast becoming the main generators of new source code, but they are also being used for code review, finding and fixing bugs. How can they be used to the best extent to discover and fix performance issues?

Table of contents: Enter AI

bytecode: Our Case Study A Bit of Setup Required

Establishing the Baseline

Communicating Our Intent to the Agent

On a Mission

A Guided Tour of Profiles A Bonus: Learning to Read Profiles

Is Profiling Data Really Necessary?

The Takeaway

In a previous post we looked at how to find and resolve performance issues lurking in any codebase by simply running some of its code through a profiler. The simplest thing we can do, especially when we come to a new codebase that is totally unknown to us is to run its test suite (or a subset of it) and see what code paths are being exercised. When we look at them through a profiler, the typical approach is to start investigating its plateaux, that is functions at the top (or bottom, depending on the orientation of the flame graph!) that have the largest own time. This is generally where the largest opportunity of improvements are. Of course, there could be cases where a function is simply repeating unnecessary work, and this won't show up through a mere visual plateau scan.

Enter AI

Can we automate this process, and perhaps put it on steroids, with the prowess of the most powerful, state-of-the-art coding agents?

Modern coding agents have reached a level of sophistication where they can effectively understand and modify codebases with good practical results. Benchmarks such as SWE-bench (which tests multi-file bug fixes in real GitHub repositories) and LiveCodeBench (which uses fresh competitive programming problems to avoid data contamination) show that frontier models now achieve high pass rates. However, there still is quite the margin for improvement: studies on real-world class-level code generation show a significant gap between synthetic benchmark performance (84-89%) and actual project performance (25-34%) (arxiv-2510.26130), highlighting both the progress made and the room for improvement in practical coding scenarios.

In this post we will explore how well a coding agent can come up with performance improvements when it is provided with focused profiling data. The focus, however, is on the method and the tooling involved, rather than benchmarking a set of models to determine which one is better at finding performance issues.

bytecode: Our Case Study

To make the exposition concrete we will focus on an actual project: we will try to improve the performance of the bytecode library.

bytecode is a Python library for generating and modifying Python bytecode. It provides an abstraction over Python's low-level bytecode instructions, allowing you to programmatically create bytecode objects, convert them to actual code objects, and even inspect and modify existing code's bytecode. It is useful for metaprogramming, dynamic code generation, and bytecode analysis.

These are the tools that we will use for our little experiment, all free:

Austin, for Python profiling, easily installed with "pip install austin-dist"

The Austin VS Code extension, easily installed from the VS Code marketplaces. The latest version comes with an MCP server that can feed profiling data and flame graph navigation commands to a coding agent.

Opencode with the MiniMax M2.5 free model as our coding agent setup.

The Opencode VS Code extension to never leave VS Code throughout our experiment

A Bit of Setup Required

After having cloned the bytecode repository that we will be working on, and opened it in VS Code, there is a minimum of setup required. We need to allow the coding agent to discover the Austin MCP. If you're using GitHub Copilot instead of Opencode, there is no extra setup required for this, since the Austin MCP server integrates natively with that. Otherwise we need to allow other coding agents to discover the server. This is as simple as running the Austin: Generate .mcp.json command: press Ctrl+Shift+P, type austinmcp and press Enter. That's it. Well, not quite, because we decided to use Opencode for our experiment, and as at the moment of writing, it still does not support a .mcp.json file. So instead we open an Opencode session with Ctrl+Escape and politely ask the agent to convert the .mcp.json configuration to something that opencode understands

The Austin MCP server uses an ephemeral port to run, so every time you re-open VS Code there will be a new port assigned to it. If a .mcp.json was previously created, the Austin extension will update the port automatically so that you don't have to regenerate it every time. However, Opencode won't do this unless you instruct it to.

After Opencode configured itself, you might have to start a new session for it to pick up the connection to the server.

We have asked Opencode to configure itself based on the contents...

Busting performance issues, AI edition

Related Articles

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play

Old Reddit Is Down

The ultimate female fantasy – A feminist critique of Beauty and the Beast