PyTorch's playbook for AI coding, as of May 2026

PyTorch's playbook for AI coding, as of May 2026 — PyTorch DevLogPyTorch's playbook for AI coding, as of May 2026 Edward Yang (@ezyang) · May 30, 2026 · 9 min read ai-agentscode-reviewossllm One of the important topics being discussed among the PyTorch team is how the PyTorch codebase should engage with AI coding agents. Today, many PRs to PyTorch are AI-authored, and there have been obvious growing pains as we’ve figured things out. Based on discussions at the most recent PyTorch compiler offsite (May 2026), I’ve assembled this playbook for AI coding in PyTorch. It is half descriptive, half prescriptive: it is trying to codify practices that are being used among some members of the team, and bring everyone else along. Hopefully, this post is just the beginning of our ongoing conversation about how to engage with AI coding agents. Norms for AI coding We can think of AI generated code as living in a spectrum, where on one hand we have code that is almost exactly the same as human code, except that it was typed by an AI, and on the other hand completely vibe-coded software which has never been read by a human. PyTorch is production software, used and relied upon by many people. We have a duty to our users to ensure that the code we ship is correct, understandable and maintainable. We think that SOTA coding agents can help us build better software than we could have built purely by hand today, but they present us with novel situations that require adapting our old rules. We think different norms are required depending on where code lives on the spectrum. As a substitution for human written code On the most conservative end, we are adding AI coding but trying to keep as many other aspects of the process fixed. The human should read every line of code. You are responsible for every line of code. Not everything stays the same though. We propose these new norms: In an age of cheap code, we are human review bottlenecked. Authors should work hard to make code review easy. Think about what information your intended reviewer needs and write it down (an AI written commit message is good for completeness, but the LLM is unlikely to know what your reviewer knows and doesn’t know). If a PR is big, make sure that there is a coherent order to engage with the change and write down the “read order.” Don’t mix unrelated or cosmetic changes with semantic changes; ask an LLM to separate these out.

It is extremely tempting to ask an AI agent to directly respond to code review comments. Because we believe in human understanding on this end of the spectrum, we think it is important for humans to engage in dialog in review comments. If a human spent time to write a question, they deserve a human response in return. On the flip side, many traditional questions one might ask in code review can be easily answered by an AI agent. Code reviewers should consult AI first, only escalating unresolved questions to humans. You can use @claude on GitHub, or check out a PR locally and use your coding agent locally on it for questions.

It is OK to use a coding agent to autonomously fix code review comments (especially nits), but you are still responsible for reading and owning all the fixes. This especially includes checking that the comment was actually fixed!

Consider asking to directly edit someone else’s code. You should ask the author first before, e.g., pushing a commit, but with AI agents this is a very compact way to transmit small nitty feedback that would have been fed straight into an AI agent anyway. It is also a good way to communicate more dramatic changes that would take more time to explain in text–an AI agent can expand text into code and help you verify that the intent of your text is clear. The original author still has to read and take ownership of these changes.

Mass AI PRs Mass AI PRs are when we use agents to generate many PRs in parallel; e.g., using agents to burn down issues on an issue tracker. Many bugs are not individually important enough for a human to dedicate a few days fixing, but in aggregate, fixing bugs is important, and AI coding agents are a big opportunity to kill low hanging fruit (in the same way AI agents are really good at discovering security vulnerabilities.) The general ask here is that we should have high-level agreement that these fixes, in aggregate have an ROI that justifies the human time spent on it. While the operator of the agent swarm is responsible for doing initial reviews, guiding it and improving it based on feedback, a mass of AI PRs will increase reviewer burden. The point is to have agreed that this review burden is worth it! Well-encapsulated unreviewed code As of today, we do not accept unreviewed AI generated code (aka slop) to the main pytorch/pytorch repo. However, we think the capability of SOTA models today enables the creation of systems that otherwise could not have existed (e.g., via hill climbing.) We have several live...

PyTorch's playbook for AI coding, as of May 2026

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

It's Not Just X. It's Y

Amazon, Facebook, FBI have access to a private intelligence-sharing network

Show HN: GoPeek – open links in live mini browser windows without new tabs

Agent Memory: An Anatomy