i ve been building agents for a while and kept running into the same problem: change the sys prompt, swap models, or tweak for the agent calls a tool — and something subtly breaks. The only way I found out was manually running it or a user reporting it.