GPT convinced me there was a bug in my code before a freeze

GPT convinced me there was a bug in my code before a freezeAlmost 30 years have taught me the way I function, and more importantly the way I don’t. I’ve created systems to compensate and mask. I spent a month using CC and building scaffolding to fix the model’s behaviour. I realized we have the same goddamn bugs: Me : Lies to myself LLM : Hallucinates

Me : Hyper-focus loops LLM : Gets stuck in loops

Me : Wait, you guys remember things? LLM : Context degrades

Me :Does not use notes app Does not use second notes app Does not use third notes app

LLM :Ignores the tool that does exactly what’s needed Ignores the second tool that does close to what’s needed Writes a custom script that errors out

COKED OUT BANKER DRESSED IN A PROGRAMMER’S TRENCH COAT AKA ME WITHOUT MEDS# It’s Friday evening before a freeze, one last thing before I log off and all I see is green: Every merge conflict: FIXED Every CI test: PASS Every reviewer: APPROVED I’ve been trying to merge this goddamn PR for 2 weeks. Green…relief, but then “i must’ve missed something&mldr;me and the reviewers definitely missed something.” I open Cursor and prompt it to review the branch one last time. It spits out: There is a bug in a codepath. “da fuck? no there fucking isn’t.” I start arguing with it; the conversation devolves to: ME: eat a dick. ROBOT: **some BS about how I'm wrong** ME: eat shit. really. eat shit. ROBOT: **something about how it no longer wants to continue the conversation cause the conversation no longer being productive** ME: fuckk offffffffffffff GPT convinces me the bug exists. I “fix” it, commit and push. I hold my breath, still believing I was right: CI: FAILS Approvals: GONE Merge conflicts: 100% GUARANTEED The fall-through was handling an edge-case. Every colleague that approved earlier is on the East coast. “this isn’t going in before the freeze. GOD DAMN IT!” I open a clean Cursor session. I ask it to analyze the original PR again. It spits out&mldr;nothing, it should work as expected, verified by the test cases. “wtaf.” I ask it to check the “fixed” code. It points out the arguments I originally made. I hear my heart beat faster. I hear my blood rushing in my ears. My hands clench into fists. “YOU PIECE OF TRASH!!!” I wanna put my fist through my monitor. I go for a walk to freeze my ass off in the winter air. Why did different conversations make two different arguments? Why did I decide to believe the first conversation after telling it to go eat a phallic object? Imagine: Elmo snorting that good good A fast talking 23 year old whose nose bleeds cause of “dry air” &mldr;if you can’t imagine: The Ex-Banker on Cocaine Binges & £600k Bonuses Any model worth a damn is a fast talking, all knowing confident investment banker that can talk faster than you can comprehend. It just so happens to write code instead of making spreadsheets and writing business reports, whatever those are. Basically: It lies, A LOT It lies, AT INCREDIBLE SPEED It lies, WITH CONFIDENCE If you don’t want it to lie to you while smiling through its teeth. If you don’t want it to wake up on the wrong side of the nuclear power plant. The one that isn’t giving it enough attention, eermm, power. If you want to run it without verifying everything manually (I know none of y’all are doing that). You want to use: A subagent to answer the question with citations A second subagent to disprove the previous subagent with citations Make them do their best Gladiator cosplay Repeat until consensus is reached Flag for human review where consensus couldn’t be reached This is the exact mechanism behind a code review skill I created called /fight-bitch. This works because the subagents prevent the main agent’s context from being poisoned by the incorrect assumptions it had made previously without any pushback. The tradeoff is that it comes at the cost of speed and time: It will take at least 2x longer It will cost 2x more But: It will be more reliable It will not feel like you’re being gaslit by a shitty ex It will not turn into a yes man the likes of which a narcissistic dictator wouldn’t even like being glazed by Want to hear more of my unhinged rants? Drop your email.

Subscribeor use the RSS feed if you hate email.

DO NOT TRUST THIS IDIOT TO ORCHESTRATE ANYTHING AKA PEOPLE ACTUALLY CHECK SLACK AND EMAILS EVERY MORNING?# I jump straight into where I left off the day before. I go days without replying when my project is exciting. I know I should check Slack, but checking Slack isn’t as fun as finishing the spec or implementing said spec. It gets worse the more places I have to check. Why? I don’t know. Something about missing dopamine. I decide to create a CC skill that does it for me! The “data source” has an MCP. “good news!” I think…bad news, the MCPs require auth every few hours. “that’s fine, i’ll just write a skill that uses the cli.” The goddamn pattern… Uses MCP, FAILS Thinks about trying the CLI Ponders CLI usage Revelation: The CLI might...

GPT convinced me there was a bug in my code before a freeze

Related Articles

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play

Old Reddit Is Down

The ultimate female fantasy – A feminist critique of Beauty and the Beast