Learning-focused CTFs are Facing a Restructure

Learning-focused CTFs are Facing a Restructure — exploiting.systems

There have been a large number of blog posts in the past month or two talking about the end of the CTF as we know it. I've been working on this post since April, and have trimmed it down to ensure it is additive and not redundant.

Background

About a decade ago, I participated in my first Capture the Flag (CTF) competition and was immediately hooked. The rush of solving a challenge, getting the flag, and inching up the scoreboard scratched an itch I didn't know I had.

CTFs became a cornerstone of my career. Each one would teach me something new, and as I progressed there was always headroom to grow and learn. I wouldn't say I truly entered the competitive scene, but I eventually co-ran a hobby team with mild success before life pulled the members in different directions.

As I took on more leadership-oriented positions, it became not just a great way to learn, but also an excellent way to mentor. I often extended an invitation to new team members, hoping they would discover the same passion and competitive drive I had previously found, which many did. They loved the combination of competition and learning, and grew their knowledge grew quickly as a result.

Things have changed. Here are some screenshots are from recent CTFs which still had the old score graphs available.

Before: Dice CTF 2023

After: Dice CTF 2025

(Note: I included the 2025 score graph instead of 2026 because DiceCTF changed their score graph design)

Before: srdnlen CTF 2025

After: srdnlen CTF 2026

Before: UMass CTF 2025

After: UMass CTF 2026

I'm pretty confident you can guess what's going on here.

The Modern Landscape

Agentic workflows have taken over. Whether it's Claude Code, Codex, or even a harness with a local model, teams and individuals are being forced to incorporate agents in order to remain competitive. On top of this, GPT 5.5 and Opus 4.7 have all proven to be a large step above the models from even a year ago.

This reflects the state of the industry. Anthropic's Mythos marketing push has seen a surge in the use of LLMs for discovering real-world vulnerabilities in significant projects such as Firefox, the Linux kernel, and even cURL. There are a lot of caveats nested in there, and I encourage you to read the details if you haven't already. But regardless of how much is hype versus a truly significant capability (it's probably somewhere in the middle), the increase in the capabilities of models are affecting CTFs significantly.

In some aspects, top CTF competitions have always represented the cutting edge, showcasing newly discovered vulnerabilities with a twist or demonstrating new technologies (AIxCC in 2025 or the Cyber Grand Challenge in 2016). So it's no surprise that the trend is continuing. The winning team being decided by who has the most efficient agentic harness is just a reflection of the times, right?

But what about academic settings where CTFs are often used as a learning tool?

Only two years prior, studies identified LLM assistance as a potential issue in academic settings. But due to the lack of capability in models at the time the study concluded that the assistance provided was limited and there was no need for structural changes. Yet even then, separate studies in 2024 were showing that LLMs were able to achieve a "higher success rate than an average human participant".

In 2026, leveraging a fully automated or hybrid (human-in-the-loop) workflow has become the most effective strategy for winning competitions. And on certain learning platforms, specifically Hack the Box, this has manifested as a significant decrease in average time-to-solve. This suggests participants in large, competitive learning platforms are already using LLMs for automation. Hack the Box has historically been a very competitive platform, so this isn't too much of a surprise.

However, there are significant implications for CTFs intended explicitly for academic and learning purposes, especially those targeted at a beginner-level audience. Examples include PicoCTF (which is now CyLab Security Academy) or those hosted as part of college courses.

The reason CTFs were such an effective learning tool was because they leveraged the psychology of competitiveness to motivate new participants to achieve new heights. That leverage evaporates if someone else is foregoing learning and using LLMs to simply win.

The Information Search Process

Some may be familiar with Kuhlthau's Information Search Process. This is the process most CTF participants went through prior to agentic workflows and today's models.

Using agentic workflows completely removes every part of the ISP. Build, clone, or vibecode a good framework once, point it at a CTF server, and rocket up the leaderboard. A quick git clone of ctf-agent, some first-time setup, and the flags start pouring in.

This compromises two major components of CTFs as a learning tool: the incentive structure and the...

Learning-focused CTFs are Facing a Restructure

Related Articles

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play

Old Reddit Is Down

The ultimate female fantasy – A feminist critique of Beauty and the Beast