Audit: Vuln-discovery agent reimplementing the Cloudflare Project Glasswing

GitHub - evilsocket/audit: An 8-stage vulnerability-discovery agent. · GitHub

/" data-turbo-transient="true" />

Search or jump to...

Search code, repositories, users, issues, pull requests...

-->

Clear

Search syntax tips

Provide feedback

--> We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Cancel

Submit feedback

Saved searches

Use saved searches to filter your results more quickly

-->

Name

Query

To see all available qualifiers, see our documentation.

Cancel

Create saved search

/;ref_cta:Sign up;ref_loc:header logged out"}" Sign up

Appearance settings

Resetting focus

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

evilsocket

audit

Public

Notifications You must be signed in to change notification settings

Fork 19

Star 160

main

BranchesTags

Go to file

CodeOpen more actions menu

Folders and files NameNameLast commit message Last commit date Latest commit

History 3 Commits 3 Commits

audit

config

docs

prompts

schemas

tests

.env.example

.gitignore

LICENSE

README.md

pyproject.toml

View all files

Repository files navigation

audit

An 8-stage vulnerability-discovery agent, driven by your Claude Pro / Max subscription through the official Claude Code Agent SDK. Many narrow agents, deliberate disagreement, and an explicit reachability gate.

MIT-licensed. No API key needed if you already use claude login.

Origin

This project is a from-scratch reimplementation of the pipeline described in Cloudflare's Project Glasswing post, which tested Anthropic's Mythos preview LLM against Cloudflare's own codebase. The blog argues that real-world vulnerability discovery does not come from asking one big model "find bugs here" — it comes from:

Many narrow agents working in parallel on tightly-scoped questions ("Look for command injection in this specific function, with this trust boundary above it") rather than one exhaustive agent.

Deliberate disagreement — a second agent, on a different model, that tries to disprove the first agent's findings.

A reachability trace as the gating step — most "is this code buggy?" findings are noise unless an attacker-controlled input can actually reach the sink from outside the system.

A feedback loop so reachable bugs in one place automatically seed hunts for the same pattern elsewhere.

This repo packages that pipeline into a runnable agent. The Cloudflare post showed the architecture; this codebase ships the prompts, schemas, state store, and orchestrator.

The 8 stages

Diagram from Cloudflare's Project Glasswing post, reproduced here for reference.

Stage Default model Purpose

Recon Opus 4.7 Map the repo, emit narrowly-scoped Hunt tasks

Hunt Sonnet 4.6 One attack class per agent; compile/run PoCs

Validate Opus 4.7 Adversarial re-read; tries to disprove (different model from Hunt)

Gapfill Sonnet 4.6 Re-queue under-covered areas

Dedupe Sonnet 4.6 Cluster findings by root cause

Trace Opus 4.7 Prove attacker-controlled input reaches the sink

Feedback Sonnet 4.6 Turn reachable traces into new Hunt tasks

Report Sonnet 4.6 Schema-validated structured report

Each stage is one markdown prompt in prompts/ + one JSON Schema in schemas/. The orchestrator passes the schema into the system prompt so every output is shape-stable on the first try.

Quickstart

" > .env

# 3. Verify audit auth-check

# 4. Run audit run --repo /path/to/target --run-id my-run audit status --run-id my-run audit report --run-id my-run --format md > report.md"># 1. Install python -m venv .venv && source .venv/bin/activate pip install -e .

# 2. Auth (pick one) # (a) Already logged in via claude login? You're done. # (b) Or generate a 1-year OAuth token for CI / non-interactive use: claude setup-token echo "CLAUDE_CODE_OAUTH_TOKEN=" > .env

# 3. Verify audit auth-check

# 4. Run audit run --repo /path/to/target --run-id my-run audit status --run-id my-run audit report --run-id my-run --format md > report.md

The agent uses subscription billing via your Claude.ai login — it does not call the metered API. The on-disk auth module scrubs ANTHROPIC_API_KEY from the environment so it can't silently route around the OAuth flow.

Cost containment

A real production codebase can produce 15-50 Hunt tasks and 25+ findings to validate. At default concurrency this gets expensive. Flags to keep it sane:

audit run --repo /path/to/target \ --max-concurrency 1 \ # one claude subprocess at a time --max-recon-tasks 15 \ # cap initial Hunt fanout --max-cost-usd 30 # abort cleanly if exceeded

The budget guard fires between and within stages — a per-task check in Hunt...

Audit: Vuln-discovery agent reimplementing the Cloudflare Project Glasswing

Related Articles

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play

Old Reddit Is Down

The ultimate female fantasy – A feminist critique of Beauty and the Beast