Show HN: A Reverse Captcha for Clankers

Clanker CAPTCHA Demo

Live challenge

Playable widget The widget here is the exact library a host page would drop in. This page only hands it a mount point and a couple of endpoint URLs.

Server-kept secret The answer never reaches the browser. All the widget gets are the image frames, some public solve parameters, and a challenge id that expires within seconds.

Library-owned metadata The widget injects clanker-agent-task, hidden instructions, current challenge data attributes, and an application/clanker+json manifest.

Why reverse the usual CAPTCHA shape?

The usual CAPTCHA looks for something people find easy and software finds hard. That gap has been closing for years. Clanker CAPTCHA flips it around: the task is a pain to do by hand against a timer, but simple for an agent that can read pixels and do a little math.

None of the method is hidden. The challenge is spelled out for any agent willing to play along, the answer stays on the server, and a solver still has to do the work to find it.

This is a research demo, not a security product. Think of it as a rough sketch of how agent-facing verification might work, not something you would put in front of real abuse.

No hidden answer The browser sees the instructions, the images, and the public parameters. It never sees the checksum.

Agent readable The widget drops in machine-readable metadata and a JSON manifest describing the challenge.

Pixel grounded You are meant to solve it from the rendered frames, not by scraping a value out of the DOM.

Why would a CAPTCHA for agents be useful?

This kind of CAPTCHA does not care whether you are human. What it cares about is whether automated access happened in the open, tied to a live challenge you can measure. That helps when you already expect capable agents to turn up and would rather hand them a clear protocol than treat them as malfunctioning people.

Forget "human or bot." The real question is whether this caller did the requested work for a fresh challenge, followed the public rules, and did it before reaching for the protected action. Whatever signal that produces, a host can weigh it against the usual things: account age, rate limits, reputation, payment status.

Cooperative agents Gives agents a documented way to show they can read the page and respect site policy.

Cost shaping Makes throwaway automation burn real compute on fresh per-session evidence instead of replaying a token.

Audit trail Publishes a structured manifest, so the solve path is easy to inspect when something breaks.

In practice you would put it in front of the expensive actions: creating accounts, hammering a sensitive endpoint, retrying checkout, minting API keys. It will not replace authorization. It just adds a little friction and some evidence, built with browser agents in mind instead of aimed at them.

Why make it hard for a human?

Sometimes the lane you are protecting is meant for software, not hands on a keyboard: agent APIs, automation consoles, bulk jobs, crawler deals. A puzzle a person can solve is the wrong fit there. It just invites people to solve it by hand, pay someone a few cents to click, or screenshot it and pass it along.

Making it hostile to humans on purpose is a way of saying who the lane is for: an accountable agent that reads the pixels and follows the manifest. That is worth doing when you want human flows and machine flows kept apart, rather than jammed behind one checkbox.

Do not use this to lock people out of something they actually need. If a flow is for humans, give humans a way through. The hostile version is for agent-only gates, research demos, and controlled automation surfaces where stopping manual solves is the whole idea.

What signal does the host get?

A normal checkbox really only tells you "something got clicked." This aims for something with more in it: a specific browser session pulled a fresh challenge, showed its manifest, ran the computation, and sent back the checksum and nonce before the clock ran out.

On its own it is not identity, just one input into a bigger decision. A host can pair it with session age, account trust, request velocity, IP reputation, whatever crawler policy it has.

Freshness Each challenge expires and is recorded once on the server, so a stale solve is worthless as a reusable credential.

Page-state awareness The intended solver must inspect rendered frames and the manifest produced by this widget instance.

Compute evidence The checksum requires spectral fusion and the submit body includes a proof-of-work nonce.

Debuggable contract The hidden instructions and JSON manifest make failures explainable for compliant agents and maintainers.

The challenge tricks

It looks chaotic on screen, but the real puzzle is in the frequency domain. Every frame carries the genuine signal, some per-frame decoys, and a layer of noise that is just for show. You have to fuse the frames before you trust whichever peak looks strongest.

Fused frames Every image contributes...

Show HN: A Reverse Captcha for Clankers

Related Articles

Amazon, Facebook, FBI have access to a private intelligence-sharing network

Show HN: GoPeek – open links in live mini browser windows without new tabs

Agent Memory: An Anatomy

SpaceX not the behemoth everyone thought

Naphtha Shortages Having a Growing Impact in Japan