Show HN: Browser-Native GPU Sharing

WebGPU Cluster — Distributed WebGPU inference

Distributed GPU network

AI on the grid

Turn any browser with WebGPU into a cluster node and share inference for LLMs. Host a model on your powerful workstation and access it securely from your phone or laptop, or let others connect to it.



Why join: Browser-native GPU sharing

Contribute spare GPU cycles from your workstation. Clients send images over HTTP; your browser runs the model and returns results — privately, on your hardware.

🖥️

Instant local hosting

Open a tab, pick a model, and start hosting. RF-DETR and SmolVLM load in a Web Worker on WebGPU — no Python environment or driver setup.

🔒

Your data stays local

Inference runs on your GPU in the browser. Images are processed on your machine; nothing is sent to third-party AI APIs.

🔌

Universal HTTP API

Connect from curl, Python, Node, or any HTTP client. Simple JSON endpoints for detection and image description — queue and broker included.

Architecture: How the grid works

A lightweight Node broker coordinates tasks. Browser hosts stay connected via SSE and pull jobs when idle.

Client (curl / Python / app) → Broker (task queue · /v1/detect) → Browser host (WebGPU inference) → Response (boxes · labels · text)
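The flow above can be illustrated with a toy in-memory broker. This is a minimal sketch, not the project's actual Node broker: the class and method names (`Broker`, `submit`, `pull`, `complete`) and the task dictionary layout are hypothetical, and the only behavior taken from the page is that hosts pull jobs when idle and run one job at a time.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Any, Deque, Dict, Optional


@dataclass
class HostState:
    """Per-host bookkeeping (hypothetical; not the project's real data model)."""
    pending: Deque[dict] = field(default_factory=deque)
    busy: bool = False


class Broker:
    """Toy broker: one FIFO queue per host, at most one job in flight per host."""

    def __init__(self) -> None:
        self.hosts: Dict[str, HostState] = {}
        self.results: Dict[int, Any] = {}
        self._next_id = 0

    def register(self, host_id: str) -> None:
        """Record a host so clients can address it by id."""
        self.hosts.setdefault(host_id, HostState())

    def submit(self, host_id: str, endpoint: str, payload: dict) -> int:
        """Queue a client request for a host and return a task id."""
        if host_id not in self.hosts:
            raise KeyError(f"unknown host: {host_id}")
        self._next_id += 1
        task = {"id": self._next_id, "endpoint": endpoint, "payload": payload}
        self.hosts[host_id].pending.append(task)
        return self._next_id

    def pull(self, host_id: str) -> Optional[dict]:
        """Hand the next task to an idle host; return None if busy or empty."""
        state = self.hosts[host_id]
        if state.busy or not state.pending:
            return None
        state.busy = True
        return state.pending.popleft()

    def complete(self, host_id: str, task_id: int, result: Any) -> None:
        """Store the result and mark the host idle again."""
        self.results[task_id] = result
        self.hosts[host_id].busy = False
```

In this sketch a second `pull` for the same host returns `None` until `complete` is called, which is one simple way to get the one-job-per-host behavior.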

Open the host page, choose a host id and model, then click Start hosting. Keep the tab open while you share GPU time.

Jobs arrive via SSE

The broker forwards detection and description tasks to your browser. One job runs at a time per host.
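For readers unfamiliar with SSE, the wire format is plain text: `event:` and `data:` lines, with a blank line ending each event. A browser host would normally consume this through `EventSource`; the Python sketch below only illustrates the format. The event name `job` and the JSON fields in the sample stream are assumptions, not the broker's documented schema, and the parser ignores the `id:` and `retry:` fields for brevity.

```python
import json
from typing import Iterable, Iterator, Tuple


def parse_sse(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (event_name, data) pairs from an iterable of SSE text lines."""
    event, data = "message", []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line ends the current event; skip events with no data.
            if data:
                yield event, "\n".join(data)
            event, data = "message", []
        elif line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        else:
            name, _, value = line.partition(":")
            if value.startswith(" "):
                value = value[1:]  # a single leading space is not part of the value
            if name == "event":
                event = value
            elif name == "data":
                data.append(value)


# Hypothetical stream: the event name and JSON fields are illustrative only.
stream = [
    ": keep-alive",
    "event: job",
    'data: {"id": 1, "endpoint": "/v1/detect", "image_url": "https://example.com/photo.jpg"}',
    "",
]
for name, data in parse_sse(stream):
    job = json.loads(data)
    print(name, job["endpoint"])
```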

Anyone can call the API

Point clients at POST /v1/detect or POST /v1/describe and pass your host id in the request body. Results are returned as JSON.

API: Call the grid from anywhere

Use the cluster monitor to see online hosts and copy ready-made curl examples.

detect.sh: POST /v1/detect

curl -X POST 'http://localhost:5180/v1/detect' \
  -H 'Content-Type: application/json' \
  -d '{
    "host": "my-gpu-node",
    "image_url": "https://example.com/photo.jpg",
    "threshold": 0.5
  }'
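The same request can be sent from Python using only the standard library. This is a sketch that mirrors the curl example: the URL, endpoint, and JSON fields come from that example, while the function names `build_detect_request` and `detect` and the 60-second timeout are illustrative choices. The response schema is not specified on this page, so the result is returned as a plain dictionary.

```python
import json
import urllib.request

BROKER_URL = "http://localhost:5180"  # broker address from the curl example


def build_detect_request(host: str, image_url: str,
                         threshold: float = 0.5) -> urllib.request.Request:
    """Build the POST /v1/detect request without sending it."""
    body = json.dumps(
        {"host": host, "image_url": image_url, "threshold": threshold}
    ).encode("utf-8")
    return urllib.request.Request(
        f"{BROKER_URL}/v1/detect",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def detect(host: str, image_url: str, threshold: float = 0.5,
           timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON response (needs a running broker and host)."""
    req = build_detect_request(host, image_url, threshold)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return json.load(resp)
```

With a broker running locally and a host registered as `my-gpu-node`, calling `detect("my-gpu-node", "https://example.com/photo.jpg")` should correspond to the curl command above.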

Models: What you can host today

Models download from Hugging Face on first load. Pick one per host session.

Detection: RF-DETR Medium

Real-time object detection (COCO) via ONNX on WebGPU. Endpoint: POST /v1/detect

Vision-language: SmolVLM-500M

Describe images with a compact VLM on WebGPU. Endpoint: POST /v1/describe

Ready to power the grid?

Share your GPU or explore nodes already online.

