WebGlean – API that turns any site into clean Markdown for LLMs

New: the Extract API is live. Claude-powered structured data extraction from any URL. See the docs.

All 6 APIs live: scrape, crawl, extract, map, monitor, search.

Turn any website into clean data for your AI

WebGlean converts any URL to Markdown, JSON, or plain text, with full JavaScript rendering, AI-powered extraction, and dead-simple credit-based pricing.

6 APIs available

avg. response time

5 output formats

500 free credits to start

Trusted by developers building with AI: Acme AI, BuilderOS, DataStack, ResearchAI, AgentFlow, ShipFast

6 endpoints
Everything you need to ship
One API key. One credit system. Pay only for what you use.

POST /v1/scrape (Scrape): Convert any URL to Markdown, HTML, JSON, or plain text. Full JS rendering via Playwright. 1 credit / page.
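A call to the Scrape endpoint might be assembled as in the sketch below. Only the method and path (POST /v1/scrape) come from this page. The base URL, the Bearer auth scheme, and the body fields `url` and `formats` are assumptions for illustration, so check the docs for the real names.

```python
import json
import urllib.request

# Hypothetical base URL; substitute the real one from the WebGlean docs.
BASE_URL = "https://api.webglean.example"


def build_scrape_request(api_key: str, url: str, formats: list[str]) -> urllib.request.Request:
    """Build (but do not send) a POST /v1/scrape request."""
    # The field names "url" and "formats" are assumptions, not confirmed names.
    body = json.dumps({"url": url, "formats": formats}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/v1/scrape",
        data=body,
        method="POST",
        headers={
            # Bearer auth is an assumed scheme.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


req = build_scrape_request("wg_example_key", "https://example.com", ["markdown"])
print(req.get_method(), req.full_url)
```

Passing the request to `urllib.request.urlopen(req)` would send it. The sketch stops short of that so it runs without network access or a real key.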

POST /v1/crawl (Crawl): BFS-crawl an entire site. Set max depth and page limits. Returns all pages as Markdown. 1 credit / page.
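To illustrate what a breadth-first crawl with depth and page limits looks like, here is a minimal sketch over an in-memory link graph. It is not WebGlean's implementation. The function name, the `get_links` callback, and the sample site are made up for the example.

```python
from collections import deque


def bfs_crawl(start, get_links, max_depth, max_pages):
    """Return pages in breadth-first order, honoring depth and page limits."""
    visited = [start]
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        page, depth = queue.popleft()
        if depth == max_depth:
            continue  # do not follow links beyond the depth limit
        for link in get_links(page):
            if link in seen:
                continue
            if len(visited) >= max_pages:
                return visited  # page limit reached
            seen.add(link)
            visited.append(link)
            queue.append((link, depth + 1))
    return visited


# Hypothetical site: each page maps to the links found on it.
site = {
    "/": ["/docs", "/pricing"],
    "/docs": ["/docs/scrape", "/docs/crawl", "/"],
    "/pricing": ["/"],
    "/docs/scrape": [],
    "/docs/crawl": [],
}
pages = bfs_crawl("/", lambda p: site.get(p, []), max_depth=1, max_pages=10)
print(pages)  # ['/', '/docs', '/pricing']
```

Because BFS visits pages level by level, a page limit cuts off the deepest pages first, which is usually what you want when each crawled page costs a credit.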

POST /v1/extract (Extract, AI): Scrape a URL, then use Claude to pull structured JSON matching any schema you define. 5 credits / page.

POST /v1/map (Map): Find every discoverable URL on a site. Tries sitemap.xml first, falls back to BFS. 1 credit / call.

POST /v1/monitor (Monitor): Register a URL for change detection. Get webhook notifications when content changes. 1 credit / check.

POST /v1/search (Search): Search the web and get scraped Markdown for each result, ready for your LLM. 2 credits / result.
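The per-unit prices listed for the six endpoints can be combined into a quick usage estimate. The sketch below uses only the costs stated above. The function name and the sample workload are illustrative.

```python
# Per-unit credit costs as listed for each endpoint above.
CREDIT_COST = {
    "scrape": 1,   # per page
    "crawl": 1,    # per page
    "extract": 5,  # per page
    "map": 1,      # per call
    "monitor": 1,  # per check
    "search": 2,   # per result
}


def estimate_credits(usage: dict[str, int]) -> int:
    """Sum credits for a workload given as {endpoint: number of units}."""
    unknown = set(usage) - set(CREDIT_COST)
    if unknown:
        raise ValueError(f"unknown endpoints: {sorted(unknown)}")
    return sum(CREDIT_COST[name] * units for name, units in usage.items())


# Hypothetical month: 300 scraped pages, 20 extractions, 50 search results.
monthly = {"scrape": 300, "extract": 20, "search": 50}
print(estimate_credits(monthly))  # 300 + 100 + 100 = 500
```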

Built different
What makes WebGlean different
Designed around developer experience and AI pipeline needs from day one.

Persistent browser pool: No cold Chromium starts. Playwright runs in an always-on worker so your requests return in 1–3 seconds, not 5–8. Every scrape gets a warm, authenticated browser.

AI-native from day one: The Extract API uses Claude to pull any structured schema from any URL, with no XPath, no CSS selectors, and no custom parsers. Describe what you want in JSON and get it back.
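As a rough illustration of "describe what you want in JSON", the sketch below builds an Extract request body and checks a returned object for missing fields. The body fields `url` and `schema`, the use of JSON Schema syntax, and the sample product data are all assumptions. This page does not specify the actual schema format.

```python
import json

# Hypothetical schema in JSON Schema style; the real format may differ.
product_schema = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "price": {"type": "number"},
    },
    "required": ["title", "price"],
}


def build_extract_body(url: str, schema: dict) -> str:
    """Serialize an assumed request body for POST /v1/extract."""
    return json.dumps({"url": url, "schema": schema})


def missing_fields(result: dict, schema: dict) -> list[str]:
    """List required schema keys that are absent from an extraction result."""
    return [key for key in schema.get("required", []) if key not in result]


body = build_extract_body("https://example.com/product/1", product_schema)
print(missing_fields({"title": "Widget"}, product_schema))  # ['price']
```

A local check like `missing_fields` is worth keeping even with model-driven extraction, since a page may simply not contain every field you asked for.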

💳 Pricing that makes sense: One credit balance covers every endpoint. Most calls cost 1 credit per page, Extract costs 5 per page, and Search costs 2 per result. Credits never expire and never reset. Top up without upgrading. No seat fees, no surprises.

🧹 Content Cleaning: Mozilla Readability strips nav, ads, footers, and cookie banners. Your LLM gets signal, not noise.

📄 5 Output Formats: Markdown, HTML, plain text, JSON metadata, and screenshots, in any combination from one call.

🔑 Simple API Key Auth: One key covers all 6 endpoints. Keys are stored as SHA-256 hashes, shown once on creation, and revocable instantly.
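The "hashed, shown once" pattern generally works as sketched below: the service keeps only the SHA-256 digest, so the plaintext key cannot be displayed again after creation. This is a generic illustration, not WebGlean's code. The function names and the `wg_` prefix are hypothetical.

```python
import hashlib
import hmac
import secrets


def create_api_key() -> tuple[str, str]:
    """Return (plaintext_key, stored_hash). Only the hash is persisted."""
    key = "wg_" + secrets.token_urlsafe(32)  # "wg_" prefix is hypothetical
    stored_hash = hashlib.sha256(key.encode("utf-8")).hexdigest()
    return key, stored_hash


def verify_api_key(presented: str, stored_hash: str) -> bool:
    """Hash the presented key and compare in constant time."""
    digest = hashlib.sha256(presented.encode("utf-8")).hexdigest()
    return hmac.compare_digest(digest, stored_hash)


key, stored = create_api_key()
print(verify_api_key(key, stored))        # True
print(verify_api_key(key + "x", stored))  # False
```

Revoking a key in this scheme amounts to deleting or flagging its stored hash, after which verification fails for every request.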

🛡️ Rate Limiting Built In: 60 req/min per key by default. No extra config. Higher limits on Pro and Scale.
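To stay under a 60 req/min limit on the client side, a sliding-window limiter is one option. The sketch below is an assumption about how a caller might pace requests. It says nothing about how the server enforces the limit, and the class name is made up.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Track call timestamps and report how long to wait before the next call."""

    def __init__(self, max_calls=60, period=60.0, clock=time.monotonic):
        self.max_calls = max_calls
        self.period = period
        self.clock = clock  # injectable so the limiter can be tested without sleeping
        self._calls = deque()

    def wait_time(self) -> float:
        """Seconds until another call is allowed (0.0 if allowed now)."""
        now = self.clock()
        # Drop timestamps that have left the window.
        while self._calls and now - self._calls[0] >= self.period:
            self._calls.popleft()
        if len(self._calls) < self.max_calls:
            return 0.0
        return self.period - (now - self._calls[0])

    def record(self) -> None:
        """Register that a call was just made."""
        self._calls.append(self.clock())


limiter = SlidingWindowLimiter()  # defaults match the 60 req/min stated above
delay = limiter.wait_time()
if delay > 0:
    time.sleep(delay)
limiter.record()
```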

🔔 Webhooks: Monitor registers a URL and POSTs a diff to your endpoint when content changes.
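Change detection with a diff can be approximated locally as follows. This is a sketch of the general idea only. The payload keys `url` and `diff`, the unified-diff format, and the function name are assumptions, since this page does not document the webhook body.

```python
import difflib
from typing import Optional


def check_for_change(url: str, previous: str, current: str) -> Optional[dict]:
    """Return a webhook-style payload if content changed, else None."""
    if previous == current:
        return None
    diff_lines = difflib.unified_diff(
        previous.splitlines(),
        current.splitlines(),
        fromfile="previous",
        tofile="current",
        lineterm="",
    )
    # The payload shape is hypothetical.
    return {"url": url, "diff": "\n".join(diff_lines)}


old_page = "Price: $10\nIn stock"
new_page = "Price: $12\nIn stock"
payload = check_for_change("https://example.com/item", old_page, new_page)
print(payload["diff"])
```

The printed diff marks the removed line with `-` and the added line with `+`, which is typically enough for a receiver to decide whether the change matters.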

🗺️ Site Mapping: Map tries sitemap.xml first, then falls back to recursive link extraction. Returns every discoverable URL.
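The sitemap-first strategy can be sketched like this: parse sitemap.xml if one is available, and fall back to link crawling otherwise. The function names and the `crawl_fallback` callback are illustrative, and fetching is left out so the example runs offline.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def urls_from_sitemap(xml_text: str) -> list[str]:
    """Extract every <loc> URL from a sitemap <urlset> document."""
    root = ET.fromstring(xml_text)
    return [
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    ]


def map_site(sitemap_xml, crawl_fallback):
    """Use the sitemap when it yields URLs; otherwise call the crawl fallback."""
    if sitemap_xml:
        try:
            urls = urls_from_sitemap(sitemap_xml)
        except ET.ParseError:
            urls = []  # malformed sitemap: treat as missing
        if urls:
            return urls
    return crawl_fallback()


sample = (
    '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    "<url><loc>https://example.com/</loc></url>"
    "<url><loc>https://example.com/docs</loc></url>"
    "</urlset>"
)
print(map_site(sample, lambda: []))
print(map_site(None, lambda: ["https://example.com/"]))
```

This sketch handles a plain `<urlset>` only. A sitemap index file, which points to further sitemaps, would need an extra step.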

Live demo
Try it now, no account needed
Paste any URL and see clean output in seconds.


Pricing
Simple, transparent pricing
Credits never expire. Top up anytime without upgrading.

Free: $0, 500 credits / month, all 6 APIs, community support

Hobby: $19, 5,000 credits / month, all 6 APIs, email support

Pro (most popular): $79, 50,000 credits / month, all 6 APIs, priority support

Scale: $299, 250,000 credits / month, all 6 APIs, SLA + dedicated support

Need more? Top-up packs: $10 = 1,000 credits — available on any plan, no upgrade required.
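Given the plan prices and the top-up rate above, picking the cheapest option for a given monthly volume is simple arithmetic. The sketch below assumes the listed prices are per month and that top-ups are sold only in whole 1,000-credit packs. Neither detail is stated on this page.

```python
import math

# Plan name -> (price in dollars, included credits), from the pricing list above.
PLANS = {
    "Free": (0, 500),
    "Hobby": (19, 5_000),
    "Pro": (79, 50_000),
    "Scale": (299, 250_000),
}
TOPUP_PRICE = 10       # dollars per pack
TOPUP_CREDITS = 1_000  # credits per pack


def cheapest_plan(credits_needed: int) -> tuple[str, int, int]:
    """Return (plan, total_cost, topup_packs) with the lowest total cost."""
    best = None
    for name, (price, included) in PLANS.items():
        shortfall = max(0, credits_needed - included)
        packs = math.ceil(shortfall / TOPUP_CREDITS)
        total = price + packs * TOPUP_PRICE
        if best is None or total < best[1]:
            best = (name, total, packs)
    return best


print(cheapest_plan(6_000))   # ('Hobby', 29, 1)
print(cheapest_plan(12_000))  # ('Pro', 79, 0)
```

At 6,000 credits, Hobby plus one top-up pack ($29) beats both Free with six packs ($60) and Pro ($79). At 12,000 credits, Pro wins because Hobby would need seven packs ($89).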

Frequently asked questions
Everything you need to know before you start.

How does WebGlean handle JavaScript-heavy sites?
What counts as one credit?
Do unused credits roll over?
How does the Extract API work?
Is there a rate limit?
Can I use WebGlean without creating an account?
What output formats does the Scrape API support?
Do you have Python or Node SDKs?
