WebGleanNewExtract API is live — Claude-powered structured data extraction from any URL.Extract API is now live.See the docs
All 6 APIs live — scrape, crawl, extract, map, monitor, search<br>Turn any website into<br>clean data for your AI.<br>WebGlean converts any URL to Markdown, JSON, or plain text — with full JavaScript rendering, AI-powered extraction, and dead-simple credit-based pricing.<br>Try it live
APIs available
avg. response time
output formats
500<br>free credits to start
Trusted by developers building with AI<br>Acme AIBuilderOSDataStackResearchAIAgentFlowShipFast
6 endpoints<br>Everything you need to ship<br>One API key. One credit system. Pay only for what you use.
POST/v1/scrape<br>Scrape<br>Convert any URL to Markdown, HTML, JSON, or plain text. Full JS rendering via Playwright.<br>1 credit / pageView docs →
POST/v1/crawl<br>Crawl<br>BFS-crawl an entire site. Set max depth and page limits. Returns all pages as Markdown.<br>1 credit / pageView docs →
AIPOST/v1/extract<br>Extract<br>Scrape a URL then use Claude to pull structured JSON matching any schema you define.<br>5 credits / pageView docs →
POST/v1/map<br>Map<br>Discover all discoverable URLs on a site. Tries sitemap.xml first, falls back to BFS.<br>1 credit / callView docs →
POST/v1/monitor<br>Monitor<br>Register a URL for change detection. Get webhook notifications when content changes.<br>1 credit / checkView docs →
POST/v1/search<br>Search<br>Search the web and get scraped Markdown for each result — ready for your LLM.<br>2 credits / resultView docs →
Built different<br>What makes WebGlean different<br>Designed around developer experience and AI pipeline needs from day one.
Persistent browser pool<br>No cold Chromium starts. Playwright runs in an always-on worker so your requests return in 1–3 seconds, not 5–8. Every scrape gets a warm, authenticated browser.
AI-native from day one<br>The Extract API uses Claude to pull any structured schema from any URL — no XPath, no CSS selectors, no custom parsers. Describe what you want in JSON and get it back.
💳<br>Pricing that makes sense<br>One credit per page, across all endpoints. Credits never expire and never reset. Top up without upgrading. No per-endpoint pricing, no seat fees, no surprises.
🧹Content Cleaning<br>Mozilla Readability strips nav, ads, footers, and cookie banners. Your LLM gets signal, not noise.
📄5 Output Formats<br>Markdown, HTML, plain text, JSON metadata, and screenshots — one call, any combination.
🔑Simple API Key Auth<br>One key covers all 6 endpoints. SHA-256 hashed, shown once on creation, revokable instantly.
🛡️Rate Limiting Built In<br>60 req/min per key by default. No extra config. Higher limits on Pro and Scale.
🔔Webhooks<br>Monitor registers a URL and POSTs a diff to your endpoint when content changes.
🗺️Site Mapping<br>Map tries sitemap.xml first — falls back to recursive link extraction. Returns every discoverable URL.
Live demo<br>Try it now — no account needed<br>Paste any URL and see clean output in seconds.
Scrape
Pricing<br>Simple, transparent pricing<br>Credits never expire. Top up anytime without upgrading.
Free<br>$0<br>500 credits / month<br>500 credits / month<br>All 6 APIs<br>Community support
Hobby<br>$19<br>5,000 credits / month<br>5,000 credits / month<br>All 6 APIs<br>Email support
Most popularPro<br>$79<br>50,000 credits / month<br>50,000 credits / month<br>All 6 APIs<br>Priority support
Scale<br>$299<br>250,000 credits / month<br>250k credits / month<br>All 6 APIs<br>SLA + dedicated support
Need more? Top-up packs: $10 = 1,000 credits — available on any plan, no upgrade required.
Frequently asked questions<br>Everything you need to know before you start.
How does WebGlean handle JavaScript-heavy sites?<br>What counts as one credit?<br>Do unused credits roll over?<br>How does the Extract API work?<br>Is there a rate limit?<br>Can I use WebGlean without creating an account?<br>What output formats does the Scrape API support?<br>Do you have Python or Node SDKs?