Show HN: Kreuzberg Cloud – ultra fast content intelligence

Kreuzberg | Document intelligence

CommunitySign In

Toggle menu

Document intelligencefor AI engineering workflows. Extract structured, machine-readable content from any document and feed it directly into AI agents, pipelines, and applications.

One API

97 file formats

Contact SalesStart Free

Drop a file here Or click to browse

• Limited to 1 page per document • File size capped at small uploads (under 1MB) • Limited to 10 demo requests per IP

Why Kreuzberg? Built for AI engineering workflows

Speed That Unblocks Your Team Process documents in milliseconds instead of seconds! Your RAG pipeline moves at the speed of API calls, not extraction bottlenecks. Index millions of documents without waiting weeks for processing to complete.

Batch-Processing at Scale Effectively process large number of documents in bulk. Kreuzberg is built for batch processing, and our cloud infrastructure is designed to scale.

LLM-Powered Intelligence Go beyond extraction. Use vision language models as an OCR backend, extract structured JSON from documents using a schema, and generate embeddings - all via 146 LLM providers, including local models with zero API key configuration.

Built for AI Teams Kreuzberg is a full toolbox - text extraction, metadata extraction, NER, embedding and chunking, all in a CPU optimized binary

Code Intelligence Extract functions, classes, imports, and symbols from code files across 305 programming languages. Structured output, ready for semantic chunking and RAG pipelines.

Polyglot and multiplatform Get native performance in the language of your choice. Kreuzberg is written in Rust and is shipped for 11 other programming languages. It supports Linux, MacOS and Windows runtimes.

How it works Three steps. One API

01 Send the file Upload via API, SDK, CLI, or Docker. Supports PDFs, images, scanned docs, DOCX, PPTX, XLSX, HTML, and 90+ more formats.

02 We process it Layout detection, OCR when needed, table extraction, optional VLM, and schema validation - all in a single call.

03 Pipe it anywhere JSON response with full document structure. Webhook delivery for async workflows. Plug directly into your embeddings pipeline or RAG framework.

Pricing Pay only for what you extract - no seats, no minimums

Cloud · Pay-as-you-go Production-ready extraction, managed by us.

$0.008/page

First 10,000 pages free

92 file formats, 305 code formats

Images and scanned PDFs supported

OCR, layout detection, table extraction

No monthly minimum

Get started instantly, no card required Try it For Free!

High volume 100K+ pages a month? Let's talk pricing.

Custom/page

Everything from the Pay as you go plan

Discounted per-page rate on the cloud

Contact Sales

Frequently Asked Questions

How fast is 'fast'? Kreuzberg is built on a high-performance Rust core, so most documents are processed almost instantly- in milliseconds instead of seconds. For bulk jobs that's thousands of pages per hour on a single API key.

What file types do you support? PDFs (native and scanned), images (JPG, PNG), Microsoft Office (DOCX, PPTX, XLSX), web content, and plain text. We detect document type automatically and optimize extraction for each format.

Do you handle scanned documents? Yes. Built-in OCR recognizes text in images and scanned PDFs. No additional configuration needed—just send the file and get structured output back.

What happens to my documents? Documents are processed in memory and deleted immediately after extraction. No storage, no indexing. We don't train on your data or use it for model improvement.

I already use your open-source library with good results. Why should I try Kreuzberg cloud? The open-source engine is fully usable and powerful on its own. Kreuzberg Cloud removes the operational complexity, so you can run it in production without worrying about managing infrastructure.

Start Building Today Join thousands of developers already building document intelligence pipelines using Kreuzberg - in their language of choice!

Contact SalesStart Free

We value your privacy Kreuzberg uses cookies to improve your experience, personalize content, and analyze traffic. You can manage your preferences at any time.

CustomizeAccept All

Show HN: Kreuzberg Cloud – ultra fast content intelligence – in public beta

Related Articles

Show HN: Kreuzberg Cloud – ultra fast content intelligence – in public beta

Related Articles

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play

Old Reddit Is Down

The ultimate female fantasy – A feminist critique of Beauty and the Beast