Kreuzberg | Document intelligence
Document intelligence for AI engineering workflows.<br>Extract structured, machine-readable content from any document and feed it directly into AI agents, pipelines, and applications.
One API
97 file formats
Demo limits:<br>• 1 page per document<br>• Uploads must be under 1 MB<br>• 10 demo requests per IP
Why Kreuzberg?<br>Built for AI engineering workflows
Speed That Unblocks Your Team<br>Process documents in milliseconds instead of seconds! Your RAG pipeline moves at the speed of API calls, not extraction bottlenecks. Index millions of documents without waiting weeks for processing to complete.
Batch Processing at Scale<br>Process large numbers of documents in bulk efficiently. Kreuzberg is built for batch processing, and our cloud infrastructure is designed to scale.
LLM-Powered Intelligence<br>Go beyond extraction. Use vision language models as an OCR backend, extract structured JSON from documents using a schema, and generate embeddings - all via 146 LLM providers, including local models with zero API key configuration.
Built for AI Teams<br>Kreuzberg is a full toolbox: text extraction, metadata extraction, NER, embedding, and chunking, all in a CPU-optimized binary.
Code Intelligence<br>Extract functions, classes, imports, and symbols from code files across 305 programming languages. Structured output, ready for semantic chunking and RAG pipelines.
Polyglot and multiplatform<br>Get native performance in the language of your choice. Kreuzberg is written in Rust and ships for 11 other programming languages. It supports Linux, macOS, and Windows.
How it works<br>Three steps. One API
01<br>Send the file<br>Upload via API, SDK, CLI, or Docker. Supports PDFs, images, scanned docs, DOCX, PPTX, XLSX, HTML, and 90+ more formats.
02<br>We process it<br>Layout detection, OCR when needed, table extraction, optional VLM, and schema validation - all in a single call.
03<br>Pipe it anywhere<br>JSON response with full document structure. Webhook delivery for async workflows. Plug directly into your embeddings pipeline or RAG framework.
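The "Send the file" step can be sketched as a multipart upload. This page does not document the request format, so the endpoint URL, the `file` form field, and the bearer-token header below are all hypothetical placeholders, not the real Kreuzberg Cloud API. The sketch only builds the request with the Python standard library and does not send it.

```python
import mimetypes
import uuid
from urllib.request import Request

# Hypothetical endpoint: replace with the URL from the official API docs.
EXTRACT_URL = "https://api.example.invalid/v1/extract"


def build_extract_request(filename: str, data: bytes, api_key: str) -> Request:
    """Build (but do not send) a multipart/form-data POST carrying one file.

    The form field name "file" and the Bearer auth scheme are assumptions.
    The filename is not escaped, so avoid names containing double quotes.
    """
    boundary = uuid.uuid4().hex
    content_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    body = b"".join([
        f"--{boundary}\r\n".encode(),
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'.encode(),
        f"Content-Type: {content_type}\r\n\r\n".encode(),
        data,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    return Request(
        EXTRACT_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

Passing the returned object to `urllib.request.urlopen` would perform the upload. Under the same assumptions, the JSON response described in step 03 could then be decoded with `json.loads` before it is handed to an embeddings pipeline.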
Pricing<br>Pay only for what you extract - no seats, no minimums
Cloud · Pay-as-you-go<br>Production-ready extraction, managed by us.
$0.008/page
First 10,000 pages free
92 file formats, 305 code formats
Images and scanned PDFs supported
OCR, layout detection, table extraction
No monthly minimum
Get started instantly, no card required<br>Try it for free!
High volume<br>100K+ pages a month? Let's talk pricing.
Custom/page
Everything in the Pay-as-you-go plan
Discounted per-page rate on the cloud
Contact Sales
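As a rough worked example of the pay-as-you-go numbers above, the sketch below estimates a bill from a page count. The per-page price and the free allowance come from the pricing card. The page does not say whether the 10,000 free pages are a one-time or a recurring allowance, so the function takes the remaining free pages as a parameter. The function and constant names are illustrative only.

```python
from decimal import Decimal

PRICE_PER_PAGE = Decimal("0.008")  # USD per page, from the pricing card
FREE_PAGES = 10_000                # free allowance, from the pricing card
HIGH_VOLUME_PAGES = 100_000        # monthly volume at which custom pricing is offered


def estimate_cost(pages: int, free_pages_left: int = FREE_PAGES) -> Decimal:
    """Estimate the pay-as-you-go charge in USD for `pages` extracted pages.

    `free_pages_left` is how much of the free allowance is still unused;
    whether that allowance ever resets is an open assumption.
    """
    if pages < 0 or free_pages_left < 0:
        raise ValueError("page counts must be non-negative")
    billable = max(0, pages - free_pages_left)
    return billable * PRICE_PER_PAGE


def qualifies_for_high_volume(pages_per_month: int) -> bool:
    """True when monthly volume reaches the 100K+ tier mentioned above."""
    return pages_per_month >= HIGH_VOLUME_PAGES
```

For instance, 50,000 pages with the full allowance unused leaves 40,000 billable pages, which comes to 40,000 × $0.008 = $320. With the allowance already spent, the same volume would be $400.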
Frequently Asked Questions
How fast is 'fast'?<br>Kreuzberg is built on a high-performance Rust core, so most documents are processed almost instantly, in milliseconds instead of seconds. For bulk jobs, that's thousands of pages per hour on a single API key.
What file types do you support?<br>PDFs (native and scanned), images (JPG, PNG), Microsoft Office (DOCX, PPTX, XLSX), web content, and plain text. We detect document type automatically and optimize extraction for each format.
Do you handle scanned documents?<br>Yes. Built-in OCR recognizes text in images and scanned PDFs. No additional configuration is needed: just send the file and get structured output back.
What happens to my documents?<br>Documents are processed in memory and deleted immediately after extraction. No storage, no indexing. We don't train on your data or use it for model improvement.
I already use your open-source library with good results. Why should I try Kreuzberg Cloud?<br>The open-source engine is fully usable and powerful on its own. Kreuzberg Cloud removes the operational complexity, so you can run it in production without managing infrastructure.
Start Building Today<br>Join thousands of developers already building document intelligence pipelines using Kreuzberg - in their language of choice!