Is grep all you need? Lexical VS Sematic Search for Agents

grep vs. RAG: Choosing the Right Search Strategy for AI Agents

Live Webinar 5/27: Dive into ParseBench and learn what it takes to evaluate document OCR for AI Agents Product

LlamaParse Industry-leading document processing

Parse

Extract

Index

Open Source OSS repos trusted by millions of developers

LiteParse

Workflows

LlamaIndex

Solutions

Persona

Engineering & R&D Accelerate product development

Administrative Operations Streamline business processes

Financial Analysts Build AI-powered financial models

Industry

Insurance Automate claims and underwriting

Finance Power financial research

Manufacturing Optimize system uptime

Healthcare & Pharma Accelerate clinical research

Use cases

Financial Due Diligence Speed up compliance reviews

Invoice Processing Automate manual review

Technical Document Search Find answers in complex docs

Customer Support Instant, accurate responses

Docs Resources

-->

Customer stories See real-world success stories

How Jeppesen (a Boeing Company) Saves ~2,000 Engineering Hours with Unified Chat Framework

Company

Persona -->

About us Our mission and story

Careers Join our growing team

-->

Brand Logos and brand guidelines

View open roles at LlamaIndex

Blog Pricing

Grep and what comes with it

Grep and what comes with it Unlocking unstructured documents Building for scale: semantic search and RAG Conclusion: is grep all you need?

Content

Grep and what comes with it Unlocking unstructured documents Building for scale: semantic search and RAG Conclusion: is grep all you need? Follow us on

In a recent paper, Sen et al. argued that grep might be the best interface for a world where search is heavily reshaped by agent harnesses. The idea that filesystem tools will soon overthrow semantic search, and RAG in general, has been circulating for a while, but the debate is mostly structured around text-based documents (markdown files, source code), and rarely accounts for what agents actually encounter day-to-day in most enterprise settings: unstructured documents (PDFs, Office files, images). In this post, we'll look at how and when grep and, more generally, lexical search can be faster and more accurate than RAG, and when it isn't. Grep and what comes with it Lexical search with grep rests on two assumptions: A text corpus is available, typically as a filesystem of text-based files. The corpus is small (hundreds to thousands of documents), so an agent can pick which files to search without drowning in a low signal-to-noise ratio. In most setups, grep is exposed as a bash tool, assuming the agent has access to a shell. grep is an excellent tool for precise substring and regex matching. It has no semantic representation of the query, which is fine as the agent manipulates semantics itself by issuing different search patterns across calls. This makes it great for retrieving highly specific information: a known token, a function name, an error string. Context is king, and the agent rarely uses grep in isolation: it searches for a passage, then expands context by reading the surrounding file. grep has also been around for more than 50 years, so examples of its use are everywhere, which lets LLMs apply it effectively and generalize across patterns and optimizations. Still, lexical search has two big limitations: You can't grep a PDF, an image with text, or an Office document. The corpus lexical search unlocks is exclusively plain text. Despite the growing adoption of markdown and other text formats, most enterprise knowledge remains locked behind unstructured files. Scalability falls apart at corpus size. The original grep takes more than 4 seconds to scan 1,000,000 files for a small pattern. Even with sub-second alternatives like ripgrep , the noise from random, out-of-scope matches quickly fills the agent's context window and pushes relevant information out. For both limitations, there are enterprise-grade approaches that improve the scalability of agentic search while preserving accuracy and latency. Unlocking unstructured documents Most CLI agent harnesses today ship with rudimentary tools for reading PDFs, plus multimodal capabilities for "seeing" images. PDF reading in coding agents, though, is generally inaccurate and lossy: layout-unaware extractors clump tables together, ignore images, and shred columns. Vision is more expensive at inference time, and since models are primarily trained on text, vision-based reading is slower and more prone to errors and hallucinations. A tooling layer that balances accuracy and latency is required to unlock unstructured documents and expose their text content to downstream tools like grep . LlamaIndex offers a set of agent-native tools for this: The LlamaParse MCP is a plug-and-play MCP server that lets agents call the LlamaParse platform for parsing, splitting, and classifying files, with support for 130+ formats and strong accuracy on tables, charts, images, complex layouts, and handwritten text — driven by agentic...

Is grep all you need? Lexical VS Sematic Search for Agents

Related Articles

Amazon, Facebook, FBI have access to a private intelligence-sharing network

Show HN: GoPeek – open links in live mini browser windows without new tabs

Agent Memory: An Anatomy

SpaceX not the behemoth everyone thought

The Mirror Is Part of the Machine