Show HN: Lore – LLM proxy for coding agent context and memory management

Lore.AI — Shared Context for AI Agents

loading… Lore. The memory that compounds.

Stop re-explaining

your project to your AI. Your team's memory, in every session. Lore gives AI agents persistent shared context — capturing decisions, file paths, and patterns across sessions lasting days and hundreds of turns. No context files to maintain. No workflow changes. {this.querySelector('.install-copied').classList.add('show');setTimeout(()=>this.querySelector('.install-copied').classList.remove('show'),2000)})"> $ curl -fsSL https://withlore.ai/install | bash Copied! or npx @loreai/gateway

Join the Waitlist View Repository Read Docs

Lore.AI Gradient Context Lore Distillation Any Provider* Recall Tool .lore.md Sync On-Device Vector Search Import History Cost-Aware Caching Sessions Lasting Days

+67% vs Compaction at 2.3M Tokens

2.6x Total Recall vs Compaction

2.3M+ Token Sessions Tested

2.3M tokens, 5 days, 2.6x total recall ◆ Compaction: 2.4/5. Lore: 4.0/5. 68 min/day re-explaining ◆ Lore remembers for you Your tools change. Your memory doesn't. ◆ Lore is your constant Total amnesia on new sessions ◆ Lore persists across sessions 49 manual learnings ◆ Lore curates automatically 5 feedback loops ◆ Your agent improves every session 2.3M tokens, 5 days, 2.6x total recall ◆ Compaction: 2.4/5. Lore: 4.0/5. 68 min/day re-explaining ◆ Lore remembers for you Your tools change. Your memory doesn't. ◆ Lore is your constant Total amnesia on new sessions ◆ Lore persists across sessions 49 manual learnings ◆ Lore curates automatically 5 feedback loops ◆ Your agent improves every session

The Problem Context loss is invisible.

There's no error message when your AI forgets. Just worse answers, undone decisions, and hours spent re-explaining.

01 Compaction destroys details When the context window fills up, your AI tool compacts the conversation. In a real 5-day coding session, compaction reduces 2.3 million tokens to an 11K summary — a 200x compression that loses which issues were picked, what alternatives were rejected, and why. It scores 2.4/5 on recall. Lore scores 4.0/5.

02 Starting fresh is starting from zero Most developers see "Compacting conversation" and start a new session. That trades compaction for total amnesia. The new session produces output that looks fine — but it's working from incomplete information, and you can't tell.

03 Manual context files don't scale The alternative is maintaining context files, key technical learnings, and decision rationales — by hand. It works, but it's a second full-time job. One team tracked 49 technical learnings manually. Every decision needs the "why" or the AI will refactor it away.

The Solution How Lore replaces all of that

01 Intercept Lore sits between your AI client and the upstream API. It captures every message — no client changes needed, just change the base URL. Works with Claude Code, OpenCode, Pi, Codex, and any Anthropic/OpenAI-compatible tool.

02 Distill Lore replaces compaction entirely. Instead of lossy summaries that forget your file paths and decisions, it distills conversations into timestamped observation logs — the operational details your AI actually needs to keep working. Your manual "Key Technical Learnings"? Lore extracts and maintains them automatically.

03 Recall Details from every session are searchable — even hundreds of turns later. When the distilled context isn't enough, your agent's recall tool retrieves the exact file path, error message, or decision rationale it needs. In our 2.3M-token benchmark: 2.6x total recall over compaction — 13 perfect scores vs 5.

Why not both? Context management and memory are the same problem.

Other tools force you to solve them separately. Lore treats them as one continuous pipeline. See how Lore compares →

01 Memory alone isn't enough Storing past conversations and searching them later is only half the problem. If your AI still gets compacted mid-session and loses track of what it's doing right now, a memory layer can't help — it doesn't know what's missing until you ask. Memory is only useful if it reaches the AI at the right time.

02 Context management alone doesn't learn Compressing conversation history keeps the current session alive, but nothing is extracted from the compression. Start a new session and you're back to zero. Switch tools and the knowledge stays behind. Nothing transfers to other projects, team members, or even other models.

03 Lore connects them into one pipeline In Lore, context compression is the memory pipeline. Distillation feeds the gradient context manager, which feeds the knowledge curator, which feeds .lore.md — and with Folk Lore, your team. Every conversation makes every future session smarter, across any provider, any tool, any team member. Every new session starts with the relevant facts and gets a fresh injection after the first turn. Read the docs →

Persistence Decisions stick Your AI won't refactor away deliberate decisions. Lore preserves the "why"...

Show HN: Lore – LLM proxy for coding agent context and memory management

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

Apple WWDC 2026 Livestream

Claude Fable 5

It's Not Just X. It's Y

Show HN: GoPeek – open links in live mini browser windows without new tabs