Mnemosyne – Memory for AI Hermes Agents, Sub-Millisecond Recalls, Local First


v3.0.0: MEMORIA Fact Engine

Memory for Hermes

The native memory system for Hermes Agent. SQLite-backed, sub-millisecond, zero dependencies. No cloud. No API keys. Just pure speed.

Migrate from Zep, Mem0, Honcho, or Hindsight in one command; see the migration docs.

Query latency: sub-millisecond
Dependencies: 0
LongMemEval: 98.9%
Local & private: 100%

Simple

Three lines. Infinite memory.

No configuration files. No environment variables. No cloud accounts. Import, remember, recall. That is all.

pip install mnemosyne-memory

Zero external services required
Works offline, always
Hermes Agent integration built-in

from mnemosyne import remember, recall

# Store a memory
remember(
    "User prefers dark mode",
    importance=0.9,
    scope="global",
)

# Retrieve relevant context
results = recall("user preferences")
# => [{"content": "User prefers dark mode", ...}]

Features

Everything you need. Nothing you do not.

Built from the ground up for AI agents that need fast, reliable, persistent memory.

Sub-Millisecond Latency
Direct SQLite access delivers <1ms queries. No network overhead. No HTTP roundtrips.
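To get a feel for what in-process SQLite access costs on your own machine, a minimal timing sketch follows. It uses only Python's standard `sqlite3` module; the `memories` table and the `timed_lookup` helper are hypothetical names chosen for illustration and are not Mnemosyne's actual schema or API. Measured times will vary by hardware.

```python
import sqlite3
import time

# Hypothetical table for illustration only; not Mnemosyne's real schema.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE memories (id INTEGER PRIMARY KEY, content TEXT)")
conn.executemany(
    "INSERT INTO memories (content) VALUES (?)",
    [(f"memory {i}",) for i in range(10_000)],
)
conn.commit()


def timed_lookup(memory_id: int) -> tuple[str, float]:
    """Fetch one row by primary key and return (content, elapsed milliseconds)."""
    start = time.perf_counter()
    row = conn.execute(
        "SELECT content FROM memories WHERE id = ?", (memory_id,)
    ).fetchone()
    elapsed_ms = (time.perf_counter() - start) * 1000
    return row[0], elapsed_ms


content, elapsed_ms = timed_lookup(5000)
print(f"{content!r} fetched in {elapsed_ms:.3f} ms")
```

Because the query runs inside the calling process, the measurement contains no network or HTTP time, which is the property the claim above rests on.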

100% Private
All data stays on your machine. No cloud services. No data leaves your device, ever.

Native Vector Search
sqlite-vec integration for semantic search. Hybrid ranking: 50% vector + 30% FTS + 20% importance.
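The 50/30/20 weighting can be read as a weighted sum of three scores. The sketch below shows that arithmetic under the assumption that each component has already been normalized to the range 0 to 1; the `Candidate` class and the `hybrid_score` and `rank` functions are hypothetical names, and how Mnemosyne actually normalizes and combines scores is not stated here.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    content: str
    vector_sim: float  # assumed normalized to [0, 1]
    fts_score: float   # assumed normalized to [0, 1]
    importance: float  # assumed in [0, 1], as in remember(importance=0.9)


# Weights taken from the text: 50% vector + 30% FTS + 20% importance.
W_VECTOR, W_FTS, W_IMPORTANCE = 0.5, 0.3, 0.2


def hybrid_score(c: Candidate) -> float:
    """Weighted sum of the three component scores."""
    return W_VECTOR * c.vector_sim + W_FTS * c.fts_score + W_IMPORTANCE * c.importance


def rank(candidates: list[Candidate]) -> list[Candidate]:
    """Return candidates ordered from highest to lowest hybrid score."""
    return sorted(candidates, key=hybrid_score, reverse=True)


a = Candidate("User prefers dark mode", vector_sim=0.9, fts_score=0.5, importance=0.9)
b = Candidate("User asked about weather", vector_sim=0.4, fts_score=0.8, importance=0.2)
for c in rank([b, a]):
    print(f"{hybrid_score(c):.2f}  {c.content}")
```

In this example the second candidate has the better full-text match, yet the first still ranks higher because vector similarity carries the largest weight.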

BEAM Architecture
Three-tier memory: working_memory for hot context, episodic_memory for long-term, scratchpad for reasoning.
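One way to picture the three tiers is as three tables in a single SQLite file. The sketch below is an assumption for illustration: only the tier names `working_memory`, `episodic_memory`, and `scratchpad` come from the text, while every column and the `open_store` helper are hypothetical.

```python
import sqlite3

# Tier names come from the text; all columns are hypothetical.
SCHEMA = """
CREATE TABLE working_memory (
    id INTEGER PRIMARY KEY,
    content TEXT NOT NULL,
    importance REAL DEFAULT 0.5,
    created_at REAL NOT NULL
);
CREATE TABLE episodic_memory (
    id INTEGER PRIMARY KEY,
    summary TEXT NOT NULL,
    created_at REAL NOT NULL
);
CREATE TABLE scratchpad (
    id INTEGER PRIMARY KEY,
    note TEXT NOT NULL
);
"""


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open a SQLite database and create the three tier tables."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


conn = open_store()
tables = sorted(
    row[0]
    for row in conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
)
print(tables)
```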

Auto Consolidation
Old working memories are automatically summarized and moved to episodic storage via sleep cycles. Configurable auto_sleep intervals.
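A sleep cycle of this kind can be sketched as: select working memories older than a cutoff, summarize them, write the summary to episodic storage, and delete the originals in one transaction. Everything below is a hypothetical illustration, including the `consolidate` function, the column names, and the placeholder summarizer that simply joins the texts; the real summarization step is not described in the text.

```python
import sqlite3


def consolidate(conn: sqlite3.Connection, now: float, max_age_s: float) -> int:
    """Move working memories older than max_age_s into one episodic summary.

    Returns the number of working memories that were consolidated.
    """
    cutoff = now - max_age_s
    rows = conn.execute(
        "SELECT id, content FROM working_memory WHERE created_at < ? ORDER BY created_at",
        (cutoff,),
    ).fetchall()
    if not rows:
        return 0
    # Placeholder summarizer (an assumption): join the old contents.
    summary = " | ".join(content for _, content in rows)
    with conn:  # commit on success, roll back if either statement raises
        conn.execute(
            "INSERT INTO episodic_memory (summary, created_at) VALUES (?, ?)",
            (summary, now),
        )
        conn.executemany(
            "DELETE FROM working_memory WHERE id = ?", [(row_id,) for row_id, _ in rows]
        )
    return len(rows)


conn = sqlite3.connect(":memory:")
conn.executescript(
    "CREATE TABLE working_memory (id INTEGER PRIMARY KEY, content TEXT, created_at REAL);"
    "CREATE TABLE episodic_memory (id INTEGER PRIMARY KEY, summary TEXT, created_at REAL);"
)
conn.executemany(
    "INSERT INTO working_memory (content, created_at) VALUES (?, ?)",
    [("User prefers dark mode", 0.0), ("User lives in Berlin", 10.0), ("User asked about backups", 95.0)],
)
moved = consolidate(conn, now=100.0, max_age_s=60.0)
remaining = conn.execute("SELECT COUNT(*) FROM working_memory").fetchone()[0]
summary = conn.execute("SELECT summary FROM episodic_memory").fetchone()[0]
print(moved, remaining, summary)
```

Doing the insert and the deletes in a single transaction means a crash mid-cycle cannot leave a memory in both tiers or in neither.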

Hybrid Search
Combines vector similarity, full-text search, and importance scoring for the best recall accuracy.

Streaming & DeltaSync
Real-time incremental memory updates via DeltaSync. Stream results as they arrive, with no more waiting for full batches.

Smart Filtering
ignore_patterns blocks noisy or irrelevant content from entering memory. Keep your context window clean and focused.
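Pattern-based filtering of this kind can be sketched with regular expressions applied before a memory is stored. The patterns and the `should_store` helper below are hypothetical; the text names `ignore_patterns` but does not say what syntax it accepts, so treating the entries as Python regexes is an assumption.

```python
import re

# Hypothetical patterns: blank input, one-word acknowledgements, credential-like text.
IGNORE_PATTERNS = [
    r"^\s*$",
    r"(?i)^(ok|thanks|thank you)[.!]?$",
    r"(?i)api[_-]?key\s*[:=]",
]
_compiled = [re.compile(pattern) for pattern in IGNORE_PATTERNS]


def should_store(text: str) -> bool:
    """Return False when any ignore pattern matches the text."""
    return not any(pattern.search(text) for pattern in _compiled)


messages = [
    "User prefers dark mode",
    "thanks!",
    "   ",
    "API_KEY=abc123",
    "Deploys happen on Fridays",
]
kept = [m for m in messages if should_store(m)]
print(kept)
```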

Speed

Numbers that speak

Measured on CPU with sqlite-vec + FTS5. No GPU required.

Write: 0.81 ms (56x faster)
Read: 0.076 ms (500x faster)
Search: 1.2 ms (43x faster)
Cold start: 0 ms (instant)
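The quoted multiples are consistent with dividing the Honcho latency for each operation by the Mnemosyne latency, using the per-provider figures reported in this section. The short check below reproduces that arithmetic; the `LATENCY_MS` dictionary and `speedup` function are names introduced here for illustration.

```python
# Latencies in milliseconds, copied from the comparison figures in this section.
LATENCY_MS = {
    "write": {"Honcho": 45, "Zep": 85, "Mem0": 50, "Mnemosyne": 0.81},
    "read": {"Honcho": 38, "Zep": 62, "Mem0": 45, "Mnemosyne": 0.076},
    "search": {"Honcho": 52, "Zep": 78, "Mem0": 60, "Mnemosyne": 1.2},
}


def speedup(operation: str, baseline: str) -> float:
    """Ratio of the baseline provider's latency to Mnemosyne's latency."""
    row = LATENCY_MS[operation]
    return row[baseline] / row["Mnemosyne"]


for operation in LATENCY_MS:
    print(f"{operation}: {round(speedup(operation, 'Honcho'))}x vs Honcho")
```

Honcho has the lowest latency of the three cloud providers in each row, so these multiples are the most conservative ones the figures support.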

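| Operation  | Honcho | Zep    | Mem0   | Mnemosyne |
|------------|--------|--------|--------|-----------|
| Write      | 45 ms  | 85 ms  | 50 ms  | 0.81 ms   |
| Read       | 38 ms  | 62 ms  | 45 ms  | 0.076 ms  |
| Search     | 52 ms  | 78 ms  | 60 ms  | 1.2 ms    |
| Cold start | 500 ms | 800 ms | 300 ms | 0 ms      |

BEAM Benchmark (ICLR 2026)

End-to-end memory retrieval at scale. LLM-as-judge against published baselines.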

100K context: 35.4% (retrieval from 100K-token conversations)
500K context: 19.3% (retrieval from 500K-token conversations)
1M context: 19.2% (retrieval from 1M-token conversations)

Compare

Mnemosyne vs. cloud memory providers

See exactly what you gain, and what you trade, when you switch.

| Feature | Mnemosyne | Honcho | Zep | Mem0 |
|---------|-----------|--------|-----|------|
| Cost | Free forever | $$$ Paid (credits) | $$$ Paid (Flex+) | Freemium ($0-$249/mo) |
| Hosting | Local (your machine) | Cloud only | Cloud / BYOC | Cloud only |
| Privacy | 100% local, zero exfiltration | External API calls | External API calls | External API calls |
| Offline mode | Yes (airplane mode) | No | No | No |
| Setup | pip install | Docker + API keys | Docker + Postgres | API key + signup |
| Vector store | sqlite-vec (built-in) | pgvector (external) | pgvector (external) | pgvector (external) |
| Full-text search | FTS5 (built-in) | Separate service | Separate service | Separate service |
| Auth required | None | Supabase auth | OAuth / API key | API key |
| Rate limits | Unlimited | Plan-dependent | Credit-based | Plan-dependent |
| Data ownership | You own the SQLite file | Vendor-hosted | Vendor-hosted | Vendor-hosted |
| Export / import | One JSON file | Limited | Limited | Limited |
| Dependencies | Python stdlib + ONNX | Docker, Postgres | Docker, Postgres | pip + API key |
| Memory architecture | BEAM (3-tier) | Session + facts | Graph RAG + facts | Session + facts |
| Auto-consolidation | Sleep cycles built-in | Manual / paid | Manual | Manual |
| Temporal triples | Native with validity | No | No | No |
| LongMemEval | 98.9% Recall@All@5 | Not published | Not published | Not published |
| BEAM (100K / 500K / 1M) | 35.4% / 19.3% / 19.2% | Not published | Not published | Not published |

Switching from Honcho

You gain: 500x faster reads, zero monthly bill, 100% offline, no Docker, no credit system

You lose: Cloud dashboard, managed scaling, team sharing

Switching from Zep

You gain: 43x faster search, no PostgreSQL to maintain, no deployment overhead, instant cold start

You lose: Graph RAG visualization, SOC 2 certifications, managed BYOC

Switching from Mem0

You gain: Sub-millisecond everything, no rate limits, no vendor lock-in, full data portability

You lose: Managed platform, 90K+ community, YC ecosystem

Switching from Hindsight

You gain: Zero dependencies, no network calls, SQLite-native, BEAM architecture

You lose: Cloud sync, managed inference, web dashboard

The bottom line

✓ Speed: 43-500x faster than cloud alternatives, with zero HTTP roundtrips.
✓ Privacy: Data never leaves your machine. No API calls. No telemetry.
✓ Cost: Zero ongoing cost. No credits. No tiers. No "contact sales."
✓ Simplicity: One pip install. No Docker. No config. No signup.

Trade-off: You manage your own backup (one SQLite file). No web...
