LMIM OS – an offline AI ecosystem. Voice, RAG, WhatsApp. ++ One file. 0 setup

LMIM OS v2.1 'Tezcat · Sharpened' — Linux + Windows Live Now

What can LMIM do? 19+

Everything. On your machine.

19+ tools — no cloud, no API key, no subscription. All in one AppImage / Installer.

⚡ Core AI

🧠Local LLMQwen 3.5 · bundled

📄RAG LiteDocument Q&A · local embeddings

📁WorkspaceSandboxed file ops

🎙VoiceSTT + TTS · 5 languages

💭Persistent memoryEpisodic + semantic

⚡CUDA acceleration3–15× faster on NVIDIA

🎯Prime DirectiveStanding instructions

🛠 Operations

🤖Build systemPlanner · Builder · Inspector

🚀Campaign BlasterBulk WhatsApp + email

🧰Developer toolboxHash · minify · JSON

📅Visual agendaCalendar + scheduling agent

📡 Services

💬WhatsAppBaileys · QR · auto-reply

✈️TelegramBot API · polling

📧EmailSMTP · IMAP

💼SlackBot · webhooks

🎮DiscordBot · webhooks

🧬 Intelligence

🕷Web Scraper10 URLs · batch · LMIM analysis

📇ContactsNatural name resolution

🎨Image generationOn-device · v2.2

🛡Horus securityAudit trails · v3.0

⬇ Download Learn more

⬢ Tezcat · Sharpened · v2.1.1

LMIM OS v2.1 — The same fire. Sharper edge.

RAG. Workspace sandbox. Web scraper. Contacts. Voice on GPU. Bundled model. CUDA. Linux + Windows live now.

📄 RAG Lite — doc Q&A

📁 Workspace sandbox

🕷 Web Scraper · batch

📇 Contacts · names

✦ Also in v2.1 / v2.1.1

🎯 Prime Directive — standing instructions 🧠 Hardened 4-layer tool parser 🎙 Voice device routing + CPU bug fixes

Download v2.1 — Linux + Windows →

Download manual (.md)

Shape v3 — what should we build?

🐧 Linux · live now 🪟 Windows · live now $0 forever

What should we focus on? (tap any · multi-select)

🧠 More models / model variety ⚡ Speed & performance 🔗 New integrations / daemons 🎨 UI / UX polish 🎙 Voice & audio 🤖 Smarter agents / planning 🧰 More developer tools 📱 Mobile / remote access 📚 Documentation & tutorials 🐛 Bug fixes & stability 🏢 Enterprise / team features ✨ Something else

Tell us more (optional · the more specific the better)

Want a follow-up? (optional · only used to reply)

Maybe later

Send to LMIM → Sending…

Stored locally on the LMIM server as JSON. Email used only to reply. No tracking. Privacy policy.

Logged. Thank you.

Your input goes straight to the v3 planning notes. If you left an email, expect a reply within a week.

⬢ v2.1 'Tezcat · Sharpened' is live now — RAG, workspace, scraper, contacts, CUDA, bundled model. Linux + Windows Live · Download v2.1 →

⬢ v2.1 — Tezcat · Sharpened 🐧 Linux Live Now · 🪟 Windows Live Now ✦ $0 forever · model included

LEAN MEAN INFERENCE MACHINE

Local AI That

Actually Does Things

The same fire. Sharper edge. RAG, sandboxed workspace, web scraper, contacts, voice on GPU.

Bundled model. CUDA acceleration. One file. Everything included. Linux + Windows live now.

Download v2.1 — Live Now

See what's new →

LMIM OS v2.1 · Tezcat · Sharpened · Linux + Windows Live

1,000+ downloads

20+ countries

Zero cloud / telemetry

MIT open source

$0 forever

Scroll to explore

⬢ v2.1 'Tezcat · Sharpened' — Live Now · Linux AppImage + Windows Installer

RAG Lite: Drop a PDF/TXT/MD — ask anything about it. Local embeddings, never leaves your machine.

Workspace sandbox: Point LMIM at a folder. Reads, writes, creates, edits. Path traversal blocked at backend.

Web Scraper: Batch 10 URLs in parallel. LMIM analyzes them based on your stated purpose.

Contacts: Natural-language name resolution — say "Send WhatsApp to Maria" and she's found.

4-layer tool parser: Strict → embedded → relaxed → regex. No more silent failures mid-chain.

CPU/GPU toggle + one-click model download: Qwen 3.5 9B streamed straight to ~/.lmim_os/models/.

Windows CPU Voice Fix: Solved audio routing bugs on CPU. Whisper STT and Piper TTS now run flawlessly on Windows without GPU.

Chat Performance: Optimized token streaming and UI rendering for smoother, faster conversations.

✦ v3 in planning — Horus security engine, image generation, plugin system. See what's coming →

📄

RAG Lite + Workspace

Drop a document. Ask anything about it. Point it at a folder and let it build.

Local embeddings · all-MiniLM-L6-v2

Smart chunking · respects headings + code blocks

MMR reranking · non-redundant retrieval

Sandboxed file ops · safe_path() blocks traversal

Just say it.

"Summarize this contract in 3 bullets." Reads the PDF, answers only from what's inside.

"Refactor main.py to use async." Reads, edits, runs — entirely within your workspace.

"What are the risks in this doc?" Suggested prompts auto-generated on upload.

🕷

Scraper + Contacts

Batch 10 URLs in parallel. Name-resolve anyone in your contacts.

Parallel fetch · 30s per URL

Basic mode or LMIM analysis mode

Natural-language contact lookup

"Send WhatsApp to Maria" — resolved automatically

Token generation · Qwen 3.5 2B tok/s

CPU only

GTX 1650 Ti

RTX 3060

120

measured · same model · same prompt

Hardened.

4-layer tool parser. Strict → embedded → relaxed → regex. No more silent failures.

Qwen3 reasoning_content...

LMIM OS – an offline AI ecosystem. Voice, RAG, WhatsApp. ++ One file. 0 setup

Related Articles

Amazon, Facebook, FBI have access to a private intelligence-sharing network

SpaceX not the behemoth everyone thought

The Mirror Is Part of the Machine

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits