Lamini: Build mini-agents with 90%+ accuracy

doener1 pts0 comments

Introduction - Lamini Docs

Skip to content

Initializing search

Perfect For

Real-World Applications

Getting Started Is Easy

Who are we?

What's new?

Quick Start

FAQ

Memory Experiments

Memory Tuning

Inference

Platform

Self-Hosted Lamini

License

Python SDK

REST API

Perfect For

Real-World Applications

Getting Started Is Easy

Who are we?

What's new?

Welcome to Lamini 🦙

Build mini-agents with 90%+ accuracy, whether you're a solo developer or an enterprise team. Get started with $300 in free credits.

Quick Navigation

Goal<br>Description<br>Link

🚀 Get Started<br>Boost your mini LLMs from 50% to 90%+ accuracy<br>Quick Start

💡 Try It Out<br>Test your PDF knowledge base<br>Playground (with Memory RAG)

🎯 Memory Tuning<br>Build accurate, efficient models<br>Memory Tuning

🤖 RAG Tools<br>Create reliable mini-agents<br>Memory RAG

🎯 Classification<br>Deploy scalable classifiers<br>Classifier Agent

🔒 Self-Hosted<br>Install Lamini on your own GPUs<br>Kubernetes Install

Questions? Contact us. We read every message — or at one of our mini-agents does :)

Core Products

Memory Tuning [paper] [class with Andrew Ng & Meta] [about]

Build the most accurate and efficient fine-tuned models:

Inject precise facts to eliminate hallucinations

Start with only 10 facts & examples, scale to 100,000+

Reliably get 95%+ accuracy (removes accuracy ceilings on many tasks)

Keep latency and costs low, by getting away with memory-tuned smaller LMs and mini-agents

One API, any open model

Memory RAG [paper] [about]

Skip the complex RAG setup. Easier than Memory Tuning. Get 90%+ accuracy out of the box:

Boost accuracy from 50% to 90-95% compared to GPT4, after just a few iterations on your data and telling the model how to improve

Smart embedding that expands your data representation to capture true meaning and relationships

Build reliable, specializedmini-agents that work together

Simple API, powerful results

Classifier Agent Toolkit [demo] [about]

Build accurate classifiers in minutes, not months:

Handle any number of categories, from 2 to 1000+

Process unstructured data at scale with 400K tokens/second

Route requests automatically, with 99.9% accuracy

Triage code and content efficiently

Perfect For

Developers & Startups

Simple SDK and API

Start free, scale as you grow

Clear documentation and examples

Fast integration into your stack, OpenAI API compatible

Enterprise Teams

Production-ready security

Air-gapped deployment option

Scale across departments

Custom deployment support

Reduce production risks with 99.9% accuracy

Real-World Applications

Build what matters to you:

SQL Generator : Convert natural language to database queries

Customer Support Agent : Scale customer service intelligently

Data Classifier : Automate manual sorting and labeling

Code Helper : Build assistants for any programming language

Mini-Agent : Automate planning and execution of specialized tasks

Getting Started Is Easy

Start with $300 in free credits

Choose your deployment (cloud or self-hosted)

Use our SDKs or API

Monitor through our dashboard

Who are we?

Lamini's team has been training, fine-tuning, and preference-tuning LLMs over the past two decades. We invented core LLM research like LLM scaling laws, shipped LLMs in production to over 1 billion users, taught nearly a quarter million students about Finetuning LLMs, and mentored the tech leads that went on to build the major foundation models: OpenAI’s GPT-3 and GPT-4, Anthropic’s Claude, Meta’s Llama 3.1, Google’s PaLM, and NVIDIA’s Megatron.

What's new?

Check out our blog for the latest updates.

Made with

Material for MkDocs

memory build accuracy mini tuning lamini

Related Articles