Unsloth: Easily run and train models locally

Unsloth - Train and Run Models LocallyunslothModelsBlogUnsloth Studio✨Docs

New

✨Introducing Unsloth StudioEasily run & train models locally.

Join our DiscordStart for free

Latest News

Gemma 4 12B and QAT is here!Jun 5, 2026

Unsloth joins PyTorch ecosystemMay 11, 2026

Unsloth API endpointMay 5, 2026

Qwen3.6 is out now!Apr 22, 2026

View more news Run models locally Unsloth Studio runs 100% offline on your Mac and Windows device. Run GGUF and Safetensors models with tool-calling, web search, and OpenAI compatible API.

Compare models side by side and upload images, docs, audio, code files and more. Learn more

No-code training Auto-create datasets from PDF, CSV, JSON docs and start training with real-time observability.

Unsloth's custom kernels supports optimized training for LoRA, FP8, FFT, PT and 500+ models including text, vision, audio and embeddings. QuickstartLearn more

Model Arena Chat with and compare 2 different models, such as a base model and a fine-tuned one, to see how their outputs differ.

Just load your first GGUF/model, then the second, and voilà! Learn more

Data Recipes Data Recipes transforms your docs into useable datasets via graph-node workflow. Upload unstructured or structured files like PDFs, CSV and JSON. Unsloth Data Recipes auto turns documents into your desired formats. QuickstartLearn more

Export models Export any model, including your fine-tuned models, to safetensors, or GGUF for use with llama.cpp, vLLM, Ollama, and more. Learn more

Don’t believe us? Why not try our fully free open source version? Finetune 2X faster on a single NVIDIA GPU for free on Google Colab or Kaggle Notebooks. Get access now

Subscribe now

Train your own custom model in 24 hrs, not 30 days.

30x faster than FA2 + 30% accuracy

90% less memory usage than FA2

audio, embedding, vision support

The details We're making AI more accessible to everyone Find out moreUnsloth makes everything greener As hardware costs rise and performance gains plateau, we use our math and coding skills to make models train and run smarter + faster.

Want lightning fast inference? We’re working on it! Contact us

Don't forget to join our newsletter! Submit By registering you agree to unsloth's Terms of Service and Privacy Policy, Subscribe now

MultiGPU DocsEven better multiGPU in the works!

Don't forget to join our newsletter! Subscribe

Pricing FreeFreeware of our standard version of unsloth Get started Open-source

Supports Mistral, Gemma

Supports LLama 1, 2, 3

MultiGPU - coming soon

Supports 4 bit, 16 bit LoRA

unsloth Pro2.5x faster training + 20% less VRAM Contact us 2.5x number of GPUs faster than FA2

20% less memory than OSS

Enhanced MultiGPU support

Up to 8 GPUS support

For any usecase

unsloth EnterpriseUnlock 30x faster training + multi-node support + 30% accuracy Contact us 32x number of GPUs faster than FA2

up to +30% accuracy

5x faster inference

Supports full training

All Pro plan features

Multi-node support

Customer support

Ready to use unsloth? Get started for free

Company About📰 NewsletterPrivacy PolicyTerms of Service Product Introduction🐋 DockerDownloadDocumentation🦥 Models

Community

Twitter (X)

Hugging Face

Discord

unsloth [email protected]

Join Our Discord

Unsloth: Easily run and train models locally

Related Articles

Apple WWDC 2026 Livestream

Claude Fable 5

US Government directive to suspend access to Fable 5 and Mythos 5

German ruling declares Google liable for false answers in AI Overviews

Britain Became as Poor as Mississippi