Skip to main content
npx supermemory local downloads and runs Supermemory’s self-hosted memory engine on your machine. It’s the same engine behind the hosted platform — ingestion, memory extraction, hybrid semantic search, and the full API — as a single self-contained binary. No Docker, no database to provision, no config files. It boots in seconds with everything built in, and it’s open source.

Run it

npx supermemory local
The installer detects your OS and architecture, downloads the right binary, verifies it, and (when run interactively) prompts you for an LLM API key. Then start the server:
supermemory-server
First boot sets everything up — the embedded graph engine, local embeddings, and your credentials — and prints an API key:
  ┌──────────────────────────────────────────────────┐
  │  url       http://localhost:6767                 │
  │  database  ./.supermemory                        │
  │  api key   sm_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx     │
  │  org id    xxxxxxxxxxxxxxxxxxxxxx                │
  └──────────────────────────────────────────────────┘
Save that API key — it’s your bearer token for every request. Then point any Supermemory SDK at your local server with a one-line change:
const client = new Supermemory({
  apiKey: "sm_...",                  // printed on first boot
  baseURL: "http://localhost:6767",  // that's the only change
})

Cloud CLI vs. local server

Both ship under npx supermemory, but they’re different things:
supermemory <command>supermemory local
What it talks toThe hosted platformA server running on your machine
AuthYour account (login)Auto-generated key on first boot
Use caseManage cloud memory from the terminalLocal-first, air-gapped, privacy-sensitive workloads
ReferenceCommand referenceThis page + Self-hosting docs

Learn more

The local server has its own full documentation — bringing your own model (or running fully offline with Ollama), configuration, and how it compares to Enterprise:

Self-hosting quickstart

Install, run, and store your first memory in under two minutes.

Configuration

Every environment variable: LLM providers, local models, storage, tuning.

Overview

What’s built in, what runs offline, and how it matches the platform API.

Local vs. Enterprise

When to run it yourself and when to move to the managed platform.