Local server - supermemory | Memory API for the AI era

npx supermemory local downloads and runs Supermemory’s self-hosted memory engine on your machine. It’s the same engine behind the hosted platform — ingestion, memory extraction, hybrid semantic search, and the full API — as a single self-contained binary. No Docker, no database to provision, no config files. It boots in seconds with everything built in, and it’s open source.

Run it

npx
bunx
curl

npx supermemory local

bunx supermemory local

curl -fsSL https://supermemory.ai/install | bash

The installer detects your OS and architecture, downloads the right binary, verifies it, and (when run interactively) prompts you for an LLM API key. Then start the server:

supermemory-server

First boot sets everything up — the embedded graph engine, local embeddings, and your credentials — and prints an API key:

  ┌──────────────────────────────────────────────────┐
  │  url       http://localhost:6767                 │
  │  database  ./.supermemory                        │
  │  api key   sm_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx     │
  │  org id    xxxxxxxxxxxxxxxxxxxxxx                │
  └──────────────────────────────────────────────────┘

Save that API key — it’s your bearer token for every request. Then point any Supermemory SDK at your local server with a one-line change:

const client = new Supermemory({
  apiKey: "sm_...",                  // printed on first boot
  baseURL: "http://localhost:6767",  // that's the only change
})

Cloud CLI vs. local server

Both ship under npx supermemory, but they’re different things:

	`supermemory <command>`	`supermemory local`
What it talks to	The hosted platform	A server running on your machine
Auth	Your account (`login`)	Auto-generated key on first boot
Use case	Manage cloud memory from the terminal	Local-first, air-gapped, privacy-sensitive workloads
Reference	Command reference	This page + Self-hosting docs

Learn more

The local server has its own full documentation — bringing your own model (or running fully offline with Ollama), configuration, and how it compares to Enterprise:

Self-hosting quickstart

Install, run, and store your first memory in under two minutes.

Configuration

Every environment variable: LLM providers, local models, storage, tuning.

Overview

What’s built in, what runs offline, and how it matches the platform API.

Local vs. Enterprise

When to run it yourself and when to move to the managed platform.

​Run it

​Cloud CLI vs. local server

​Learn more