Overview — What is Supermemory? - supermemory

Supermemory is the memory and context engine for AI agents — and it’s the state of the art, ranked #1 on every major memory benchmark: LongMemEval, LoCoMo, and ConvoMem. Your AI forgets everything between conversations. Supermemory fixes that. It learns from every interaction, reasons over a directed knowledge graph to resolve contradictions and track how facts change over time, forgets what’s expired, and serves the right context at the right moment — through one API.

State of the art, by the numbers

Most memory systems claim to be better. Supermemory is measurably ahead — first place across all three independent benchmarks the field uses to grade AI memory.

Benchmark	What it measures	Supermemory
LongMemEval	Long-term memory across sessions, with knowledge updates	81.6% — #1
LoCoMo	Fact recall across long conversations (multi-hop, temporal, adversarial)	#1
ConvoMem	Personalization and preference learning	#1

Read the full methodology on the research page, or reproduce it yourself with MemoryBench, our open-source benchmarking framework.

What makes Supermemory different

Most “memory layers” are a vector store that retrieves the nearest chunk. Supermemory is an engine that understands — which is why it tops the benchmarks instead of just claiming to.

It reasons — not just retrieves

Resolves contradictions, tracks temporal change, follows multi-hop relationships, and forgets expired facts automatically. Reasoning is why it wins every benchmark.

A directed knowledge graph

One evolving directed graph and a single ontology across all your data — not a flat vector store, and not a raw graph you have to traverse yourself.

The whole context stack, one system

Memory, user profiles, hybrid search (RAG), connectors, and file processing — together, sharing one context pool. No stitching five tools together.

Multi-modal — everything in

Text, conversations, PDFs, images (OCR), video (transcription), and code (AST-aware chunking). Upload it and it just works.

Built to build on

One API, plus SDKs, a CLI, a memory filesystem, and MCP. Infrastructure you ship products on — not a closed box.

Run it anywhere

Hosted for zero-ops scale, or the full engine as one self-hosted binary — fully offline if you want, same API either way.

Hosted or self-hosted — same API. Use the managed platform for zero-ops scale, or run the entire engine on your own machine — one binary, zero config, fully offline. Move between them by changing a single baseURL.

How does it work? (at a glance)

You send Supermemory text, files, and chats.
It indexes them intelligently and builds a directed knowledge graph on top of an entity (a user, document, project, or organization).
At query time, it fetches only the most relevant context and passes it to your models.

Three ways to add context

Memory, profiles, and search all draw from the same context pool for a given user (containerTag) — so they reinforce each other instead of living in silos. Mix and match as your use case needs.

Memory API — learned user context

Supermemory learns and builds memory for each user — extracted facts that:

Evolve on top of existing context about the user, in real time
Handle knowledge updates, temporal changes, and contradictions
Power a user profile that acts as the default context provider for the LLM

User profiles

The evolving context produces a User Profile — the facts your agent should always know, in one ~50ms call:

Static: stable facts the agent should always know.
Dynamic: episodic context from the last few conversations.

RAG — advanced semantic search

Run hybrid search over the raw content too: advanced metadata filtering, contextual chunking, and reranking — tightly integrated with the memory engine, in a single query.

Start building

Quickstart

Make your first API call in minutes.

Add your first memory

Ingest text, files, and conversations into a container.

Search it

Retrieve the most relevant context with hybrid semantic search.

Ways to use Supermemory

The API is the core — but you don’t have to talk to it directly. Reach Supermemory however fits your workflow:

SDKs

Official TypeScript and Python SDKs, plus drop-in plugins for the AI SDK, OpenAI, LangChain, and more.

Command line (CLI)

Manage memories, search, and scripting from your terminal — it’s all npx supermemory.

Memory filesystem (SMFS)

Mount a container as a real directory your agent can ls, cat, and semantically grep.

Self-host it

Run the full memory engine on your own machine — one binary, zero config, fully offline.

​State of the art, by the numbers

​What makes Supermemory different

It reasons — not just retrieves

A directed knowledge graph

The whole context stack, one system

Multi-modal — everything in

Built to build on

Run it anywhere

​How does it work? (at a glance)

​Three ways to add context

​Memory API — learned user context

​User profiles

​RAG — advanced semantic search

​Start building

Quickstart

Add your first memory

Search it

​Ways to use Supermemory

SDKs

Command line (CLI)

Memory filesystem (SMFS)

Self-host it

State of the art, by the numbers

What makes Supermemory different

How does it work? (at a glance)

Three ways to add context

Memory API — learned user context

User profiles

RAG — advanced semantic search

Start building

Ways to use Supermemory