memtomem v0.2.4 · memtomem-stm v0.1.27 · Apache 2.0

AI That Never Forgets: Memory Infrastructure for Agents

Stop re-explaining your project to your AI every session. memtomem turns your notes, docs, and code into a searchable memory that any MCP-compatible agent can use — across sessions, across agents, all on your machine.

$ uv tool install 'memtomem[all]'
86 LTM MCP Tools
10 Compression Strategies
12 STM Control Tools
Why Do Agents Forget?
Tool integration (MCP), safety (Guardrails), and observability (Langfuse) are mature — but the memory layer still has no standard.
01

No Memory Between Sessions

All context is lost when a session ends. Architecture decisions, coding patterns, and debugging history must be re-explained every time.

02

Memory Silos Between Agents

Knowledge from Claude Code can't be carried over to another client. Each agent is trapped in its own isolated memory silo.

03

Limitations of Existing Solutions

Current memory systems work only when the agent explicitly searches, are locked to specific runtimes, and offer only a single LTM layer.

memtomem Solves This
memtomem applies the working-memory / long-term-memory model from cognitive science to agents: short-term compression and long-term search run as independent MCP servers.

Proactive Surfacing

Your agent doesn't have to ask. STM watches proxied MCP calls and surfaces relevant memory on its own; since v0.1.24, the mms hook extends this to Claude Code's native tools, backed by a warm daemon.

MCP Proxy Gateway

STM sits invisibly between your agent and its tools. Existing MCP servers keep working unchanged, while LTM can be reached over stdio, SSE, or streamable HTTP.

Cross-Runtime Sync

Define an agent, skill, or command once. Context Gateway syncs it across Claude Code, Codex CLI, Antigravity CLI, and others in each runtime's native format.

Smart Compression

Big tool responses get trimmed to fit your context window. Ten strategies now use query-aware ranking, safer JSON tiers, and zero-loss progressive mode for huge payloads.
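To make the idea of query-aware trimming concrete, here is a minimal Python sketch. It is not one of memtomem's ten strategies: the function names, the paragraph-level chunking, and the word-overlap score are all assumptions chosen for illustration.

```python
import re


def tokenize(text: str) -> set[str]:
    """Lowercase word tokens; a deliberately crude stand-in for real ranking features."""
    return set(re.findall(r"[a-z0-9_]+", text.lower()))


def compress(response: str, query: str, budget_chars: int) -> str:
    """Keep the paragraphs that overlap most with the query, within a character budget.

    Hypothetical helper for illustration only, not memtomem's API.
    """
    paragraphs = [p.strip() for p in response.split("\n\n") if p.strip()]
    query_terms = tokenize(query)

    # Rank by query-term overlap; sorted() is stable, so ties keep document order.
    ranked = sorted(
        enumerate(paragraphs),
        key=lambda item: -len(query_terms & tokenize(item[1])),
    )

    kept: list[int] = []
    used = 0
    for index, paragraph in ranked:
        if used + len(paragraph) > budget_chars:
            continue  # this paragraph would overflow the budget, try the next one
        kept.append(index)
        used += len(paragraph)

    # Emit survivors in their original order so the result still reads coherently.
    return "\n\n".join(paragraphs[i] for i in sorted(kept))
```

Calling `compress(tool_output, "how does auth login work", 60)` on a long tool response would keep only the paragraphs that mention the query's terms and fit in 60 characters. A real strategy would likely budget in tokens rather than characters and treat JSON differently from prose.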

Multi-Agent Knowledge Sharing

Each agent gets its own private memory, plus a shared one. Knowledge flows between agents — or from you to every agent at once.

Fully Local

SQLite + ONNX under the hood. No GPU, no external API, no cloud dependency — your memory stays on your machine.
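As a rough illustration of what "memory in a local SQLite file" can look like, the sketch below stores notes and retrieves them by keyword using only Python's standard library. The table layout and function names are assumptions, not memtomem's schema, and the ONNX-based part of the real stack is left out entirely.

```python
import sqlite3


def open_memory(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) a local note store; the schema here is hypothetical."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS notes ("
        "id INTEGER PRIMARY KEY, source TEXT, body TEXT)"
    )
    return conn


def remember(conn: sqlite3.Connection, source: str, body: str) -> None:
    """Persist one note together with where it came from."""
    conn.execute("INSERT INTO notes (source, body) VALUES (?, ?)", (source, body))
    conn.commit()


def recall(conn: sqlite3.Connection, keyword: str, limit: int = 5) -> list[tuple[str, str]]:
    """Return the most recent notes whose body contains the keyword."""
    rows = conn.execute(
        "SELECT source, body FROM notes WHERE body LIKE ? ORDER BY id DESC LIMIT ?",
        (f"%{keyword}%", limit),
    )
    return rows.fetchall()
```

Passing a file path instead of `":memory:"` keeps the notes on disk between sessions, which is the property the landing page is describing: nothing leaves the machine.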

Two-Layer Architecture
The STM proxy and the LTM server are connected via MCP and transparently provide surfacing and compression to agents.
Core AI Runtimes / Others: Claude Code, Codex CLI, Antigravity CLI, other MCP clients
  ↓ MCP
memtomem-stm (STM Proxy): CLEAN → COMPRESS → SURFACE → (INDEX)
  ↓ Surfacing, over MCP
memtomem (LTM Server): 86 MCP Tools
Upstream MCP Servers: filesystem, GitHub, …
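The CLEAN → COMPRESS → SURFACE stage order in the STM proxy can be pictured as a chain of text transforms applied to each tool response. The sketch below is an assumption-heavy illustration in Python: the stage implementations, the `[truncated]` and `[memory]` markers, and the keyword-to-note lookup are all hypothetical, and the optional INDEX stage is omitted.

```python
from typing import Callable

Stage = Callable[[str], str]


def clean(text: str) -> str:
    """CLEAN: drop blank lines and trailing whitespace."""
    return "\n".join(line.rstrip() for line in text.splitlines() if line.strip())


def make_compress(max_chars: int) -> Stage:
    """COMPRESS: naive truncation; a placeholder for a real compression strategy."""
    def compress(text: str) -> str:
        if len(text) <= max_chars:
            return text
        return text[:max_chars] + " [truncated]"
    return compress


def make_surface(memories: dict[str, str]) -> Stage:
    """SURFACE: append stored notes whose keyword appears in the response."""
    def surface(text: str) -> str:
        lowered = text.lower()
        hits = [note for keyword, note in memories.items() if keyword in lowered]
        if not hits:
            return text
        return text + "\n\n" + "\n".join(f"[memory] {note}" for note in hits)
    return surface


def run_pipeline(text: str, stages: list[Stage]) -> str:
    """Apply each stage in order to an upstream tool response."""
    for stage in stages:
        text = stage(text)
    return text
```

For example, `run_pipeline(raw, [clean, make_compress(2000), make_surface(notes)])` would tidy a response, cap its size, and then attach any matching notes before the agent sees it. Keeping each stage a plain `str -> str` function makes it easy to reorder or swap stages.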
Core Runtimes & Compatibility
Optimized for the main CLI runtimes, while staying MCP-native for the rest.
Claude Code
Codex CLI
Antigravity CLI
Other MCP clients
Framework adapters
Docs & Tutorials
From getting started to advanced usage.

Get Started Now

No GPU. No external services. One uv install is all you need.