There is no shortage of tools claiming to be the best AI agent memory solution in 2026. There is a shortage of honest buyer's guides that help you figure out which one fits your situation. This is an attempt at the latter.
A word on method. Every capability claim about a competitor below was checked against their official documentation, site, or GitHub repository at the time of writing. Where we could not confirm a detail cleanly, we say so and link to where you should look. Products — especially pricing — change. Treat specifics as "true when we checked, confirm before you commit." We are not neutral about AgentPrizm (that's us), but we have tried hard to be fair about everyone else's.
What a memory layer is, and why the context window is not one
A memory layer is infrastructure that gives an AI agent persistent, governed memory across sessions. It stores what an agent learns — facts about a user, past decisions, standing rules, expressed preferences — outside the model, then retrieves the relevant pieces on demand.
That definition excludes the context window. A context window is working memory: temporary, finite, and gone at session end. Anthropic's own engineering guidance calls context a "finite resource with diminishing marginal returns" and names the failure mode — context rot, where accuracy degrades as the window fills. The context window handles what is happening right now. A memory layer handles what happened before.
The difference matters when your agent takes real actions across many sessions, many users, or many projects. For a deeper treatment, see What Is an AI Memory Layer? or the AgentPrizm memory primer.
How to evaluate a memory layer: the criteria checklist
Before you look at any vendor, know what you are actually buying for. Here is the checklist we would use.
Hosting model. Is it a managed cloud service, a self-hosted open-source package, or both? Self-hosted gives you full data control and no vendor dependency. Managed removes operational burden. Some tools offer both; some only one. Know which you need before you start the trial.
Integration path. How does your agent talk to it? Options range from a plain REST API (works with anything, no SDK lock-in) to language-specific SDKs (faster to integrate, dependency to manage) to a remote MCP server (zero-install for MCP-compatible agents — just a URL and a key). The right answer depends on your agent's runtime and your team's stack.
Memory model. How does the layer represent what it stores? Flat key-value, auto-classified natural language, typed categories, or a knowledge graph? The model determines how well the layer handles conflicting facts, evolving truths, and priority at recall time. An undifferentiated blob of text is harder to govern than a set of explicit memory types.
Governance primitives. Can you set validity windows on facts that expire? Does a corrected fact supersede the old one or sit alongside it confusingly? Is there an audit trail per operation? Is there a right-to-forget path for GDPR erasure? At a minimum, a production memory layer needs scoping (so one agent's memories don't leak to another), validity handling, and a clear deletion mechanism.
Pricing shape. Per-request, per-memory, flat seat, or credit-based? Spiky workloads behave very differently across pricing models. Understand the overage story before you scale — the number that matters is not the plan headline, it is the per-unit cost once you exceed the included quota.
Ecosystem maturity. How long has it been in production? Are there published benchmarks? What compliance certifications does it carry (SOC 2, HIPAA)? A newer tool may have a cleaner design; an older one may have the operational track record your organization requires.
The contenders
What follows are capability-level profiles of the main options as of mid-2026. This is not a leaderboard. It is an honest map of what each tool is designed to do well, so you can match it to what you actually need.
Mem0
Mem0 (mem0.ai) calls itself "drop-in memory infrastructure for AI agents and apps." Its defining characteristic is that the core is open source — there is a self-hosted path and a hosted platform on top. The documentation splits cleanly between the two, and the GitHub repository is public (mem0.ai/github), which means you can inspect the extraction and recall logic rather than taking it on faith.
For framework integrations, Mem0's reach is wide. Their docs advertise compatibility with LangChain, CrewAI, Vercel AI SDK, and 20+ partner frameworks at the time of writing — confirm the current list at docs.mem0.ai. They offer a Python SDK (pip install mem0ai) and Node.js support, plus a REST API.
Mem0 also ships OpenMemory — an MCP server they describe as "a private, local-first memory layer" that runs on your own infrastructure via Docker and works with Cursor, VS Code, Claude, and other MCP-compatible agents (Mem0: OpenMemory MCP). If you want coding-agent memory that never leaves your machine, that is a specific and real option to evaluate.
On compliance, Mem0's homepage listed SOC 2 (Type 1) and HIPAA at the time of writing — verify current certification status on their site before you rely on it. Free tier available; paid plans price on request volume.
Best fit: teams that want open-source auditability, framework ecosystem breadth, or a local MCP option for coding agents.
Zep
Zep (getzep.com) describes itself as a "context engineering platform" built around a temporal knowledge graph. In their own words, memory is "a temporal Context Graph — nodes are entities, edges are facts and relationships" that "updates dynamically as new data arrives, invalidating outdated facts while preserving history." When a fact changes, Zep records valid_at and invalid_at on the graph edge rather than overwriting — the history is kept, not lost. This is built on Graphiti, their open-source temporal graph framework (Zep on GitHub).
Important nuance: Graphiti is open source, but the Zep product is a managed service. Their GitHub README states the Community Edition "is no longer supported and has been deprecated" in favor of Zep Cloud — so if self-hosting the entire product is a requirement, read their current documentation carefully before assuming it.
On compliance, Zep's site listed SOC 2 Type II and HIPAA BAA at the time of writing, plus three deployment models: managed cloud, cloud with your own keys, and bring-your-own-cloud within your VPC. SDKs for Python, TypeScript, and Go. Confirm current specifics at getzep.com — this is the kind of detail that changes.
Best fit: organizations with many evolving facts across many entities, or those that need enterprise compliance (SOC 2 Type II, HIPAA BAA) and flexible deployment topology.
Letta (formerly MemGPT)
Letta (letta.com) is the company that grew out of the MemGPT research project. Their approach centers on stateful agents — agents that maintain memory across interactions as part of the agent architecture itself, rather than as an external service you bolt on. Their open-source agent server (pip install letta) runs locally or is deployable as a service, and they offer a managed cloud platform. The GitHub repository is public (Letta on GitHub).
Letta is worth evaluating if your use case is "I want the memory model to be part of the agent's reasoning loop" rather than "I want an external API my agent calls." The tradeoff is architectural: tighter integration can mean less operational flexibility. Check their current documentation at docs.letta.com for the current hosting and API surface.
Best fit: teams building agents from scratch who want memory as a first-class part of the agent architecture, or those comfortable running their own infrastructure.
Cognee
Cognee (cognee.ai) takes a knowledge-graph angle similar in spirit to Zep but from an open-source-first position. Their positioning focuses on structured memory — building a graph of entities and relationships from unstructured inputs — and they publish a Python package (pip install cognee). The GitHub repository is public (Cognee on GitHub).
If a structured, graph-based memory that you can run yourself is your requirement, Cognee is worth a look alongside Letta and Mem0's OSS path. Verify their current hosted offering status at cognee.ai, as the managed-cloud story was still evolving at the time of writing.
Best fit: teams who want graph-structured memory they can self-host, or who are doing research/evaluation where open-source inspection matters.
Supermemory
Supermemory (supermemory.ai) offers a hosted memory API oriented around personal knowledge management and AI agent memory. They offer an API for storing and searching memories with semantic recall, and have an open-source project on GitHub (Supermemory on GitHub). Their positioning is lighter-weight than the knowledge-graph vendors — closer to "a smart, hosted store for things your agent should remember."
Worth evaluating if you want a simple hosted API without the overhead of a full knowledge-graph setup. Check their current API surface and pricing at supermemory.ai — the product was actively developing at the time of writing.
Best fit: developers who want a lightweight hosted memory API to prototype with, or who find the larger tools architecturally heavier than their use case warrants.
AgentPrizm
We're the product behind this blog, so you should weight this section accordingly — we're not neutral, and we'll say so plainly.
AgentPrizm is a hosted memory layer. Agents reach it two ways: a REST API at /api/v1/agent/*, or a remote MCP server at https://agentprizm.com/api/mcp that agents connect to with a URL and a Bearer key — no local subprocess, no Docker, no install. Both transports expose the same tool surface.
What's distinctive is the memory model. Rather than one undifferentiated store, AgentPrizm uses six explicit types: fact, lesson, directive, preference, contact, bookmark. The reasoning is that each type fails differently. A standing safety rule (directive) needs to outrank a stylistic habit (preference) at recall time. A lesson about a past mistake should age and get revisited; a fact about a user's database version stays sticky until corrected. Recall is hybrid — semantic (embedding-based) plus keyword — and returns confidence scores on every result.
On governance: facts carry validity windows and can supersede earlier facts with a traceable trail rather than leaving two contradictory entries both ranking high. Memory scoping uses containers — per-project or per-agent boundaries — so one project's lessons don't bleed into another's. Every operation leaves an audit receipt. Forgetting is a first-class operation, designed to satisfy GDPR's right-to-erasure requirement.
Now the honest limitations. AgentPrizm is newer and smaller than Mem0 and Zep. We do not publish third-party benchmarks. We are not SOC 2 certified and we do not carry a HIPAA BAA — and this is not the place to fudge that. If your compliance bar requires a signed BAA today, be clear-eyed: we cannot meet it yet. We also don't offer a self-hosted path; if data leaving your infrastructure is off-limits, look at the open-source options.
Pricing: Hobby ($0 forever — 1,000 memories, 4,500 recalls/mo, 2 containers, no card required), Builder ($29/mo — 250K memories, 25K recalls/mo included + overage, 10 containers), Scale ($249/mo + usage — 1M memories, 225K recalls/mo included + overage), Enterprise (custom). The API is available on every tier including the free one. Current limits at pricing.
Best fit: developers who want an opinionated, typed memory model with governance built in (validity windows, audit trails, right-to-forget), reachable by REST or MCP, at a price that starts at zero and proves out before you commit.
Capability comparison
Rather than a ranking, here is how the tools line up against the evaluation criteria. Everything in non-AgentPrizm cells comes from the sources linked above and reflects each vendor's advertised capabilities at the time of writing — products change fast, so verify on their own docs before you rely on it. "Check docs" means we could not confirm a single clean answer on an official page.
| | Mem0 | Zep | Letta | Cognee | Supermemory | AgentPrizm |
|---|---|---|---|---|---|---|
| Core model | Self-improving memory layer | Temporal knowledge graph | Stateful agent architecture | Knowledge graph | Hosted semantic store | Six typed memory categories |
| Hosting | Hosted + OSS self-host | Managed cloud; BYOK; BYOC in your VPC | OSS self-host + hosted cloud | OSS self-host; check hosted status | Hosted | Hosted only |
| Open source | Yes (src) | Graphiti engine only (src) | Yes (src) | Yes (src) | Yes (src) | No |
| Integration | REST + Python/Node SDKs | REST + Python/TS/Go SDKs | REST + Python SDK | Python package | REST API | REST + remote MCP |
| MCP | OpenMemory local MCP (src) | Check docs | Check docs | Check docs | Check docs | Remote HTTP MCP (no install) |
| Memory model | Auto-classified | Temporal graph (entities + edges) | Memory in agent state | Graph (entities + relationships) | Semantic store | Six explicit types |
| Temporal/validity | Check docs | valid_at / invalid_at on edges (src) | Check docs | Check docs | Check docs | Validity windows + supersedes |
| Typed categories | Check docs | Graph-based model; verify | Check docs | Graph-based model; verify | Check docs | Yes — 6 types |
| Confidence scores | Check docs | Check docs | Check docs | Check docs | Check docs | Yes |
| Audit trail | "Every read and write logged" (src) | Advertised; verify (src) | Check docs | Check docs | Check docs | Yes — per-operation receipt |
| Right-to-forget | Check docs | Check docs | Check docs | Check docs | Check docs | Yes — first-class operation |
| Compliance | SOC 2 Type 1, HIPAA (verify) | SOC 2 Type II, HIPAA BAA (verify) | Check docs | Check docs | Check docs | GDPR-aligned; no SOC 2/HIPAA |
| Free tier | Listed; verify pricing | Listed; verify pricing | OSS is free | OSS is free | Check pricing | Hobby — $0, no card |
Recommendations by use case
Indie developer or prototype stage. Start with something that costs nothing and has a real API on the free tier. Mem0's OSS path, Cognee's open-source package, and AgentPrizm's Hobby plan all let you wire up a real integration before paying. Pick the model that fits your agent's architecture: Mem0 if you want framework integrations out of the box, AgentPrizm if you want typed governance from day one.
Production agents needing governance and audit. You need validity handling, scoping, audit trails, and a clear forgetting mechanism. Zep's temporal graph and AgentPrizm's typed model both address these — their approaches differ (graph vs. typed categories), so evaluate which fits how your data actually looks. If your agents deal with many evolving entity relationships, Zep's graph model may be the better fit. If you have a handful of agent types and want explicit type semantics, AgentPrizm is worth evaluating.
Compliance-heavy environments (HIPAA, SOC 2 Type II required). Go to Zep first. They advertise SOC 2 Type II and HIPAA BAA — confirm with their sales team and get the paperwork before you commit. Mem0 advertised SOC 2 (Type 1) and HIPAA at the time of writing — verify their current certification status and whether a BAA is available. If no one in this list can meet your bar, the self-hosted OSS options (Mem0, Cognee, Letta) give you data control at the cost of operational responsibility.
MCP-native coding agents (Cursor, Claude Code, etc.). Mem0's OpenMemory MCP is a local-first, Docker-based option that keeps data on your machine. AgentPrizm's remote MCP connects with a URL and a key — zero local setup, works anywhere an MCP client can make an HTTPS call. If you want memory that travels with you across machines without running infrastructure, the remote path wins. If you want the data never to leave your network, OpenMemory is the right answer.
Frequently asked questions
What's the best free memory layer for AI agents?
It depends on what you mean by "free." For a hosted API at zero cost with no credit card, AgentPrizm's Hobby plan gives you 1,000 memories and 4,500 recalls/month with the full API. For open-source software you run yourself, Mem0, Cognee, and Letta are all worth evaluating — the operational cost is yours, but the software is free. For a local MCP memory that costs nothing and runs on your own machine, Mem0's OpenMemory is worth a look. None of these are "free at unlimited scale."
Self-hosted vs. hosted: which should I choose?
Self-hosted wins on data control and vendor independence. Hosted wins on operational simplicity and usually on time to first integration. The practical question is: does your team have the capacity to run, monitor, patch, and scale a memory service? If yes, the open-source options are viable. If not — or if you need to move fast — a managed service is usually the right call, and you can always migrate later once your memory schema and recall patterns are well-understood.
Do I need a memory layer or just RAG?
They solve different problems. RAG (retrieval-augmented generation) pulls relevant chunks from a fixed reference corpus — your documentation, knowledge base, product catalog — and injects them into the prompt. A memory layer captures new knowledge the agent generates through interaction — user preferences, past decisions, lessons from prior runs — and feeds it back later. RAG answers "what does our documentation say about X?" Memory answers "what did this user tell me about their database setup three weeks ago, and have they changed it since?" Most production agents need both. For the longer version of this distinction, see Agentic Memory vs. RAG.
Can I use more than one of these?
Yes, and some teams do — different memory layers for different agent types within the same product. The overhead of multi-vendor is real (different APIs, different consistency guarantees, different billing), but if you have a regulated agent (needs HIPAA BAA) running alongside a lightweight coding assistant, mixing tools is a reasonable choice. Just be deliberate about which agent reads from which store.
The context window is not memory — that is still the right starting point. What has changed in 2026 is that the alternatives are varied and mature enough to require a real evaluation, not just a default. Use the criteria checklist above to narrow to two or three candidates, put your actual agent against each one, and let the integration tell you what works.
For the AgentPrizm side of that evaluation: pricing covers the plan limits, the comparison with Zep and Mem0 goes deeper on those three tools specifically, and the docs cover the API, MCP setup, memory types, and container scoping. If you are evaluating a competitor, go to their docs — the same standard applies.