Persistent memory for AI agents
auditable by default.

Kumiho gives your AI assistants graph-native memory that survives context window resets, model swaps, and long-running projects.

Sound familiar?

The amnesia problem

Your agent forgets everything when the context window resets. Every session starts from scratch.

The hallucinated memory problem

You stuff old conversations into the prompt. The model confuses its own summaries with facts.

The black box problem

The agent "remembers" something, but you can't trace where that memory came from or verify it.

Kumiho solves all three. With a graph.

What Kumiho does

Persistent recall

Memories stored as immutable revisions in a graph. Survive context resets, model swaps, and provider changes.

Typed reasoning edges

DERIVED_FROM, DEPENDS_ON, REFERENCED — every memory knows what it came from and what depends on it.

Dream State consolidation

Async background process that enriches, links, deduplicates, and prunes memories. Like sleep for your agent.

LLM-decoupled

Memory lives in Neo4j + Redis, not inside any model. Switch providers without losing history.

A day in the life

1

Your AI assistant handles a design review. Kumiho stores the conversation summary, decisions, and references as a memory revision.

2

Three days later, the user asks "What did we decide about the hero layout?" The agent recalls the exact revision with full provenance.

3

Dream State runs overnight — links the design decision to the project brief, flags stale references, enriches metadata.

4

Two months later, a new team member asks the agent for project history. Every decision is traceable, every source is cited.

No prompt stuffing. No hallucination. Just recall.

Integration paths

Playground

Web chat powered by your own LLM API key. Talk to your agent and test memory store & recall in real time.

# Launch Kumiho Playground
kumiho playground

# Chat with your AI agent
# Bring your own LLM API key
# Test memory store & recall in real time

Claude Code Plugin

Install the MCP plugin and get persistent memory across every coding session. Zero config.

// .claude/settings.json
{
  "plugins": {
    "kumiho-memory": {
      "url": "npm:kumiho-cowork"
    }
  }
}
// Memory persists across all sessions

Claude Cowork Plugin

Shared team memory for Cowork sessions. Every decision is traceable to its author.

// Team memory shared across Cowork sessions
// Every decision traceable to its author

kumiho_memory_recall("project architecture")
// → Team-wide decisions with full provenance

OpenClaw Plugin

Install in one command: openclaw plugins install @kumiho/openclaw-kumiho. Full graph traversal and memory lifecycle.

# Install the Kumiho plugin for OpenClaw
openclaw plugins install @kumiho/openclaw-kumiho

# Memory is live — persists across all sessions
# Full graph traversal & reasoning included

How it works

Memory flows through a lifecycle — each stage adds structure and durability.

Ingest
Store
Recall
Revise
Consolidate

Kumiho vs. DIY memory stacks

An honest comparison

What you needRAG + Vector DBKumiho AI Cognitive Memory
Store conversationsYou build chunking + embeddingmemory_store() — one call
Recall by meaningVector similarity (no structure)Hybrid: fulltext + graph + vector
Trace provenanceNot availableEvery memory has typed edges
Handle contradictionsHope the model figures it outDream State detects + resolves
Survive model swapsRe-embed everythingLLM-decoupled by design
Audit trailBuild your ownImmutable revisions + timestamps

Common questions

What is Dream State?
An async consolidation process that reviews, enriches, links, and prunes memories. Think of it as offline maintenance for your agent's long-term memory.
Does this replace my vector database?
Yes. Kumiho includes its own vector infrastructure as part of the hybrid retrieval system (fulltext + graph + vector). You don't need to run a separate vector database — memory search, similarity matching, and semantic recall all run on Kumiho's built-in infrastructure.
Which AI platforms are supported?
Any platform that supports MCP tools, REST APIs, or Python. First-class plugins for Claude Code, Cowork, and OpenClaw. Plus Kumiho Playground for local testing.
How much memory can I store?
Depends on your tier. Memory is included at every tier. Free gets 1k nodes, Creator 30k, Studio 500k. Add Memory Standard ($29/mo) to boost Free limits.

Stop rebuilding memory stacks. Ship agent recall with provenance.

Kumiho AI Cognitive Memory — Graph-Native Agent Memory | 93% Recall Accuracy | Kumiho