Skip to content

Agent memory

When to use

Multi-step tasks with evolving user intent, preferences, or constraints.

Architecture sketch

Input → Scratchpad (ephemeral) → Tools
      → Profile (long-term) → Ledger (task)

Pitfalls

Storing everything forever → privacy and drift
Missing expiry windows and provenance

Checklist

[ ] Define memory types and retention windows
[ ] Source-of-truth + PII handling
[ ] Summarization with confidence and citations

Good–Better–Best

Good: ephemeral scratchpad
Better: task ledger with rollups
Best: governed profile with consent and expiry

Tiny eval snippet

def recall_rate(prompts, memory):
    return sum(int(intent in memory) for intent in prompts) / len(prompts)