Parametric Memory: Memory encoded directly within the model's weights (parameters) during training, analogous to human semantic memory (general knowledge)
Non-parametric Memory: Memory stored in external structures (like vector databases) that can be retrieved dynamically, analogous to human episodic memory
Sensory Memory: In humans, brief storage of raw senses; in AI, the initial processing of raw inputs (text, image, audio) into machine-processable signals before reasoning begins
Working Memory: Temporary storage and processing for current tasks; in AI, this includes the context window and KV caches used during multi-turn dialogue or reasoning
Implicit Memory: Non-declarative memory involving skills/habits; in AI, this maps to procedural patterns learned during training or in-context learning traces for task execution
Personal Memory: Data related to individual users (inputs, feedback) used to enhance personalization and user modeling
System Memory: Data generated by the AI itself during task execution, such as intermediate reasoning steps, planning processes (CoT), or search results
KV-Cache: Key-Value Cache—a technique in LLM inference to store intermediate attention computations (parametric short-term memory) to speed up generation
Memory Consolidation: The process of stabilizing a memory trace after initial acquisition; in AI, converting short-term context into long-term storage
CoT: Chain-of-Thought—a prompting technique where the model generates intermediate reasoning steps, which this paper classifies as a form of System Memory