Agent Memory: A persistent, self-evolving cognitive state that integrates factual knowledge and experience across tasks, distinct from static RAG or transient context buffers.
Parametric Memory: Memory stored directly within the model's weights, often updated via fine-tuning or editing, accessed implicitly during forward passes.
Latent Memory: Memory represented as continuous hidden states or activations that persist across steps, rather than discrete text tokens.
Token-level Memory: Discrete units (text, visual tokens) stored externally that can be explicitly inspected, modified, and retrieved.
Experiential Memory: Records of past actions, outcomes, and reasoning traces used to improve future problem-solving (distinct from static facts).
Context Engineering: Resource management paradigm optimizing the context window payload, focusing on interface correctness rather than cognitive continuity.
RAG: Retrieval-Augmented Generation—typically serving static external knowledge for single-turn queries, unlike the evolving internal state of Agent Memory.
KV cache: Key-Value cache—storing attention computations to speed up generation; categorized here as 'LLM Memory' rather than 'Agent Memory'.