Parametric Memory: Persistent state written into model weights (parameters) during training, accessed via FFN/MLP layers
Contextual Memory: Transient state in the context window (working memory) accessed via attention mechanisms during inference
External Memory: Non-parametric persistent state stored in external databases/indices, accessed via retrieval
Procedural Memory: Episodic memory that stores interaction history and process states to maintain cross-session consistency and long-term goals
ROME: Rank-One Model Editing—a technique to rewrite specific factual associations in a model's MLP layers
MEMIT: Mass-Editing Memory in a Transformer—a scalable method for editing thousands of facts in model weights
MEND: Model Editor Networks with Gradient Decomposition—a hypernetwork-based approach for efficient local model edits
PO setting: Parametric-Only setting—evaluating model recall without access to external documents or context (closed-book)
Induction Heads: Attention heads that copy patterns from previous tokens in the context, crucial for in-context learning
LAMA: LAnguage Model Analysis—a probe dataset used to test factual recall from model parameters