GNM: Generalized Neural Memory—the proposed framework where memory updates are conditioned on natural language instructions
MemoryLLM: The specific base architecture used in this paper, which adds writable memory embeddings to a Llama-3 backbone
ICL: In-Context Learning—providing examples or history directly in the prompt context window
RAG: Retrieval-Augmented Generation—fetching relevant documents from a database to augment the prompt
catastrophic forgetting: The tendency of neural networks to lose previously learned information upon learning new information
CounterFACT: A dataset originally designed for fact editing, adapted here to create a synthetic benchmark for instruction-following memory
test-ood: Out-of-Distribution Test Set—a data split containing fact categories and learning instructions never seen during training
harmonic mean: A type of average used here to aggregate different metrics (accuracy, specificity, selectivity) into a single score