Knowledge Editing: Techniques to insert or update specific factual knowledge in a trained language model without full retraining
Hallucination: When a language model generates plausible-sounding but factually incorrect information
ROME: Rank-One Model Editing—a method that locates and updates a specific factual association in a model's MLP layers
MEMIT: Mass-Editing Memory in a Transformer—an extension of ROME designed to edit thousands of facts simultaneously
ICE: In-Context Editing—providing the corrected fact directly in the prompt context rather than modifying model weights
GRACE: A memory-based editing method that uses a discrete codebook to intercept and adjust activations for specific inputs
Portability: Whether the model can use the edited knowledge to answer downstream reasoning questions (e.g., multi-hop queries)
Locality: Whether the edit remains specific to the target fact without changing answers to unrelated questions
Sycophancy: The tendency of a model to agree with the user's input, even if the input contradicts its own knowledge