Knowledge Editing: Techniques to modify specific facts stored in an LLM's weights without re-training the entire model
Specificity: The constraint that an edit should only affect the target fact and its variations, leaving unrelated knowledge unchanged
Implication Awareness: The constraint that an edit should automatically update facts that are logical consequences of the edited fact (based on If-Then rules)
Establish-and-Update: The proposed protocol where a model first learns a set of facts/rules, and then is tested on how well it updates them
DepEdit: The dataset proposed in this paper, consisting of facts, rules, and implications formulated as QA pairs
MEND: Model Editor Networks with Gradient Decomposition—a hypernetwork-based knowledge editing method
ROME: Rank-One Model Editing—a method that locates and edits specific factual associations in transformer MLPs
EMS: Exact-Match Score—a metric measuring if the generated answer exactly matches the ground truth
Weak Knowledge: Knowledge acquired by LLMs from training data without real-world justification (model beliefs)