← Back to Paper List

Emptying the Ocean with a Spoon: Should We Edit Models?

Yuval Pinter, Michael Elhadad
Ben-Gurion University of the Negev
Conference on Empirical Methods in Natural Language Processing (2023)
Factuality Memory RAG KG

📝 Paper Summary

Knowledge Internalization Model Editing Factuality
Direct model editing is an ill-posed solution for correcting LLM factuality due to scalability and safety issues; retrieval-augmented and attribution-based methods are safer, more accountable alternatives.
Core Problem
Direct model editing aims to patch individual factual errors in LLMs by modifying weights, but this approach fails to scale to the vast, changing nature of world knowledge.
Why it matters:
  • Facts change too rapidly for surgical weight edits to keep up (e.g., world leaders, daily events), making the goal akin to 'emptying the ocean with a spoon'
  • Editing introduces bias by prioritizing popular facts while neglecting the long tail, leading to inconsistent model behavior
  • The premise that LLMs should be 'truth-tellers' via weight storage reinforces dangerous user trust in stochastic models
Concrete Example: Inserting the fact 'Jack Depp is the son of Johnny Depp' might require updating logically entailed facts like 'Jack Depp is the sibling of Lily-Rose Depp' (ripple effect), which current editing methods often fail to do consistently.
Key Novelty
Position Paper: The Case Against Model Editing
  • Argues that LLMs are architecturally unsuited for use as consistent knowledge bases due to their stochastic nature and the 'ripple effect' complexity of updating interconnected facts
  • Proposes that 'model editing' should be restricted to interpretability probes rather than deployed as a fix for factuality errors
  • Advocates for decoupling knowledge from inference via retrieval-augmented architectures where provenance is explicit and updates are database operations
Evaluation Highlights
  • Qualitative analysis only: The paper reviews literature (e.g., LAMA benchmark limitations) to argue that LLMs perform poorly on long-tail facts compared to popular ones
  • Cites prior work showing 2019-level models (BERT-XL) only got top answers correct ~26.5% of the time on LAMA, and even that was heuristic-based
  • Highlights evidence that popular facts are harder to edit out than unpopular ones, creating a bias in what 'knowledge' remains in the model
Breakthrough Assessment
2/10
This is a position/opinion paper offering a critical perspective rather than a technical breakthrough or new empirical results. It frames the debate but does not propose a new algorithm.
×