Entity Memory: A learnable matrix where each row corresponds to a specific entity's representation, accessed only when that entity is mentioned in the text
Sparse Activation: A mechanism where the model only accesses a small subset of its parameters (specific entity memories) for a given input, rather than using all weights
Mention Masking: A training objective where entity names are masked out, forcing the model to use context to predict the correct entity
LAMA: LAnguage Model Analysis—a benchmark for probing the factual/declarative knowledge stored in language models
TriviaQA: A large-scale question answering dataset containing complex, compositional questions
BIO encoding: A tagging scheme (Beginning, Inside, Outside) used to mark the boundaries of entity mentions in text
Entity Linking: The task of assigning a unique identity (from a knowledge base) to an entity mention in text