Parametric memory: Knowledge stored implicitly in the weights (parameters) of a pre-trained neural network
Non-parametric memory: External knowledge sources (e.g., Wikipedia passages) accessed via retrieval at inference time
PopQA: The new dataset introduced in this paper, consisting of 14k questions about long-tail entities derived from Wikidata triples
Long-tail entities: Entities that appear infrequently in training data or real-world usage, often defined here by low Wikipedia page views
Contriever: A dense information retrieval model trained using contrastive learning to match queries with relevant documents
BM25: A widely used ranking function for information retrieval based on exact keyword matching statistics
Greedy decoding: A generation strategy where the model selects the highest probability token at each step
EntityQuestions: An existing open-domain QA dataset used as a secondary benchmark, also featuring long-tail distribution