MIPS: Maximum Inner Product Search—algorithms for efficiently finding vectors with the highest dot product in a large collection
ICT: Inverse Cloze Task—a pre-training objective where a model learns to retrieve the document a specific sentence came from, used here for initialization
Salient span masking: A masking strategy that focuses on named entities and dates rather than random tokens, forcing the model to rely on world knowledge
Marginal likelihood: The probability of the observed data summing over all possible values of a latent variable (here, the retrieved document)
Cold-start problem: The issue where a randomly initialized retriever returns irrelevant documents, preventing the downstream model from learning to use them
ORQA: Open-Retrieval Question Answering—a predecessor model that fine-tunes a retriever but uses a fixed index, serving as the primary baseline
Null document: A virtual empty document added to the candidate set, allowing the model to assign credit when no external retrieval is necessary