RALM: Retrieval-Augmented Language Modeling—conditioning a language model on relevant documents during generation
In-Context RALM: The proposed method of prepending retrieved documents to the LM's input context without updating LM weights
Retrieval Stride: The interval (number of tokens) between consecutive retrieval operations during text generation
Retrieval Query Length: The number of recent tokens from the current prefix used to formulate the search query
BM25: Best Matching 25—a standard bag-of-words ranking function used for sparse retrieval
Perplexity: A measurement of how well a probability model predicts a sample; lower values indicate better performance
Predictive Reranking: A proposed method where a reranker is trained to select documents that maximize the likelihood of the ground-truth continuation text
Zero-shot Reranking: Using a frozen LM to score retrieved documents based on the likelihood of the immediate context prefix given the document
Contriever: A dense retrieval model trained using contrastive learning
ODQA: Open-Domain Question Answering—answering questions based on a large collection of documents