GAR: Generation-Augmented Retrieval—a paradigm where an LLM generates additional context (like query expansions) to help a retriever find documents
RAG: Retrieval-Augmented Generation—a paradigm where a retriever fetches documents to help an LLM generate an answer
Zero-shot IR: Information Retrieval tasks performed without any training data from the target domain
nDCG@k: Normalized Discounted Cumulative Gain at k—a measure of ranking quality that accounts for the position of relevant documents
Recall@k: The fraction of relevant documents retrieved within the top-k results
BM25: Best Matching 25—a standard probabilistic information retrieval function based on term frequency and inverse document frequency
Relevance Model: A component (often an LLM) that scores how relevant a document is to a query, used here for filtering
Promptagator: A baseline method that uses LLMs to generate synthetic queries from a corpus to train a dense retriever