Query Expansion (QE): Technique to improve search results by adding relevant terms or generating pseudo-documents to enrich the original query
HyDE: Hypothetical Document Embeddings—a method where an LLM generates a fake document to answer a query, which is then used to search for real documents
NLI: Natural Language Inference—the task of determining if one sentence (hypothesis) logically follows from another (premise)
Gold Evidence: The ground-truth sentences or documents required to verify a claim in a dataset
BM25: A standard probabilistic information retrieval function used to rank documents based on keyword matching
Contriever: A dense retrieval model trained via contrastive learning to match queries and documents in vector space
Knowledge Leakage: When a model performs well on a test set because it has seen the test data (or related information) during its pre-training
Recall@k: The proportion of relevant documents retrieved within the top k results
NDCG@k: Normalized Discounted Cumulative Gain—a metric measuring the quality of ranking, giving more weight to relevant items appearing earlier
Fact Verification: The task of assessing whether a claim is true or false based on retrieved evidence