Contriever: A dense information retrieval model pre-trained using contrastive learning, used here as the backbone encoder
LERC: Learned Evaluation Metric for Reading Comprehension—a metric that scores how semantically equivalent a candidate answer is to a gold answer
distillation: The process of training a smaller model (the retriever) to mimic the behavior or knowledge of a larger model (GPT-4)
veracity classification: The downstream task of determining if a claim is Supported, Refuted, or Not Enough Info based on evidence
hard negatives: Documents that look relevant (e.g., high lexical overlap) but do not actually contain the answer; critical for training effective retrievers
BM25: A probabilistic retrieval function based on exact keyword matching, used here for first-stage retrieval
dense retriever: A retrieval system that uses vector embeddings to find semantically similar documents, rather than just keyword matching