BM25: A ranking function used in information retrieval to estimate the relevance of documents to a given search query, combining term frequency, inverse document frequency, and document-length normalization
ROUGE-L: An evaluation metric based on the longest common subsequence between reference and generated text, which rewards in-order word matches without requiring them to be contiguous and so captures sentence-level structure
BLEU: Bilingual Evaluation Understudy, a metric that scores generated text by its n-gram precision against one or more references, with a brevity penalty for overly short outputs
T5: Text-to-Text Transfer Transformer, a model architecture in which every NLP task is cast as mapping an input text to an output text
Adafactor: A stochastic optimization method based on Adam that reduces memory usage by storing factored (row and column) estimates of the second moments, often used for training Transformers
RankDocBySnpt: A proposed retrieval strategy where snippets are retrieved first to find matches, but the full parent documents containing those snippets are ranked and used for context
IDF: Inverse Document Frequency, a measure of how much information a term provides: high for terms that occur in few documents of the collection, low for terms that occur in most of them
hard negative: In contrastive learning or classification, a negative sample that is very similar to the positive sample, making the two difficult for the model to tell apart
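
To make the BM25 and IDF entries concrete, here is a minimal Python sketch of one common BM25 formulation. Several details are assumptions for illustration and do not come from the source: the function names `idf` and `bm25_score`, the smoothed IDF variant with `+ 1.0` inside the logarithm, the parameter defaults `k1=1.2` and `b=0.75`, and the representation of documents as lists of tokens.

```python
import math
from collections import Counter


def idf(term, docs):
    """Smoothed inverse document frequency (assumed variant: +1 inside the log)."""
    n = len(docs)
    df = sum(1 for d in docs if term in d)  # number of documents containing the term
    return math.log((n - df + 0.5) / (df + 0.5) + 1.0)


def bm25_score(query, doc, docs, k1=1.2, b=0.75):
    """Score one tokenized document against a tokenized query.

    k1 controls term-frequency saturation; b controls length normalization.
    Both defaults are conventional choices, not values from the source.
    """
    avgdl = sum(len(d) for d in docs) / len(docs)  # average document length
    tf = Counter(doc)
    score = 0.0
    for term in query:
        f = tf[term]
        if f == 0:
            continue  # terms absent from the document contribute nothing
        length_norm = 1 - b + b * len(doc) / avgdl
        score += idf(term, docs) * f * (k1 + 1) / (f + k1 * length_norm)
    return score


# Hypothetical toy corpus, already tokenized.
docs = [
    ["cat", "sat"],
    ["dog", "ran", "far"],
    ["cat", "cat", "dog"],
]
scores = [bm25_score(["cat"], d, docs) for d in docs]
```

On this toy corpus the third document scores highest for the query "cat" because the term occurs twice there, and the second document scores zero because it does not contain the term. A term found in only one document, such as "ran", receives a higher IDF than "cat", which appears in two.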