RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents, then generating responses based on them
Cross-Encoder: A re-ranking model that processes a query and document together in a single transformer pass to output a relevance score
Bi-Encoder: A retrieval model that encodes query and document separately into vectors, allowing fast nearest-neighbor search
Listwise Reranking: A ranking approach where the model considers a list of documents simultaneously to output a permutation, rather than scoring pairs or single documents
Distillation: Training a smaller student model (e.g., RankZephyr) to mimic the behavior/outputs of a larger teacher model (e.g., GPT-4)
Nugget: A specific atomic fact or piece of information that should be present in a correct answer
AutoNuggetizer: A framework proposed in this thesis to automate the extraction and checking of information nuggets using LLMs
Zero-Shot: Performing a task (like reranking) without having been explicitly trained on examples of that specific task
nDCG: Normalized Discounted Cumulative Gain—a measure of ranking quality that prioritizes highly relevant documents appearing earlier in the list
BoW: Bag-of-Words—a text representation model that counts word occurrences, ignoring order
In-Context Learning: The ability of LLMs to learn tasks from examples or instructions provided in the prompt without parameter updates
RLHF: Reinforcement Learning with Human Feedback—a method to align LLMs with human preferences