CLeHe: Contrasting Low-entropy distribution with High-entropy distribution—the paper's proposed method combining ensemble weighting and contrastive decoding
LeEns: Low-entropy Ensemble—the first component of the method that weights documents based on the entropy of the LLM's generation distribution given that document
Parametric Knowledge: Knowledge stored in the model's pre-trained weights (often outdated or hallucinated) as opposed to external retrieved context
Contrastive Decoding: A technique to adjust generation probabilities by maximizing the difference between a desired distribution (expert) and an undesired one (amateur/noise)
Lost in the middle: A phenomenon where LLMs fail to use relevant information if it appears in the middle of a long context window surrounded by irrelevant text
Pointwise Mutual Information (PMI): A measure used here to quantify the information gain provided by the external documents relative to the model's internal prior
Logit: The raw, unnormalized output score from the neural network before applying the softmax function
Product-of-experts: An ensemble method where probabilities from different sources are combined by multiplying them (or averaging their logits) rather than averaging probabilities
NaiveRAG: A baseline approach where retrieved documents are simply concatenated with the query into a single prompt