RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
Attention Map: A matrix representing how much focus a model places on different parts of the input when generating a specific token
Contextual Hallucination: A specific type of error where the model's output contradicts or is not supported by the provided retrieved source text
Lookback-Lens: A baseline method that detects hallucinations by analyzing the ratio of attention placed on the prompt versus newly generated tokens
Jensen-Shannon Divergence: A method of measuring the similarity between two probability distributions; used here to measure how much individual attention heads deviate from the layer average
Boruta: A feature selection algorithm that compares features' importance against randomized 'shadow' features to find statistically significant predictors
Lasso: Least Absolute Shrinkage and Selection Operator—a regression analysis method that performs both variable selection and regularization
Intrinsic Hallucination: Model output that directly conflicts with the provided source material
Greedy Decoding: A generation strategy where the model always selects the highest-probability token at each step