LLM: Large Language Model—AI systems trained on vast text data to generate human-like text
SVD: Singular Value Decomposition—a mathematical method to factorize a matrix, used here to find the principal directions (subspace) of the data
Huber contamination model: A statistical model representing data as a mixture of a majority distribution (truthful) and a contaminant distribution (hallucinated)
AUROC: Area Under the Receiver Operating Characteristic Curve—a performance metric for classification problems at various threshold settings
subspace: A vector space that is a subset of a larger vector space; here, a specific direction in the high-dimensional embedding space where hallucinations cluster
membership estimation: The process of assigning a probability or score indicating whether a sample belongs to a specific class (e.g., hallucinated) within a mixture
autoregressive: A property of models that generate sequences one token at a time, using prior tokens as context