hallucination: When an LLM generates a response that is fluent and plausible but factually incorrect or nonsensical.
logits: The raw, unnormalized output scores from the final layer of a neural network before the softmax function converts them into probabilities.
softmax: A function that converts a vector of logits into a probability distribution summing to 1.
aleatoric uncertainty: Uncertainty arising from inherent randomness or noise in the data/generation process (e.g., multiple valid ways to phrase an answer).
epistemic uncertainty: Uncertainty stemming from the model's lack of knowledge or training data regarding a specific input.
semantic entropy: An uncertainty metric that groups sampled responses by meaning (semantics) before calculating entropy, effectively ignoring phrasing differences.
Boltzmann distribution: A probability distribution from physics/thermodynamics where the probability of a state decreases as its energy increases.
partition function: The normalizing constant in the Boltzmann distribution, ensuring probabilities sum to 1; often intractable to compute exactly for LLMs.
AUROC: Area Under the Receiver Operating Characteristic curve; a metric for binary classification performance (here, distinguishing correct vs. incorrect answers).
AUPR: Area Under the Precision-Recall curve; focuses on performance when the positive class (e.g., correct answers) is rare or of specific interest.
FPR@95: False Positive Rate at 95% True Positive Rate; measures how many errors are missed when trying to catch 95% of correct instances.
OOD: Out-of-Distribution; data that is significantly different from the data the model was trained on.