log-probabilities: The logarithm of the probability assigned by the model to a specific token; a measure of model confidence
calibration bias: The systematic discrepancy between a model's predicted probabilities and the actual correctness of its outputs
GRU: Gated Recurrent Unitβa type of recurrent neural network that processes sequential data and can capture long-term dependencies
logical hallucinations: Errors where the model fails in reasoning or internal logic (e.g., bad math, invalid code logic) rather than just retrieving incorrect facts
factual hallucinations: Errors where the model generates information that contradicts established reality or the provided context
black-box: A setting where the internal weights and states of a model are not accessible, only its inputs and outputs (and sometimes log-probs)
teacher-forcing: A training technique where the model is fed the ground-truth previous token (or the actual generated token in this context) rather than its own prediction to compute probabilities for the sequence