DoLa: Decoding by Contrasting Layers—the proposed method that subtracts earlier layer logits from final layer logits
Premature Layer: An intermediate transformer layer selected dynamically to represent lower-level linguistic information
Mature Layer: The final transformer layer containing fully processed, semantic, and factual information
Contrastive Decoding (CD): A baseline method that contrasts logits between a small 'amateur' model and a large 'expert' model
JSD: Jensen-Shannon Divergence—a symmetric measure of similarity between two probability distributions, used here to select the premature layer
APC: Adaptive Plausibility Constraint—a filtering technique that masks tokens with low probability in the expert/mature model to prevent implausible tokens from being boosted
ITI: Inference Time Intervention—a baseline method that shifts model activations during inference using a classifier trained on truthful data
CoT: Chain-of-Thought—a prompting strategy where the model generates intermediate reasoning steps before the final answer