RAG: Retrieval-Augmented Generation—providing external documents in the prompt to help the model answer questions
Parametric Memory: Knowledge stored within the model's pre-trained weights (e.g., facts it 'knows' from training)
Causal Tracing: A technique to identify which specific internal neural activations cause a model to output a specific prediction by corrupting and restoring states
Average Indirect Effect (AIE): A metric quantifying how much a specific hidden state contributes to the probability of the correct answer
Subject Token: The token in the query representing the entity being asked about (e.g., 'Tower' in 'Where is the Eiffel Tower?')
Last Token (LT): The final token position in the prompt sequence, from which the next token (the answer) is predicted
Residual Stream: The primary vector pathway in a Transformer where information is processed and passed between layers
Attention Knockout: A probing method where specific attention edges (connections between tokens) are zeroed out to measure their importance to the prediction
SLM: Small Language Model—typically models with fewer than ~7 billion parameters (e.g., Phi-2)