Hypernetwork: A neural network that generates the weights for another neural network (in this case, generating LoRA adapters).
LoRA: Low-Rank Adaptation—a parameter-efficient fine-tuning technique that injects trainable low-rank matrices into frozen model layers.
Symbolic Context: Explicit text generated by the model (the 'dummy document') to serve as context for the answer.
Parameter Context: Model weights (adapters) dynamically generated by the hypernetwork to shift the model's behavior for a specific input.
FiD: Fusion-in-Decoder—a RAG architecture that processes retrieved documents independently in the encoder and fuses them in the decoder.
LongLLMLingua: A method for compressing long contexts into shorter, information-dense versions.
Whitening: A linear transformation that decorrelates data and normalizes its variance to make it spherical (identity covariance matrix).
SVD: Singular Value Decomposition—a matrix factorization method used here for the whitening process.