Knowledge Verbalization: The process of prompting an LLM to generate relevant background information from its internal weights (parametric knowledge) before answering a question
Selective Retrieval: An inference strategy where the system dynamically decides whether to perform retrieval or rely on the model's internal knowledge
Parametric Knowledge: Information stored implicitly in the neural network weights of an LLM, acquired during pre-training
GenRead: A specific prompting method used to elicit (verbalize) knowledge from an LLM by asking it to generate a background document
DPO: Direct Preference Optimization—a method to align language models to preferences without a separate reward model
kNN: k-Nearest Neighbors—an algorithm used here to find similar past queries in embedding space to help decide the best knowledge source
Behavior Cloning: A supervised learning approach where a model is trained to mimic the actions (here, source selection and generation) of an expert policy
Dense Retrieval: Retrieval based on semantic vector similarity (embeddings) rather than keyword matching