RAG: Retrieval-Augmented Generation—combining a search system with a text generator to answer questions using retrieved documents
KB: Knowledge Base—in this paper, a collection of textual documents (unstructured text)
Self-annotation: The process where the model generates its own training data (questions and answers) based on the provided text
Iterative tuning: A training cycle where the model is fine-tuned, generates new responses, verifies them, and is fine-tuned again on the verification results
LoRA: Low-Rank Adaptation—a parameter-efficient fine-tuning technique that updates only a small subset of model weights
CoT: Chain-of-Thought—a prompting technique where the model generates intermediate reasoning steps before the final answer
QE: Query Expansion—refining a search query by adding relevant terms (here, generated by the model itself) to improve retrieval
F1 score: A metric measuring the overlap between the predicted answer and the ground truth, balancing precision and recall