RAG: Retrieval-Augmented Generation—AI systems that answer questions by searching for external documents
DST: Document Structure Tree—a hierarchical representation of a document where nodes are headings (structure) or text passages (content)
RST: Retrieval SubTree—a dynamic subset of the DST used during inference to maintain focus while exploring
Router: An LLM agent trained to navigate the document tree by selecting, expanding, or rejecting nodes
LoRA: Low-Rank Adaptation—a parameter-efficient technique for fine-tuning large language models
EM: Exact Match—a metric checking if the generated answer is identical to the ground truth
F1-5: A metric for list-style QA, measuring the overlap between the predicted list and the gold list, capped at 5 items
SFT: Supervised Fine-Tuning—training a model on labeled examples
Dense Retriever: A retrieval system using vector embeddings to find relevant text
Off-the-shelf: Using pre-trained models without further modification or fine-tuning
Greedy decoding: A generation strategy where the model always picks the most likely next token