HIG: Hierarchical Index Graph—the core data structure comprising a document connectivity layer and an entity-level knowledge graph layer
triplets: Knowledge graph units consisting of (subject, predicate, object) used to represent structured information within documents
collaborative document layer: A graph layer where nodes are documents and edges represent semantic similarity, allowing retrieval to hop to related documents
Exact Match (EM): A metric measuring the percentage of predictions that match the ground truth answer exactly
SOTA: State-of-the-art—the current best performance achievable by existing methods
ROUGE-L: Recall-Oriented Understudy for Gisting Evaluation—a metric measuring overlap of the longest common subsequence between reference and candidate text
BLEU: Bilingual Evaluation Understudy—a metric for evaluating the quality of text which has been machine-translated from one natural language to another
LLMs: Large Language Models—AI systems trained on vast amounts of text data to generate human-like text
CoT: Chain-of-Thought—a prompting technique that encourages LLMs to generate intermediate reasoning steps
Zero-shot: A setting where the model performs a task without seeing any specific training examples for that task