RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
Knowledge Graph (KG): A structured representation of knowledge using triples (head entity, relation, tail entity)
KG Generator: A module that prompts an LLM to extract knowledge triples from unstructured text
Reasoning Chain: A sequence of logically connected knowledge triples used to derive an answer
Autoregressive: A process where the current output depends on previous outputs (here, selecting the next triple based on previous ones)
Bi-encoder: A model architecture that encodes the query and candidate items (triples) separately into vector embeddings for efficient similarity comparison
Exact Match (EM): A metric measuring the percentage of predictions that match the ground truth answer exactly
Triple Ranker: A component that scores the relevance of KG triples to the current reasoning context
Triple Selector: A component that chooses the best triple from candidates to extend the reasoning chain
Beam Search: A search algorithm that explores a graph by expanding the most promising nodes (chains) at each step
Lost-in-the-middle: A phenomenon where LLMs fail to retrieve information located in the middle of a long input context