Personalized PageRank (PPR): A graph traversal algorithm that ranks nodes based on their probability of being visited from a specific set of 'seed' nodes, allowing for context-specific importance scoring.
OpenIE: Open Information Extraction—a method for extracting structured relational triples (subject, relation, object) from unstructured text without a pre-defined schema.
Seed Nodes: The specific nodes in a graph (entities, triples, or passages) used to initialize the Personalized PageRank algorithm, directing the search neighborhood.
Associativity: The capacity to draw multi-hop connections between disparate pieces of knowledge (e.g., A is related to B, B is related to C, therefore A is related to C).
Sense-making: The ability to interpret larger, more complex, or uncertain contexts, often requiring the synthesis of information from multiple parts of a text.
Dense-Sparse Integration: Combining dense vector representations (rich context/passages) with sparse graph structures (specific entities/concepts) to balance specificity and generalizability.
Recognition Memory: A process where an LLM reviews retrieved graph components (triples) and filters out irrelevant ones before they are used for further reasoning, akin to human recognition.
NV-Embed-v2: A state-of-the-art text embedding model used as a baseline and backbone for retrieval in this paper.