Structured Intent (SI): A semi-structured frame (key-value pairs) extracted from a natural language question, capturing intent facets like Answer Type, Entities, Time, and Relation
Verbalization: The process of converting structured data (KG triples, table rows) into natural language sentences so they can be processed uniformly with text
Clocq: A specific retrieval method for Knowledge Graphs that fetches relevant subgraphs and disambiguates entities in a single step
GNN (Graph Neural Network): A neural network designed to operate on graph structures; used here to score the relevance of evidence nodes based on their connections to entity nodes
Cross-Encoder: A transformer model that processes two inputs (query and document) simultaneously to output a relevance score, typically more accurate but slower than bi-encoders
BM25: A classical probabilistic information retrieval function used to rank documents based on term frequency and inverse document frequency
CompMix: A benchmark dataset specifically designed for evaluating QA systems that must operate over heterogeneous sources (text, tables, KG)
TimeQuestions: A benchmark dataset focusing on temporal question answering requiring understanding of time points and intervals
BART: A transformer encoder-decoder model used here for the specific sub-task of generating the Structured Intent from the question
DOM-tree: Document Object Model tree—the structural representation of a webpage; used here to extract context labels for table rows