NLI: Natural Language Inference—determining if a hypothesis is true (entailment), false (contradiction), or unrelated (neutral) given a premise
CG2C: Context Graph to Claim—the proposed method for generating synthetic training data by extracting graph structures from documents
MHQA: Multi-Hop Question Answering—tasks requiring reasoning across multiple documents or paragraphs to find an answer
FactCG: The fact-checking model trained using the CG2C synthetic data
Context Graph: A graph representation of a document where nodes are entities and edges are relations described in the text
LLM-AGGREFACT: A benchmark dataset of real LLM hallucinations across various tasks (summarization, QA, data-to-text)
DiRe: Disconnected Reasoning—a phenomenon where models solve multi-hop tasks using shortcuts rather than connecting multiple facts