← Back to Paper List

KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs

D Wu, Y Yan, Z Liu, Z Liu, M Sun
Tsinghua University, Microsoft Research
arXiv, 6/2025 (2025)
RAG KG QA RL

📝 Paper Summary

Graph-based RAG pipeline
KG-Infused RAG integrates pre-existing large-scale knowledge graphs into RAG pipelines via spreading activation and preference learning to improve structured knowledge retrieval without expensive ad-hoc graph construction.
Core Problem
Existing RAG methods either rely solely on text corpora (missing structural connections) or require building ad-hoc knowledge graphs from scratch (computationally expensive and limited in scope), failing to leverage existing high-quality large-scale KGs.
Why it matters:
  • Corpus-based methods often miss multi-hop connections between entities required for complex reasoning tasks
  • Ad-hoc KG construction (e.g., GraphRAG) is resource-intensive and prone to extraction errors, making it impractical for very large corpora
  • Current approaches treat KGs and text corpora as separate silos rather than integrating them to mutually enhance retrieval quality
Concrete Example: For a multi-hop question, a standard RAG might retrieve passages about Entity A and Entity B separately but miss the bridge entity connecting them. KG-Infused RAG uses spreading activation on a KG to find the bridge entity, expanding the query to retrieve the full reasoning chain.
Key Novelty
KG-Infused RAG
  • Simulates 'spreading activation' (a cognitive psychology concept) on pre-existing KGs to iteratively retrieve related entities and facts, starting from query seeds
  • Uses the retrieved KG subgraph to expand the natural language query, guiding the subsequent corpus retrieval to be more context-aware
  • Optimizes the knowledge augmentation stage using Direct Preference Optimization (DPO) to better align the integration of structured and textual knowledge
Architecture
Architecture Figure Figure 1
The overall framework of KG-Infused RAG showing the three main stages: KG-Guided Spreading Activation, KG-Based Query Expansion, and KG-Augmented Answer Generation.
Evaluation Highlights
  • Outperforms vanilla RAG by 3.9% to 17.8% across five QA benchmarks (HotpotQA, 2WikiMQA, MuSiQue, Bamboogle, StrategyQA)
  • Surpasses ad-hoc KG construction baselines (GraphRAG, LightRAG) in both performance and efficiency by leveraging pre-existing KGs
  • Demonstrates plug-and-play capability by boosting performance when integrated into advanced RAG systems like Self-RAG and DeepNote
Breakthrough Assessment
8/10
Significant for efficiently bridging the gap between structured KGs and unstructured text retrieval without the high cost of graph construction. Strong empirical gains and plug-and-play versatility.
×