← Back to Paper List

Youtu-GraphRAG: Vertically Unified Agents for Graph Retrieval-Augmented Complex Reasoning

J Dong, S An, Y Yu, QW Zhang, L Luo, X Huang, Y Wu…
Tencent Cloud ADP
arXiv, 8/2025 (2025)
RAG KG Agent Reasoning Benchmark

📝 Paper Summary

Graph-based RAG pipeline Agentic RAG pipeline
Youtu-GraphRAG unifies graph construction and retrieval through a shared schema, using dual-perception community detection and agentic query decomposition to enhance complex reasoning while reducing token costs.
Core Problem
Existing GraphRAG methods optimize graph construction and retrieval in isolation, leading to misalignment where the constructed graph structure does not effectively support complex retrieval needs, especially under domain shifts.
Why it matters:
  • Prior methods rely on loose OpenIE extraction, introducing noise and irrelevant trivia that degrade graph usability
  • Current community detection algorithms focus on topology but neglect semantic coherence, resulting in suboptimal knowledge clustering
  • Evaluation often suffers from data contamination (knowledge leakage), preventing fair assessment of RAG systems on pre-trained LLMs
Concrete Example: In a query like 'Which pharmaceutical companies manufacture diabetes drugs?', a standard decomposition might hallucinate connections. Without a schema, extraction tools might create vague relations like 'associated_with' instead of 'manufactures', causing the retrieval agent to miss the specific manufacturer-drug link required for the answer.
Key Novelty
Vertically Unified Agentic GraphRAG via Graph Schema
  • Treats graph construction as schema-bounded constraint generation, where an agent automatically expands a seed schema to fit new domains, ensuring precise entity/relation types
  • Introduces 'Dually-Perceived Community Detection' that clusters nodes based on both structural links and semantic embedding similarity, creating a 4-level knowledge tree
  • Deploys an agentic retriever that interprets the *same* schema to decompose complex questions into parallel sub-queries (node, triple, or community level) with iterative reflection
Architecture
Architecture Figure Figure 1 (implied)
The complete Youtu-GraphRAG framework unifying construction and retrieval.
Evaluation Highlights
  • Moves the Pareto frontier with up to 90.71% saving of token costs compared to GraphRAG (Microsoft) baselines
  • Achieves up to 16.62% higher accuracy over state-of-the-art baselines on complex reasoning benchmarks
  • Demonstrates robustness on a newly proposed 'Anonymity Reversion' task designed to measure performance without parametric knowledge leakage
Breakthrough Assessment
8/10
Strong engineering contribution unifying construction and retrieval via schema. The dual-perception clustering and anonymity evaluation are significant methodological improvements, though the core concept refines rather than replaces GraphRAG.
×