← Back to Paper List

SymAgent: A Neural-Symbolic Self-Learning Agent Framework for Complex Reasoning over Knowledge Graphs

Unknown authors
OpenReview
Agent KG Reasoning RL

📝 Paper Summary

Neuro-symbolic AI Agentic RAG pipeline Knowledge Graph Reasoning
SymAgent is a neural-symbolic agent that treats Knowledge Graphs as dynamic environments, using symbolic rules for planning and a self-learning loop to autonomously improve reasoning and identify missing knowledge.
Core Problem
Existing KGQA methods either treat KGs as static repositories (ignoring inherent logic) or assume KGs are complete (failing when data is missing), while LLMs often hallucinate on complex reasoning.
Why it matters:
  • Real-world Knowledge Graphs are often incomplete, causing standard semantic parsing (SPARQL) methods to fail execution.
  • Retrieval-augmented generation often fetches irrelevant or noisy subgraph information, confusing the LLM.
  • Current methods struggle to bridge the semantic gap between natural language questions and the structured, symbolic nature of KGs.
Concrete Example: For the question 'Where was the person who recorded song X born?', if the link between song X and the artist is missing in the KG, standard parsers return no answer. SymAgent detects this gap, uses a `searchWikidata` tool to find the artist in text, extracts the triple, and completes the reasoning.
Key Novelty
Neural-Symbolic Self-Learning Agent (SymAgent)
  • Agent-Planner: Uses LLM inductive reasoning to extract symbolic logic rules from the KG to guide question decomposition (e.g., inferring that 'birthplace' questions follow a specific relation chain).
  • Agent-Executor: Treats the KG as a dynamic environment, autonomously selecting tools (graph traversal or external text search) based on execution feedback to handle missing data.
  • Self-Learning: An iterative training loop where the agent explores, self-reflects to refine trajectories, and updates its policy using outcome-based rewards without human annotation.
Evaluation Highlights
  • +30.17% improvement in F1 score on complex reasoning datasets compared to GPT-4 using Qwen2-7B as the backbone.
  • Outperforms state-of-the-art ToG and RoG baselines on WebQSP, CWQ, and MetaQA-3hop datasets, achieving 78.54% Hits@1 on WebQSP.
  • Zero-shot generalization: Achieves 6x higher F1 score on domain-specific MetaQA-3hop compared to base LLM by effectively leveraging KG structure.
Breakthrough Assessment
8/10
Significant performance gains using smaller models (7B) against GPT-4. Effectively addresses the critical 'incomplete KG' problem via a novel self-learning tool-use framework.
×