← Back to Paper List

MM-OpenFGL: A Comprehensive Benchmark for Multimodal Federated Graph Learning

Xunkai Li, Yuming Ai, Yinlin Zhu, Haodong Lu, Yi Zhang, Guohao Fu, Bowen Fan, Qiangqiang Dai, Rong-Hua Li, Guoren Wang
arXiv (2026)
MM Benchmark KG

📝 Paper Summary

Federated Graph Learning Multimodal Learning
MM-OpenFGL establishes the first comprehensive benchmark for Multimodal Federated Graph Learning (MMFGL), providing standardized datasets, simulation strategies for modality/topology heterogeneity, and a modular evaluation framework.
Core Problem
Existing Federated Graph Learning (FGL) frameworks focus on single-modality graphs and fail to address the specific challenges of multimodal data distribution, such as disjoint modalities and cross-client structural mismatches.
Why it matters:
  • Real-world multimodal graphs (e.g., social networks with text/images) are naturally distributed across platforms due to privacy and competition, preventing centralized training.
  • Naive application of standard FGL methods to multimodal data often performs worse than isolated training because they cannot reconcile cross-modal semantic conflicts.
  • The lack of a unified benchmark and formal problem definition hinders progress, leaving researchers without tools to rigorously evaluate MMFGL algorithms.
Concrete Example: Different social media platforms maintain independent user graphs where posts contain both text and images. Cross-platform sharing is prohibited. A naive federated aggregation might average weights from a text-heavy platform with an image-heavy one without alignment, degrading performance compared to training locally on just one modality.
Key Novelty
Tri-dimensional Simulation Strategy for MMFGL
  • Formalizes MMFGL by introducing simulation strategies across three axes: Modality (IID vs. NonIID/missing modalities), Topology (Available vs. Unavailable/hidden structure), and Label (IID vs. NonIID).
  • Integrates a modular pipeline supporting both End-to-End training (task-specific) and Two-Stage training (federated graph foundation models), allowing evaluation of pre-training transferability.
Evaluation Highlights
  • Multimodal GNNs outperform single-modality baselines (e.g., +significant margins for MM-GCN vs GCN), confirming the necessity of fusing topology and multimodal semantics.
  • Specialized heterogeneous FGL methods like MH-pFLID consistently outperform standard FL baselines in challenging Modality-NonIID settings across 7 datasets.
  • Graph Foundation Models (GFMs) demonstrate superior scalability, successfully handling complex downstream tasks like modality generation where traditional MM-GNNs fail.
Breakthrough Assessment
9/10
Foundational work that defines a new sub-field (MMFGL). It provides the first standardized benchmark, datasets, and problem formalization, filling a critical gap in federated learning research.
×