← Back to Paper List

Cross-Lingual Knowledge Augmentation for Mitigating Generic Overgeneralization in Multilingual Language Models

S Ralethe, J Buys
University of Cape Town, South Africa
Proceedings of the 5th Workshop on Multilingual …, 2025 (2025)
Factuality KG Benchmark

📝 Paper Summary

Generic overgeneralization (GOG) Multilingual commonsense reasoning Knowledge injection
Combining commonsense (ConceptNet) and encyclopedic (DBpedia) knowledge via graph attention networks reduces the tendency of multilingual models to interpret generic statements like 'lions have manes' as universal truths.
Core Problem
Language models and humans exhibit 'generic overgeneralization' (GOG), incorrectly treating generic statements (e.g., 'ducks lay eggs') as universal ('all ducks lay eggs'), a bias exacerbated in low-resource languages.
Why it matters:
  • Models fail to capture nuanced semantics, treating exceptions (e.g., male ducks don't lay eggs) as false rather than valid variations
  • Low-resource languages like isiZulu and Sepedi lack digital corpora and have distinct morphological features (e.g., obligatory plural marking) that may amplify this bias
  • Prior mitigation work focused only on English and limited knowledge bases (ASCENT KB), leaving cross-lingual patterns and broader knowledge sources unexplored
Concrete Example: When given the true generic 'ducks lay eggs', models incorrectly accept the universal claim 'all ducks lay eggs' despite male ducks not laying eggs. In isiZulu, 'amabhubesi' (lions) uses a prefix 'ama-' that inherently marks plurality, potentially biasing models toward universal interpretations.
Key Novelty
Dual-Source Cross-Lingual Knowledge Injection
  • Projects large-scale knowledge graphs (ConceptNet for commonsense, DBpedia for facts) into low-resource languages using alignment tools
  • Demonstrates that commonsense knowledge specifically helps 'minority' generics (subset properties) while encyclopedic knowledge helps 'majority' generics (exceptions)
  • Uses a Graph Attention Network (QA-GNN) to reason over these projected knowledge subgraphs to correct the model's semantic interpretation
Evaluation Highlights
  • Combined knowledge (ConceptNet + DBpedia) reduces generic overgeneralization by ~67% (relative MRR reduction) in mT5-large across 5 languages
  • ConceptNet alone reduces overgeneralization for minority characteristic generics by 45-52% relative to baseline
  • Nguni languages (isiZulu, isiXhosa) show 4-7% higher baseline overgeneralization than Sotho-Tswana languages, suggesting morphological influence
Breakthrough Assessment
7/10
Strong empirical evidence for cross-lingual knowledge transfer addressing a specific semantic bias. First study of this phenomenon in African languages, though limited by reliance on translated evaluation data.
×