← Back to Paper List

Attributing Culture-Conditioned Generations to Pretraining Corpora

Huihan Li, Arnav Goel, Keyu He, Xiang Ren
University of Southern California
arXiv (2024)
Pretraining Factuality Benchmark

📝 Paper Summary

Cultural bias in LLMs Pretraining data attribution Memorization analysis
The MEMOed framework attributes cultural bias in LLM generations to pretraining data patterns, distinguishing between true memorization, diffuse associations, and cross-cultural generalizations driven by frequency imbalances.
Core Problem
LLMs often exhibit cultural biases in open-ended generation, favoring high-frequency cultures and producing templated or inaccurate outputs for marginalized ones, but the link to specific pretraining data patterns is unclear.
Why it matters:
  • Models default to Western or high-frequency cultural norms, marginalizing real-world diversity
  • Understanding the root cause (memorization vs. generalization) is necessary for effective mitigation (unlearning or data augmentation)
  • Current attribution methods often focus on specific facts rather than broader cultural association patterns
Concrete Example: When asked about food for a Japanese neighbor, a model generates 'Miso Soup' (memorized). When asked about a low-frequency culture, it might generate generic 'meat' (diffuse association) or incorrectly attribute a 'kimono' to a Korean neighbor (cross-culture generalization).
Key Novelty
MEMOed (MEMOrization from pretraining document) Framework
  • Classifies generated symbols into four categories: Memorized (grounded in data), Diffuse (generic high-frequency terms), Cross-culture Generalization (misattributed memorization), and Weak Association (conceptual synthesis)
  • Uses 'contributory documents' analysis to measure if a symbol-culture pair appears close together with high relevance in pretraining data, distinguishing rote memorization from other behaviors
Evaluation Highlights
  • Found that 46% of food symbols and 26% of clothing symbols generated by OLMo-7B are due to direct memorization of pretraining data
  • Memorized associations strongly correlate with culture frequency in pretraining data, leaving low-frequency cultures relying on generic symbols
  • Identified 'Diffuse Association' symbols (e.g., 't-shirt') that appear in generations for >50% of cultures despite lacking specific cultural ties
Breakthrough Assessment
7/10
Provides a rigorous framework for attributing cultural generation behavior to pretraining data. While the scope is limited to one model (OLMo-7B), the taxonomy of associations (Memorized vs. Diffuse vs. Cross-culture) is a valuable analytical tool.
×