← Back to Paper List

Spiral of Silences: How is Large Language Model Killing Information Retrieval?--A Case Study on Open Domain Question Answering

X Chen, B He, H Lin, X Han, T Wang, B Cao, L Sun…
Center for Research on Intelligent Perception and Computing, Institute of Automation, Chinese Academy of Sciences
arXiv, 4/2024 (2024)
RAG QA Factuality

📝 Paper Summary

Impact of AI-generated content (AIGC) on Information Retrieval Feedback loops in RAG systems
The continuous influx of LLM-generated text into web corpora creates a feedback loop where retrieval systems increasingly prioritize AI text over human content, eventually degrading retrieval accuracy.
Core Problem
As LLMs flood the web with synthetic text, retrieval systems ingest this content, potentially creating a feedback loop that alters retrieval dynamics and marginalizes human-authored information.
Why it matters:
  • Synthetic content is predicted to dominate 90% of the web by 2026, fundamentally changing the data retrieval systems rely on
  • Biased ranking algorithms may create a 'Spiral of Silence' where accurate human knowledge is pushed out of top search results
  • Prior work focuses on short-term RAG performance, missing the long-term ecological impact of the retrieval-generation feedback loop
Concrete Example: In a simulation using the NQ dataset, initial inclusion of LLM text boosts retrieval accuracy. However, after 10 iterations of generating and indexing content, the retrieval system ranks incorrect LLM answers higher than correct human ones, causing Acc@5 to drop by 21.4%.
Key Novelty
Digital 'Spiral of Silence' in RAG Feedback Loops
  • Proposes an iterative simulation pipeline where an RAG system generates content that is immediately indexed and used for future retrieval, modeling the real-world web feedback loop
  • Identifies a 'Spiral of Silence' effect where retrieval algorithms progressively favor LLM-generated text, rendering human-authored content invisible in top rankings over time
Architecture
Architecture Figure Figure 1
Conceptual diagram of the 'Spiral of Silence' simulation pipeline in RAG systems.
Evaluation Highlights
  • Long-term retrieval degradation: Acc@5 drops by 21.4% on Natural Questions (NQ) and 19.4% on PopQA after 10 iterations of the feedback loop
  • Bias towards AI text: Human-authored content in the top-50 search results drops to below 10% across all datasets after 10 iterations
  • Diversity collapse: Self-BLEU scores of top-ranked results consistently rise, indicating severe homogenization of information presented to users
Breakthrough Assessment
8/10
Crucial study on the ecological stability of the web information ecosystem. While the simulation is simplified, the identification of the 'Spiral of Silence' mechanism in RAG offers a vital warning for future search architecture.
×