| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| InfoGain-RAG consistently outperforms naive RAG, open-source BGE rerankers, and proprietary GTE rerankers across various LLM backbones on NaturalQA. | ||||
| NaturalQA | Exact Match | 40.3% | 58.1% | +17.8% |
| NaturalQA | Exact Match | 53.9% | 58.1% | +4.2% |
| NaturalQA | Exact Match | 35.8% | 53.3% | +17.5% |
| Generalization to Fact Verification tasks shows significant improvements. | ||||
| FM2 | Exact Match | 73.6% | 83.4% | +9.8% |
| InfoGain-RAG outperforms self-reflective and retriever-optimization methods. | ||||
| NaturalQA | Exact Match | 49.5% | 51.9% | +2.4% |
| NaturalQA | Exact Match | 42.3% | 54.3% | +12.0% |