RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents.
Marginal Robustness Benefit: A metric proposed in this paper (Delta) measuring the performance difference between the best and worst robust training strategies.
RAAT: Retrieval-Augmented Adaptive Training—an adversarial training method that optimizes for the worst-case retrieval scenario.
IRM: Invariant Risk Minimization—a training objective aiming to learn representations that are stable across different environments (e.g., different retrieval qualities).
Golden Document: The specific retrieved document that contains the ground truth answer.
EM: Exact Match—a metric checking if the generated answer matches the ground truth string exactly.
SFT: Supervised Fine-Tuning—standard training on labeled data.
Contriever: A dense retrieval model used to fetch relevant documents from a corpus.
RetRobust: A robust training strategy that mixes relevant and irrelevant documents during training to teach the model to ignore noise.