Pivot LLM: The language model used to generate seed cases and adversarial perturbations (e.g., ChatGPT)
Seed Test Cases: QA pairs that the pivot LLM can answer correctly using provided evidence (Open-Book Correct)
Answer Swapping: An adversarial attack where the answer in the evidence is replaced with a factually wrong but context-appropriate alternative to test if the model updates its belief
Context Enriching: An adversarial attack where the evidence is expanded with extra relevant information to make reasoning harder, testing robustness to dense context
Closed-book setting: Answering questions using only the model's internal parametric memory without external evidence
Open-book setting: Answering questions provided with external supporting evidence
Prompt Chaining: Breaking a complex task (like generating adversarial data) into a sequence of smaller, connected prompts
Entailment Accuracy: A metric checking if the generated answer is logically entailed by the reference answer, more lenient than exact match