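Fusion-in-Decoder: A generative reader architecture in which each retrieved passage is encoded independently, together with the question, and the decoder attends jointly over the concatenated encoder outputs
DPR: Dense Passage Retrieval, a method that uses two BERT encoders (one for questions, one for passages) and retrieves the passages whose dense embeddings are most similar to the question embedding
BM25: A bag-of-words ranking function used in information retrieval that scores a document's relevance to a search query from term frequency, inverse document frequency, and document length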
Exact Match: An evaluation metric that counts a prediction as correct only if it matches one of the ground truth answers exactly after normalization
T5: Text-to-Text Transfer Transformer, a pre-trained encoder-decoder language model that casts every NLP problem as a text-to-text generation task
Open Domain QA: Answering questions using a large collection of documents (like Wikipedia) without knowing in advance which specific document contains the answer
Extractive models: QA systems that select a specific span of text from a document as the answer
Generative models: QA systems that generate new text for the answer, potentially synthesizing information
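
To make the Exact Match entry above concrete, here is a minimal sketch of how the metric is often computed. The glossary does not say which normalization is applied, so the steps below (lowercasing, stripping punctuation, dropping the articles "a", "an", and "the", and collapsing whitespace) are an assumption that follows the common SQuAD-style convention. The function names `normalize_answer` and `exact_match` are illustrative only.

```python
import re
import string


def normalize_answer(text: str) -> str:
    """Normalize an answer string (assumed SQuAD-style convention)."""
    text = text.lower()
    # Remove ASCII punctuation characters.
    text = "".join(ch for ch in text if ch not in string.punctuation)
    # Remove English articles as whole words.
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    # Collapse runs of whitespace and trim the ends.
    return " ".join(text.split())


def exact_match(prediction: str, ground_truths: list[str]) -> bool:
    """Return True if the prediction equals any reference after normalization."""
    normalized_prediction = normalize_answer(prediction)
    return any(
        normalized_prediction == normalize_answer(truth)
        for truth in ground_truths
    )
```

Under this normalization, `exact_match("The Eiffel Tower.", ["Eiffel Tower"])` counts as correct, while `exact_match("Paris, France", ["Paris"])` does not, because the metric gives no partial credit for answers that merely contain a reference.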