RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
Prompt Tuning: A parameter-efficient fine-tuning method that appends learnable continuous vectors (soft prompts) to the input while keeping the model frozen
Multi-task Prompt Tuning: Training separate prompt vectors for different tasks (e.g., planning vs. answering) that share a common underlying representation
ROUGE-L: Evaluation metric measuring the longest common subsequence between generated and reference text
MAUVE: A metric measuring the gap between neural text and human text distributions
Self-RAG: A baseline RAG method that uses reflection tokens to critique retrieved documents and generated content
Hadamard product: Element-wise multiplication of two matrices
Soft prompt: Learnable continuous vectors prepended to the input embeddings of a language model to condition its behavior
Contriever: A dense information retrieval model used to find relevant documents
greedy decoding: A generation strategy where the model selects the highest probability token at each step