RAG: Retrieval-Augmented Generation—enhancing LLMs by providing external documents during generation
LoRA: Low-Rank Adaptation—an efficient fine-tuning method that updates only a small subset of parameters
CoT: Chain-of-Thought—a prompting technique encouraging models to reason step-by-step before answering
STR-EM: String Exact Match—a metric measuring if the exact answer string appears in the generation
RAGAS: A framework for reference-free evaluation of RAG systems using metrics like Faithfulness and Relevancy
Exact Match (EM): A metric checking if the generated answer is character-for-character identical to the ground truth
Faithfulness: A metric measuring if the generated answer is factually consistent with the retrieved context
Relevancy: A metric measuring if the generated answer actually addresses the query
TRL: Transformer Reinforcement Learning—a library often used for SFT and RLHF, utilized here for the training module