RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
TRACe: A framework evaluating uTilization, Relevance, Adherence, and Completeness of RAG responses
Adherence: Whether the response is strictly grounded in the provided context (synonymous with faithfulness)
Utilization: The fraction of retrieved context tokens that are actually used by the generator to produce the response
Completeness: The fraction of relevant context information that is incorporated into the response
Relevance: The fraction of retrieved context tokens that are actually relevant to the input query
NLI: Natural Language Inference—determining if a hypothesis is true (entailment), false (contradiction), or unrelated (neutral) given a premise
DeBERTa: Decoding-enhanced BERT with disentangled attention—a transformer model optimized for natural language understanding tasks
Chain of Thought (CoT): Prompting strategy that asks the model to generate intermediate reasoning steps before the final answer