RAG: Retrieval-Augmented Generation—systems that enhance LLM outputs by retrieving relevant documents from an external corpus
Entropy: A measure of uncertainty in the model's output distribution; high entropy implies the model is unsure which token to select
Hallucination: A phenomenon where LLMs generate text that is coherent but factually incorrect or ungrounded
Entity: A specific object, person, location, or concept identified within the text (e.g., 'Bill Clinton', 'Arkansas')
FLARE: A dynamic RAG method that triggers retrieval when the probability of generated tokens falls below a threshold
RETRO: A retrieval-enhanced transformer that retrieves external information at fixed intervals (e.g., every chunk of tokens)
TF-IDF: Term Frequency-Inverse Document Frequency—a statistical measure used to evaluate how important a word is to a document in a collection
BM25: A probabilistic retrieval function used to rank documents based on the query terms appearing in each document