RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
Naive RAG: The earliest RAG paradigm following a simple 'Retrieve-Read' process without complex optimization
Advanced RAG: RAG systems incorporating pre-retrieval (e.g., query rewriting) and post-retrieval (e.g., re-ranking) optimizations
Modular RAG: Flexible RAG architectures incorporating specialized modules like Search, Memory, Routing, and Predict to handle diverse tasks
Hallucination: The generation of factually incorrect or nonsensical content by an LLM
HyDE: Hypothetical Document Embeddings—a technique where an LLM generates a hypothetical answer to be used for retrieval instead of the raw query
RAG-Fusion: A technique using multi-query generation and reciprocal rank fusion to improve retrieval quality
ICL: In-Context Learning—the ability of LLMs to learn from examples provided in the prompt without parameter updates
Fine-tuning (FT): Retraining a pre-trained model on a specific dataset to adapt its weights
Dense Retrieval: Retrieval based on semantic vector similarity rather than keyword matching