RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents.
HyDE: Hypothetical Document Embedding—generating a fake answer to a query, embedding that answer, and using it to search for real documents.
MMR: Maximal Marginal Relevance—a retrieval method that selects documents based on a trade-off between relevance to the query and diversity among the selected documents.
Sentence Window Retrieval: Retrieving a single relevant sentence based on similarity, then expanding the context window to include surrounding sentences for the generation step.
Cross-encoder: A model that processes the query and document simultaneously (jointly) to output a relevance score, often used for reranking.
LLM Rerank: Using a Large Language Model to evaluate and re-order retrieved documents based on relevance to the query.
Naive RAG: A standard baseline using fixed-size text chunking and cosine similarity retrieval without additional optimization.
Cohere Rerank: A commercial cross-encoder service used to re-score and re-order retrieved documents.