LRM: Large Reasoning Model—models like OpenAI o1 or QwQ optimized for long chain-of-thought reasoning via reinforcement learning
RAG: Retrieval-Augmented Generation—enhancing model outputs by retrieving relevant external documents
Agentic RAG: A RAG system where the model autonomously decides when and what to search for, rather than a fixed retrieve-then-generate pipeline
Reason-in-Documents: A proposed module that analyzes raw retrieved documents to extract only reasoning-relevant information, preventing context pollution
Chain-of-Thought: A prompting/reasoning technique where models generate intermediate reasoning steps before the final answer
MCTS: Monte Carlo Tree Search—a heuristic search algorithm for decision processes, often used to guide reasoning paths
Catastrophic forgetting: The tendency of a neural network to completely and abruptly forget previously learned information upon learning new information
Standard RAG: Traditional RAG where retrieval happens once based on the initial query before generation begins