ReAct: Reason+Act—a paradigm where LLMs interleave reasoning traces with tool execution steps to solve tasks
Deep Research: An agentic paradigm focused on long-horizon, multi-step information synthesis and report generation, rather than simple Q&A
MoE: Mixture of Experts—a neural network architecture where different sub-models (experts) specialize in different parts of the input space
RACE: Report Agent Comparison Evaluation—a metric for scoring research reports against a reference based on criteria like comprehensiveness and depth
Reflective Internalization: A process of rewriting multi-agent feedback loops into a single agent's self-correction thought process for training data
Hallucination: When an LLM generates factually incorrect information or references non-existent products
SFT: Supervised Fine-Tuning—training a pre-trained model on a smaller, specific dataset to adapt it to a task
Trajectory: The sequence of thoughts, actions (tool calls), and observations an agent takes to solve a problem