Thought Template: A structured abstraction of a problem-solving method (containing name, tags, description, scope, and steps) used to guide reasoning.
Template Trajectory: A sequence of high-level thought templates selected to solve a specific problem.
Hierarchical RL: Reinforcement learning where a high-level policy (navigator) selects abstract actions (templates), which are then executed/instantiated by a lower-level policy.
Navigator: The module (model) responsible for analyzing the problem and planning the sequence of templates.
RAG: Retrieval-Augmented Generation—enhancing model output by retrieving relevant external data.
Inference Scaling: Techniques to improve model performance by increasing computation during the inference phase (e.g., search, sampling).
MCTS: Monte Carlo Tree Search—a heuristic search algorithm for decision processes, often used in game play and reasoning search.
Instantiated Reasoning: The process of filling a high-level abstract template with the specific numbers and details of the current problem.
AIME: American Invitational Mathematics Examination—a challenging math competition benchmark.