Atomic Task: A simple agentic task solvable with a single specific tool invocation, generated by reversing the logic from answer to question
Search-R1: A baseline agentic workflow utilizing reinforcement learning for optimization
SFT: Supervised Fine-Tuning—training a model on labeled examples (in this case, generated task trajectories) to improve its instruction-following and tool-use capabilities
Rejection Sampling: A technique used here to filter out low-quality generated tasks by verifying if they meet specific criteria (e.g., solvable by tools but not by LLM alone)
Depth-based extension: A method to increase task complexity by creating a chain of dependencies, where the output of one step becomes the input for the next
Width-based extension: A method to increase task complexity by combining multiple independent sub-problems into a single query
Agentic task: A problem requiring autonomous multi-step reasoning, tool use, and environmental interaction to solve