ReAct: Reason+Act—a prompting paradigm where LLMs generate reasoning traces before taking actions
Reflexion: An agent framework where LLMs verbally reflect on past failures to improve performance in subsequent trials
Tool Graph: A directed graph structure where nodes are tools and edges represent valid or likely transitions between them
Tool-use trajectory: The sequence of tools called by an agent to solve a specific problem (e.g., [Search -> Calculator -> Finish])
Exact Match (EM): A metric checking if the generated answer string exactly matches the ground truth
Markov assumption: The assumption that the next tool choice depends only on the current state (specifically the previous tool used), allowing the graph to limit choices to immediate successors
Semantic similarity search: Finding relevant items (tools) by comparing vector embeddings of the query and the item descriptions
Beam search: A search algorithm that explores a graph by expanding the most promising nodes in a limited set
Transition weight: A numerical score on an edge in the Tool Graph indicating the preference or probability of moving from one tool to another