CoT: Chain of Thought, a prompting technique in which the examples in the prompt include intermediate reasoning steps, encouraging the model to generate similar steps before giving its final answer
Blocksworld: A classic planning domain involving moving blocks between a table and stacks according to specific rules
PDDL: Planning Domain Definition Language—a standard formal language used to define planning problems (predicates, actions, preconditions, effects)
Zero-Shot CoT: A prompting strategy that appends 'Let's think step by step' to the query without providing examples
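STRIPS: Stanford Research Institute Problem Solver, an early automated planner whose action representation (preconditions, add effects, delete effects) became a standard formalism for describing planning problems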
In-context learning: The ability of a model to improve its performance on a task by observing examples provided in the prompt, without any update to its weights
Table-to-Stack: A simplified subset of Blocksworld problems where all blocks start on the table and must be arranged into a single target stack
Length Generalization: The ability of a model to apply a learned procedure to problem instances that are longer or larger (e.g., more steps) than those seen in training/examples
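
To make the Table-to-Stack and STRIPS entries concrete, the following is a minimal Python sketch, not taken from the source. It assumes the conventional Blocksworld predicates (`ontable`, `clear`, `holding`, `on`, `handempty`) and models only the two actions this subset needs, `pick-up` and `stack`. All function names (`initial_state`, `pick_up`, `stack`, `table_to_stack_plan`, `execute`) are hypothetical and chosen for illustration.

```python
from typing import List, Set, Tuple

Fact = Tuple[str, ...]    # e.g. ("on", "b", "a") or ("handempty",)
Action = Tuple[str, ...]  # e.g. ("pick-up", "b") or ("stack", "b", "a")


def initial_state(blocks: List[str]) -> Set[Fact]:
    """Table-to-Stack start: every block is on the table and clear."""
    state: Set[Fact] = {("handempty",)}
    for b in blocks:
        state.add(("ontable", b))
        state.add(("clear", b))
    return state


def pick_up(state: Set[Fact], b: str) -> Set[Fact]:
    """STRIPS-style action: check preconditions, delete them, add effects."""
    pre = {("ontable", b), ("clear", b), ("handempty",)}
    if not pre <= state:
        raise ValueError(f"pick-up {b}: preconditions not met")
    return (state - pre) | {("holding", b)}


def stack(state: Set[Fact], b: str, target: str) -> Set[Fact]:
    """Place the held block b on top of a clear block target."""
    pre = {("holding", b), ("clear", target)}
    if not pre <= state:
        raise ValueError(f"stack {b} {target}: preconditions not met")
    return (state - pre) | {("on", b, target), ("clear", b), ("handempty",)}


def table_to_stack_plan(bottom_to_top: List[str]) -> List[Action]:
    """Build the target stack from the bottom up, two actions per added block."""
    plan: List[Action] = []
    for below, above in zip(bottom_to_top, bottom_to_top[1:]):
        plan.append(("pick-up", above))
        plan.append(("stack", above, below))
    return plan


def execute(state: Set[Fact], plan: List[Action]) -> Set[Fact]:
    """Apply each action in order; raises ValueError on an invalid step."""
    for action in plan:
        if action[0] == "pick-up":
            state = pick_up(state, action[1])
        elif action[0] == "stack":
            state = stack(state, action[1], action[2])
        else:
            raise ValueError(f"unknown action: {action[0]}")
    return state
```

Under these assumptions a target stack of n blocks needs 2(n - 1) actions, so calling `table_to_stack_plan` with a longer block list than any shown in the prompt examples yields exactly the kind of larger instance that the Length Generalization entry refers to.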