Zebra Puzzle: A type of logic grid puzzle requiring the deduction of unique attribute assignments (e.g., 'The Brit lives in the red house') based on a set of clues
CSP: Constraint Satisfaction Problem—a mathematical problem defined by objects whose state must satisfy a number of constraints or limitations
Z3: A high-performance theorem prover (SMT solver) from Microsoft Research used here to verify puzzle uniqueness and measure complexity via conflict counts
Search Space: The total number of possible configurations for a puzzle before applying specific clues; calculated as (N!)^M
Non-monotonic reasoning: Reasoning where adding new information (clues) may invalidate previous conclusions, requiring backtracking
Pass@N: An evaluation metric measuring the probability that at least one correct solution is generated out of N independent samples
Best-of-N: A sampling strategy where N solutions are generated and the best one (verified by a heuristic or reward model) is selected
CoT: Chain-of-Thought—prompting models to generate intermediate reasoning steps before the final answer