SMC: Sequential Monte Carlo—a statistical method that maintains a population of 'particles' (candidate solutions), extending them step-by-step and resampling to focus on the most promising ones
Pass@1: The probability that a single model generation (or the top-ranked generation) solves the task correctly
Importance Sampling: A technique to estimate properties of a target distribution by sampling from a different proposal distribution and weighting samples by the ratio of their probabilities
Planner: A capable LM (e.g., GPT-4o) that writes the inference code/program
Follower: A smaller or equal-sized LM (e.g., Llama-3) that executes the inference program by generating tokens or providing probabilities
PPL: Probabilistic Programming Language—a programming framework that allows defining probabilistic models and performing inference on them automatically
Particle: In SMC, a single candidate generation sequence being evolved in parallel with others
LLAMMPPL: The specific probabilistic programming library used in this paper, allowing LMs to be used as distributions within Python programs
Coherency: A measure of how fluent and natural the generated text reads, evaluated here by an LLM-as-a-judge