MDM: Masked Diffusion Model—a generative model that iteratively unmasks tokens in a sequence, allowing for non-sequential generation orders
Slot: A contiguous sub-sequence of tokens within the target text that is generated as a single unit
Plan-and-Infill: A decoding framework where the model first decides which parts of the text to generate (planning) and then generates them (infilling)
MCTS: Monte Carlo Tree Search—a heuristic search algorithm that navigates a decision tree by simulating future outcomes to find optimal moves
PUCT: Predictor-Upper Confidence Tree—a selection criterion in MCTS that balances exploiting high-value actions with exploring actions that have high prior probability
Rollout: A simulation step in MCTS where the algorithm plays out a sequence of random or heuristic moves from a certain state to estimate its future value
MBPP: Mostly Basic Python Problems—a benchmark dataset for evaluating code generation capabilities
ReFusion: A state-of-the-art plan-and-infill method for MDMs that McDiffuSE compares against