POMDP: Partially Observable Markov Decision Process—a mathematical framework for modeling decision-making where the agent cannot directly observe the full state of the environment
Non-Autoregressive Generation: Generating a sequence (like a sentence or dialogue) in parallel or iteratively, rather than one token/turn strictly after another
Mask-and-Fill: A technique where parts of a sequence are hidden (masked) and a model predicts the missing content, used here to refine dialogue turns
MAS: Multi-Agent Simulation—using multiple LLMs playing different roles (user, assistant, tool) to generate conversation data by interacting with each other
Trajectory Skeleton: A structural outline of a dialogue containing the sequence of actions and observations but lacking full natural language detail
BFCL: Berkeley Function Calling Leaderboard—a benchmark for evaluating the ability of LLMs to call functions correctly
ACEBench: A benchmark for evaluating agentic capabilities
Autoregressive: Generating output one step at a time, where each step depends on the previous ones