VAE: Variational Autoencoder—a neural network that learns to compress data into a latent space (z) and reconstruct it, used here to model query difficulty
latent difficulty: A learned internal representation (z) encoding how hard a query is, used to control workflow complexity
operator: A specific combination of an LLM and a collaboration protocol (e.g., 'GPT-4o + Chain-of-Thought')
DAG: Directed Acyclic Graph—a structure where information flows in one direction without loops, representing the agent workflow
MoE: Mixture-of-Experts—a technique where different parts of a network (experts) are activated for different inputs; here used to select agent operators
LLM routing: The process of assigning a specific Large Language Model (e.g., GPT-4 vs. Llama-3) to a task based on difficulty and cost
reparameterization trick: A method to allow gradient descent through stochastic nodes in a neural network by separating randomness from parameters
pass@k: A metric measuring the probability that at least one of k generated solutions is correct