AutoML: Automated Machine Learning—tools that automatically select models and hyperparameters (e.g., H2O, AutoSklearn) usually within a fixed search space.
ReACT: Reasoning and Acting—a paradigm where agents generate reasoning traces and task-specific actions in an interleaved manner.
POMDP: Partially Observable Markov Decision Process—a framework used by many agents where the agent optimizes rewards based on a history of observations, often leading to long context requirements.
Pass@k: A metric measuring the probability that at least one of the top k generated solutions is correct or achieves a certain threshold.
Triton Kernel: A language and compiler for writing highly efficient custom Deep Learning primitives for GPUs.
Search Policy: A set of rules determining which node in the solution tree to expand next (e.g., prioritize debugging recent failures vs. improving the best solution).
Stateless Objective: An evaluation function that depends only on the current solution code, not on the history of how it was generated.