GUI-DFS: Graphical User Interface Depth-First Search—an exploration algorithm where the agent systematically clicks through UI elements (menus, buttons) to map out available functions.
Test-time scaling: The practice of spending more computation during inference (e.g., generating multiple candidate plans or retrying steps) to improve performance, often at the cost of higher latency.
LoRA: Low-Rank Adaptation, a parameter-efficient fine-tuning technique that freezes the pretrained weights and trains only small low-rank matrices added to selected layers.
Action Primitives: Pre-defined, parameterized low-level behaviors (e.g., 'drag from [x1,y1] to [x2,y2]') that ensure precise execution for fine-grained tasks.
Grounding: The process of linking abstract concepts (e.g., 'the red button') to concrete coordinates or UI elements on the screen.
DFS: Depth-First Search—an algorithm for traversing tree or graph structures that explores as far as possible along each branch before backtracking.
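
The DFS and GUI-DFS entries above can be illustrated with a minimal sketch. It assumes the UI is already available as a graph mapping each element to the elements revealed by clicking it; the `UI_GRAPH` contents and the `dfs` helper are hypothetical and stand in for a real agent that would click live widgets and observe the screen.

```python
from typing import Dict, List

# Hypothetical UI hierarchy (all names invented for illustration):
# each key is an element, each value lists what clicking it reveals.
UI_GRAPH: Dict[str, List[str]] = {
    "MainWindow": ["File", "Edit"],
    "File": ["New", "Open"],
    "Edit": ["Undo"],
    "New": [],
    "Open": [],
    "Undo": [],
}


def dfs(graph: Dict[str, List[str]], start: str) -> List[str]:
    """Return elements in depth-first visiting order, starting at `start`."""
    visited: List[str] = []
    seen = set()
    stack = [start]
    while stack:
        node = stack.pop()
        if node in seen:
            continue  # already explored via another path
        seen.add(node)
        visited.append(node)  # a real agent would "click" the element here
        # Push children in reverse so the first child is explored first.
        stack.extend(reversed(graph.get(node, [])))
    return visited


order = dfs(UI_GRAPH, "MainWindow")
print(order)
```

The traversal finishes everything under "File" before moving to "Edit", which is the "as far as possible along each branch before backtracking" behavior in the definition.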