OCR: Optical Character Recognition—technology used by agents to read text from screenshots
ADB: Android Debug Bridge—a command-line tool allowing agents (especially framework-based ones) to send commands like taps and swipes to the device
Accessibility Services: An Android framework designed for users with disabilities, often repurposed by agents to read screen content and perform clicks programmatically
GroundingDINO: An open-set object detection model used by agents to identify and locate UI elements (icons, buttons) based on text descriptions
System-level Agents: Agents developed by OEMs (e.g., Honor, Vivo) with deep OS integration and elevated privileges
Intents: Messaging objects in Android used to request an action from another app component (e.g., launching an app)
Transparent Overlay: An attack where a malicious app draws an invisible window over a target, intercepting clicks intended for the visible app beneath it