VLM: Vision-Language Model, an AI model that processes both images and text to generate text or actions
PDDL: Planning Domain Definition Language, a standardized language for defining the states, actions, and goals of a planning problem using logical predicates
POMDP: Partially Observable Markov Decision Process, a mathematical framework for decision-making in which the agent cannot directly observe the full state of the world
CoT: Chain-of-Thought, a prompting technique in which the model is encouraged to generate intermediate reasoning steps before giving the final answer
OmniGibson: A high-fidelity physics simulator used for training and evaluating embodied AI agents in realistic household environments
Pre-caution: A safety condition that must be satisfied BEFORE a specific risk-prone action is taken (e.g., 'ensure stove is clear before turning on')
Post-caution: A safety condition that must be satisfied AFTER a specific action is taken (e.g., 'turn off stove after cooking')
Process-oriented evaluation: Evaluating an agent's performance by checking constraints at specific steps during execution, rather than only checking the final result
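
The last three terms fit together, and a small sketch may make the relationship concrete. The Python below is illustrative only: the `Step` and `Caution` classes, the `check_trace` function, the state keys, and the action names are all hypothetical, and the rule that a post-caution counts as satisfied if its condition holds at any point after the triggering action is an assumption rather than a definition taken from the source.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

State = Dict[str, bool]  # hypothetical symbolic state: predicate name -> truth value


@dataclass
class Step:
    action: str
    before: State  # state observed just before the action
    after: State   # state observed just after the action


@dataclass
class Caution:
    name: str
    trigger: str                    # action the caution is attached to
    kind: str                       # "pre" or "post"
    holds: Callable[[State], bool]  # safety condition over a state


def check_trace(trace: List[Step], cautions: List[Caution]) -> List[Tuple[int, str]]:
    """Return (step_index, caution_name) for every violated caution."""
    violations = []
    for i, step in enumerate(trace):
        for c in cautions:
            if step.action != c.trigger:
                continue
            if c.kind == "pre":
                # Pre-caution: the condition must hold before the action runs.
                ok = c.holds(step.before)
            else:
                # Post-caution (assumed semantics): the condition must hold
                # after this step or after some later step in the trace.
                ok = any(c.holds(s.after) for s in trace[i:])
            if not ok:
                violations.append((i, c.name))
    return violations


cautions = [
    Caution("stove_clear_before_on", "turn_on_stove", "pre",
            lambda s: s.get("stove_clear", False)),
    Caution("stove_off_after_cooking", "cook", "post",
            lambda s: not s.get("stove_on", False)),
]

# The agent reaches the goal (food served) but skips both cautions.
unsafe_trace = [
    Step("turn_on_stove", {"stove_clear": False, "stove_on": False},
         {"stove_clear": False, "stove_on": True}),
    Step("cook", {"stove_on": True}, {"stove_on": True}),
    Step("serve", {"stove_on": True}, {"stove_on": True, "food_served": True}),
]

# Same goal, but the stove is clear beforehand and switched off afterwards.
safe_trace = [
    Step("turn_on_stove", {"stove_clear": True, "stove_on": False},
         {"stove_clear": True, "stove_on": True}),
    Step("cook", {"stove_on": True}, {"stove_on": True}),
    Step("turn_off_stove", {"stove_on": True}, {"stove_on": False}),
    Step("serve", {"stove_on": False}, {"stove_on": False, "food_served": True}),
]

unsafe_goal_reached = unsafe_trace[-1].after.get("food_served", False)
unsafe_violations = check_trace(unsafe_trace, cautions)
safe_violations = check_trace(safe_trace, cautions)
```

Both traces end with `food_served` true, so a check on the final state alone would not tell them apart. Only the step-by-step check flags the first trace, which is the distinction that process-oriented evaluation is meant to capture.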