underspecification: The degree to which a system accomplishes a goal without the operator defining the specific steps or methods used to achieve it
directness of impact: The degree to which a system's actions affect the real world without human mediation or approval (human-in-the-loop)
goal-directedness: The degree to which a system acts to optimize a quantifiable objective function (e.g., reward maximization in RL)
long-term planning: The capability of a system to make sequences of decisions that depend on each other over an extended time horizon
agentic: Possessing the characteristics of an agent; specifically in this paper, having high degrees of underspecification, directness of impact, goal-directedness, and long-term planning
FATE: Fairness, Accountability, Transparency, and Ethics—a field of research focused on the societal impacts of algorithmic systems
RL: Reinforcement Learning—a type of ML where agents learn to make decisions by receiving rewards or penalties
LLM: Large Language Model—a deep learning model trained on vast amounts of text to generate human-like language
ADM: Automated Decision-Making—systems that make decisions or enact policies without human intervention
emergent agency: Agentic behaviors (like planning or deception) that arise implicitly from training on large datasets or simple objectives, rather than being explicitly programmed