Capability Profile (M, T): The abstraction of an agent used for recommendation, consisting of a Backbone Model (M) and a set of Tools (T)
YAML configuration: A human-readable data serialization format used here to store executable agent specifications
Positive-only supervision: Training data where only successful or high-quality interactions are recorded, implying preference without explicit negative labels
Implicit feedback: Inferences about user preference drawn from observed actions (like selection or successful execution) rather than explicit ratings
Narrative query: A free-form natural language description of a task or intent (e.g., 'Help me plan a trip to Tokyo') as opposed to a keyword search
Compositional Agents: Agents constructed by explicitly combining a specific LLM backbone with a specific set of tools to solve a complex task
TwoTower model: A neural network architecture for retrieval where query and item are processed by separate encoders, and their similarity is computed via dot product [implied context from abstract]
Coreset: A small, weighted subset of a dataset that approximates the properties of the full dataset, used here to select representative queries