CoT: Chain-of-Thought—a prompting technique where models generate intermediate reasoning steps before the final answer
sLM: Small Language Model, typically a model with fewer than 7 billion parameters
Agentic Behavior: The ability of an AI to autonomously reason, plan, and execute actions (like running code or searching) to solve a task
Trajectory: A sequence of interactions consisting of thoughts, actions, and environmental observations (outputs from tools)
CodeAct: A framework where LLMs use executable code (e.g., Python) as their primary form of action/tool use
First-Thought Prefix (FTP): A proposed method where the initial reasoning step from a CoT prompt is forced as the prefix for an agent's first thought to align behavior
Self-Consistent Action Generation (SAG): A test-time inference method that samples multiple action sequences, filters out execution errors, and selects the most consistent result via voting
RAG: Retrieval-Augmented Generation—providing models with external documents to ground their answers
LoRA: Low-Rank Adaptation, a parameter-efficient fine-tuning technique that freezes the pretrained weights and trains small low-rank update matrices added to selected layers
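
The three steps named in the SAG entry above (sample, filter out execution errors, vote) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the helper names run_snippet and self_consistent_action are hypothetical, the candidate actions are hard-coded strings standing in for sampled model outputs, and voting by exact match on the execution output is an assumption about what "most consistent" means.

```python
from collections import Counter
from typing import Callable, Iterable, Optional, Tuple


def run_snippet(code: str) -> str:
    """Hypothetical executor: run a Python snippet and return repr() of
    the variable `result` it is expected to define."""
    namespace: dict = {}
    exec(code, namespace)  # any exception here marks the candidate as failed
    return repr(namespace["result"])  # a missing `result` raises KeyError


def self_consistent_action(
    candidates: Iterable[str],
    execute: Callable[[str], str] = run_snippet,
) -> Optional[Tuple[str, str]]:
    """Return (action, output) for the most common execution output,
    or None if every candidate fails to execute."""
    outcomes = []
    for code in candidates:
        try:
            outcomes.append((code, execute(code)))
        except Exception:
            continue  # filter step: drop candidates that raise

    if not outcomes:
        return None

    # Voting step (assumed): majority vote over exact output strings.
    votes = Counter(output for _, output in outcomes)
    winning_output, _ = votes.most_common(1)[0]

    # Return the first sampled action that produced the winning output.
    for code, output in outcomes:
        if output == winning_output:
            return code, output
    return None


# Stand-ins for actions sampled from a model for "sum the integers 1 to 10".
samples = [
    "result = sum(range(1, 11))",   # correct
    "result = 10 * 11 // 2",        # correct, different code
    "result = sum(range(1, 10))",   # off-by-one, still executes
    "result = undefined_name + 1",  # raises NameError, filtered out
]
action, output = self_consistent_action(samples)
```

Here the failing candidate is discarded before voting, and the two candidates that agree on the same output outvote the off-by-one answer. A real agent would replace run_snippet with a sandboxed executor, since calling exec on model-generated code is unsafe.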