Tool Encapsulation: The process of converting raw, text-based API documentation into a structured, callable code function (e.g., a Python function def)
Tool Programming: A method where the LLM solves tasks by writing executable code (programs) rather than just generating text or JSON, allowing for loops and variable manipulation
AST: Abstract Syntax Tree—a tree representation of the syntactic structure of source code, used here to check if generated functions are syntactically valid
ReAct: Reason+Act—a prompting technique where models alternate between reasoning traces and action generation (tool calls)
Pass Rate: The percentage of test cases (queries) that are successfully solved by the model
Integration Verification: A proposed validation step where the LLM generates test inputs (potentially using other tools) to run a newly created function and ensure it works before adding it to the library
SFT: Supervised Fine-Tuning—training a model on a specific dataset to improve its performance on a target task
JSON: JavaScript Object Notation—a standard text-based format for representing structured data, commonly used in prior work for formatting tool calls