DBT: Data Build Tool—a framework for managing data transformations and analytics engineering code
Schema Linking: The process of mapping natural language terms in a question to specific database tables and columns
BigQuery: A fully managed, serverless enterprise data warehouse offered by Google Cloud
Snowflake: A cloud-based data warehousing platform with its own specific SQL dialect
CTE: Common Table Expression—a temporary named result set in SQL used to simplify complex queries
Dialect: A specific implementation of SQL (e.g., PostgreSQL, BigQuery Standard SQL) with unique functions and syntax
Nested Schema: A schema in which a single column holds structured data, such as arrays, records, or JSON objects, rather than a scalar value
SFT: Supervised Fine-Tuning—training a model on labeled examples
Gold SQL: The ground-truth SQL query written by human experts to solve a benchmark task
Execution Accuracy (EX): The proportion of tasks for which executing the predicted SQL query returns the same result as executing the Gold SQL query
Success Rate (SR): The proportion of tasks where the agent's final answer matches the ground truth (used for the agentic setting)
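
To make the Execution Accuracy definition concrete, here is a minimal sketch in Python using the standard-library `sqlite3` module. The `orders` table, the query pairs, and the helper names `run_query` and `execution_accuracy` are hypothetical and chosen only for illustration. The comparison rule (order-insensitive, exact row match) is also an assumption; an actual benchmark evaluator may treat column order, row order, or floating-point tolerance differently.

```python
import sqlite3
from collections import Counter


def run_query(conn, sql):
    """Return the rows of `sql` as an order-insensitive multiset, or None on error."""
    try:
        return Counter(conn.execute(sql).fetchall())
    except sqlite3.Error:
        return None


def execution_accuracy(conn, pairs):
    """Fraction of (predicted, gold) SQL pairs whose execution results match."""
    hits = 0
    for predicted_sql, gold_sql in pairs:
        gold_rows = run_query(conn, gold_sql)
        pred_rows = run_query(conn, predicted_sql)
        # A prediction that fails to execute counts as a miss.
        if pred_rows is not None and pred_rows == gold_rows:
            hits += 1
    return hits / len(pairs)


# Hypothetical toy database.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (id INTEGER, region TEXT, amount REAL);
    INSERT INTO orders VALUES (1, 'EU', 10.0), (2, 'EU', 5.0), (3, 'US', 7.0);
""")

pairs = [
    # Different SQL text (the prediction uses a CTE), same result: a hit.
    ("WITH eu AS (SELECT amount FROM orders WHERE region = 'EU') "
     "SELECT SUM(amount) FROM eu",
     "SELECT SUM(amount) FROM orders WHERE region = 'EU'"),
    # Wrong filter, so the result differs: a miss.
    ("SELECT COUNT(*) FROM orders WHERE region = 'EU'",
     "SELECT COUNT(*) FROM orders WHERE region = 'US'"),
    # Nonexistent column, so execution fails: a miss.
    ("SELECT total FROM orders", "SELECT amount FROM orders"),
]

ex = execution_accuracy(conn, pairs)
print(f"EX = {ex:.3f}")
```

Because the metric compares execution results rather than query text, the first pair counts as correct even though the two queries are written differently. Success Rate can be computed with the same hit-counting loop, comparing each agent's final answer against the ground truth instead of comparing query results.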