Schema Linking: The process of identifying which words in a natural language question correspond to specific tables and columns in a database schema
CoT: Chain-of-Thought—a prompting technique where the model is encouraged to generate intermediate reasoning steps before the final answer
Auto-CoT: Automatically generated Chain-of-Thought reasoning paths, created here by matching SQL elements to question phrases via similarity
ICL: In-Context Learning—teaching an LLM a task at inference time by providing examples in the prompt without updating model weights
Spider: A large-scale, complex, cross-domain semantic parsing and text-to-SQL dataset
EM: Exact Match accuracy, which measures whether the structure of the predicted SQL exactly matches that of the ground-truth query
EX: Execution Accuracy, which measures whether the predicted SQL returns the same result as the ground-truth query when run on the database
TS: Test-Suite Accuracy, a stricter version of Execution Accuracy that runs the SQL on multiple database instances to prevent false positives (incorrect queries that happen to return the correct result on a single database)