SymGen: Symbolically Grounded Generation—the proposed method of prompting LLMs to output symbolic references to data fields
Jinja: A templating language for Python; SymGen uses Jinja-like syntax (e.g., {{ variable }}) to embed data references
Hallucination: When an LLM generates text that is factually incorrect or not grounded in the source context
Data-to-text: The task of generating natural language descriptions from structured data (tables, JSON, etc.)
Provenance: The origin or source of a piece of information; SymGen provides provenance by linking text spans to data fields
Zero-shot/Few-shot: Prompting the model with no examples (zero-shot) or a few examples (few-shot) of the task
BLEU: Bilingual Evaluation Understudy—a metric for evaluating the quality of machine-generated text by comparing it to reference text
ROUGE: Recall-Oriented Understudy for Gisting Evaluation—a set of metrics used to evaluate automatic summarization and machine translation
BERTScore: An evaluation metric that computes a similarity score for each token in the candidate sentence with each token in the reference sentence using BERT embeddings
CoT: Chain-of-Thought—a prompting technique where the model generates intermediate reasoning steps
PAL: Program-Aided Language models—a method that offloads reasoning steps to a Python interpreter