_comment: REQUIRED: Define ALL technical terms, acronyms, and method names used ANYWHERE in the entire summary. After drafting the summary, perform a MANDATORY POST-DRAFT SCAN: check every section individually (Core.one_sentence_thesis, evaluation_highlights, core_problem, Technical_details, Experiments.key_results notes, Figures descriptions and key_insights). HIGH-VISIBILITY RULE: Terms appearing in one_sentence_thesis, evaluation_highlights, or figure key_insights MUST be defined—these are the first things readers see. COMMONLY MISSED: PPO, DPO, MARL, dense retrieval, silver labels, cosine schedule, clipped surrogate objective, Top-k, greedy decoding, beam search, logit, ViT, CLIP, Pareto improvement, BLEU, ROUGE, perplexity, attention heads, parameter sharing, warm start, convex combination, sawtooth profile, length-normalized attention ratio, NTP. If in doubt, define it.
LLM: Large Language Model—a probabilistic model trained to generate text
computable function: A function effectively calculable by an algorithm (e.g., a Turing machine)
computable set: A set whose membership can be decided by a computable function
hallucination: In this paper's formal definition: any instance where the LLM's output differs from the unique output of the ground truth computable function
total computable function: A computable function that is defined for all possible inputs
Halting Problem: The problem of determining, from a description of an arbitrary computer program and an input, whether the program will finish running or continue to run forever (undecidable)
learning theory: A field of mathematics and computer science analyzing the capabilities and limitations of learning algorithms
computably enumerable: A set whose members can be listed by an algorithm
SFT: Supervised Fine-Tuning—training a model on labeled examples
S: The set of all finite-length strings from the alphabet
formal world: A simplified theoretical environment defined by the authors where ground truth is strictly defined by computable functions to allow for mathematical proofs