Gold-Angluin framework: A classical learning theory model studying whether an algorithm can infer a language (set of strings) after seeing a sequence of examples
Language Identification in the Limit: A learning criterion where an algorithm must converge to the correct language index after a finite number of steps, though the learner doesn't know when it has converged
positive examples: Data points that are valid members of the target language (e.g., factually correct statements)
negative examples: Data points explicitly labeled as NOT belonging to the target language (e.g., hallucinations or incorrect statements)
RLHF: Reinforcement Learning with Human Feedback—a method using human rankings or labels (often including negative examples) to align models
super-finite collection: A collection of languages containing all finite sets of the domain plus at least one infinite set
enumeration: An infinite sequence listing all elements of a language, potentially with repetitions
membership query: The ability of the detector to ask if a specific element x belongs to the generator's output set G