AGI: Artificial General Intelligence—systems that demonstrate broad capabilities of intelligence, including reasoning, planning, and learning, at or above human-level
LLM: Large Language Model—neural network models trained on massive text corpora to predict the next word in a sequence
TiKZ: A language for creating graphics in LaTeX, used here to test the model's ability to generate visual content via code
LeetCode: A platform for technical interview preparation, used to evaluate coding proficiency
Hallucination: The generation of factually incorrect or nonsensical information by an LLM
Theory of Mind: The ability to attribute mental states—beliefs, intents, desires, emotions, knowledge—to oneself and others
Autoregressive: A property of models that generate output one token at a time, using previously generated tokens as context for the next
Zero-shot: The ability of a model to perform a task without having seen explicit examples of that specific task during training