Society of Mind: A theory by Marvin Minsky proposing that intelligence emerges from the interaction of many simple, non-intelligent agents; here applied to LLM instances
Hallucination: A phenomenon where an LLM generates text that is factually incorrect or nonsensical but appears plausible
Chain-of-Thought: A prompting technique that encourages the model to generate intermediate reasoning steps before the final answer
Self-Reflection: A technique in which a single model generates an output and is then prompted to critique and refine it
MMLU: Massive Multitask Language Understanding, a benchmark measuring knowledge across 57 subjects such as math, history, and law
GSM8K: Grade School Math 8K, a dataset of 8.5K high-quality, linguistically diverse grade school math word problems
PGN notation: Portable Game Notation, a standard plain-text format for recording chess games
Stockfish: A strong open-source chess engine used here to score the quality of predicted moves, reported as a pawn-score advantage