Lean 4: A functional programming language and interactive theorem prover used for formalizing mathematics
tactic: A command in Lean that transforms the current proof state (goal) into simpler subgoals
pass@k: An evaluation metric measuring the probability that at least one correct solution is generated out of k samples
autoformalization: The process of automatically translating natural language mathematics into formal code (e.g., Lean)
SFT: Supervised Fine-Tuning—training a model on labeled examples before applying reinforcement learning
RL: Reinforcement Learning—training an agent (here the LLM) to maximize a reward signal (proof correctness)
BFS: Best-First Search—a search algorithm that explores a graph by expanding the most promising nodes first
MCTS: Monte Carlo Tree Search—a heuristic search algorithm for decision processes, often used in game playing
CoT: Chain-of-Thought—a prompting technique encouraging models to generate intermediate reasoning steps
KL divergence: A measure of how one probability distribution differs from a second, reference probability distribution