STORM: Synergistic Theorem and f ORmula Mining—the multi-agent framework proposed in this paper for extracting and generating math problems
CoT: Chain-of-Thought—a prompting technique where models generate intermediate reasoning steps before the final answer
ATP: Automated Theorem Proving—using formal logic and computer programs (like Lean or Isabelle) to verify mathematical proofs
SFT: Supervised Fine-Tuning—training a pre-trained model on a specific labeled dataset to improve performance on a target task
Reasoning Density: A measure of how many logical steps, heuristic cues, and trial-and-error processes are contained within a derivation
MLLM: Multi-modal Large Language Model—an AI model capable of processing both text and images (used here for LaTeX extraction)
DPO: Direct Preference Optimization—a method for aligning language models to human preferences (mentioned in the context of a source paper example)
GPT-o1-Pro: A specific high-capability version of the OpenAI o1 model series used as the backbone for the agents