LRM: Large Reasoning Model—a class of LLMs optimized for reasoning via test-time compute scaling (e.g., o1, o3-mini, R1)
RPM: Raven's Progressive Matrices—a nonverbal IQ test involving completing a pattern in a 3x3 grid of images
I-RAVEN-X: An extension of the I-RAVEN dataset that tests generalization to longer rules and larger attribute ranges
Confounding attributes: Randomly sampled visual properties (e.g., background textures) included in the input that are irrelevant to the underlying logic rule
Oracle perception: The unrealistic assumption that a reasoning model has access to perfect, noise-free symbolic descriptions of visual inputs
Abductive reasoning: A logical inference method that seeks the simplest explanation (rule) for a set of observations
NeSy: Neuro-Symbolic—AI systems combining neural networks (for perception/learning) with symbolic logic (for reasoning)
PMF: Probability Mass Function—a distribution representing the probability of a discrete random variable taking specific values
SNR: Signal-to-Noise Ratio—ratio of useful information (reasoning attributes) to irrelevant data (confounders)