SNR: Signal-to-Noise Ratio—the ratio of the magnitude of the useful semantic feature (signal) to the task-irrelevant variation (noise)
InfoMax: A contrastive learning objective that maximizes mutual information between views (often implemented via InfoNCE loss)
Stop-gradient: An operation that prevents gradients from flowing through one branch of the network, used to stabilize training (e.g., in SimSiam or bootstrapped methods)
ReLU network: A neural network using Rectified Linear Units (f(x) = max(0, x)) as activation functions, allowing non-linear feature interactions
OOD: Out-of-Distribution—test data that comes from a different distribution than the training data (e.g., different noise patterns or spurious correlations)
Signal Learning: The process where the network weights align with the true semantic vector (mu)
Noise Memorization: The process where the network weights align with specific random noise vectors (xi) present in the training samples
Spurious correlation: A connection between a feature (e.g., background color) and a label that holds in training data but not in general (e.g., test data)
ColoredMNIST: A variant of the MNIST digit dataset where digits are colored specifically to introduce spurious correlations (e.g., 0 is usually red in training, but green in testing)