Aleatoric Uncertainty: Uncertainty inherent in the data itself (e.g., ambiguity, noise) that cannot be reduced with more training data.
Epistemic Uncertainty: Uncertainty due to lack of knowledge or data (e.g., unseen distribution), which can be reduced with more training.
Loss Prediction: A method where a model predicts its own loss for a given input, using the predicted loss as a proxy for uncertainty.
R-AUROC: Representation AUROC—a metric measuring if uncertainty estimates can distinguish between correct and incorrect representations (via 1-NN classification).
Stopgrad: A distinct operation in a computational graph that stops gradients from flowing backward during backpropagation, used here to isolate the backbone from the uncertainty head.
ImageNet-21k-W: ImageNet-21k Winter-2021, a large-scale dataset with roughly 14 million images and 21,000 classes used for pretraining.