MAE: Masked Autoencoder—a self-supervised learning method that masks parts of an input and trains a model to reconstruct the missing parts
Sentinel-2: An optical Earth observation satellite mission providing multi-spectral imaging
Sentinel-1: A satellite mission providing Synthetic Aperture Radar (SAR) imaging, which captures surface texture and roughness independent of cloud cover
DEM: Digital Elevation Model—3D representation of terrain elevation
SAR: Synthetic Aperture Radar—active remote sensing technology using radar waves
FCMAE: Fully Convolutional Masked Autoencoder—an MAE variant using Convolutional Neural Networks (CNNs) instead of Transformers
MP-MAE: Multi-Pretext Masked Autoencoder—the proposed method extending FCMAE to reconstruct multiple modalities
MMEarth: The proposed dataset containing 1.2M locations with 12 aligned modalities
Linear Probing: Evaluating a pretrained encoder by freezing its weights and training a simple linear classifier on top
L2A/L1C: Sentinel-2 processing levels: L1C is Top-of-Atmosphere, L2A is Bottom-of-Atmosphere (atmospherically corrected)