TSFM: Time Series Foundation Model—large-scale pre-trained models adapted for forecasting tasks
SAE: Sparse Autoencoder—an unsupervised network trained to decompose dense model activations into a sparse, interpretable set of features
CRPS: Continuous Ranked Probability Score—a metric for probabilistic forecasting that measures how closely the predicted distribution matches the true observation (lower is better)
TopK SAE: A variant of SAE that enforces sparsity by keeping only the K highest activating latent features and zeroing the rest
Ablation: The process of removing or zeroing out specific components (features) of a model to assess their contribution to performance
Residual Stream: The primary vector pathway in a Transformer where layers add their outputs, serving as the main carrier of information
Chronos-T5: A specific TSFM based on the T5 language model architecture that treats time series values as quantized tokens
Level Shift: A sudden, sustained change in the mean value of a time series
ETT: Electricity Transformer Temperature—a standard benchmark dataset for time series forecasting