Federated Model Soup: An ensemble method in FL where weights of multiple historical global models are averaged to create a robust model.
Flat Minima: A region in the loss landscape where the loss function is relatively constant around the minimum; models in flat minima generalize better than those in sharp minima.
PFL: Personalized Federated Learning—techniques to adapt a global FL model to specific local client distributions.
Model Patching: A fine-tuning technique that interpolates weights between a reference model (global soup) and a target model (local) to retain general capabilities while adapting.
SWA: Stochastic Weight Averaging—an optimization technique that averages model weights along the trajectory of SGD to find flatter minima.
Cross-silo FL: FL setting where clients are typically organizations (e.g., hospitals) with moderate amounts of data and reliable connections, as opposed to millions of mobile devices (cross-device).
OOD: Out-of-Distribution—data samples that differ in distribution from the training data (e.g., images from a different hospital).
Hessian Eigenvalue: A mathematical measure of the curvature of the loss function; higher values indicate sharper curvature (worse generalization).