FL: Federated Learning—a decentralized ML approach where devices train locally and share updates without exposing raw data
Non-IID: Non-Independent and Identically Distributed—data distribution varies across clients (e.g., one user has only photos of cats, another only dogs)
LAR: Local Averaged Representation—the mean feature vector of all samples belonging to a specific class on a client device
Feature Extractor: The part of the neural network that maps raw inputs (images) to latent vector representations
Prediction Header: The final layers of the neural network (usually fully connected) that map representations to class probabilities
Model Heterogeneity: A scenario in FL where clients use different model architectures (e.g., different sizes/depths) suited to their hardware constraints
Logits: The raw, unnormalized prediction scores output by the final layer of a neural network before applying softmax