CKA: Centered Kernel Alignment—a similarity metric used to compare representations of neural network layers even if they have different dimensions
Model Reassembly: The process of taking layers from different trained models and combining them to form a new functional model
Stitching Layer: A lightweight layer (usually linear + activation) inserted between two disparate network layers to align their feature dimensions
Heterogeneous FL: Federated Learning where clients have different model architectures (e.g., different depths, layer types)
Knowledge Distillation: Training a 'student' model to mimic the outputs (logits) of a 'teacher' model, used here to transfer knowledge from the reassembled model to the client's local model
IID vs. Non-IID: Independent and Identically Distributed vs. Non-IID—refers to whether client data distributions are uniform or skewed (e.g., each client only has specific classes)
Logits: The raw, unnormalized prediction scores output by the last layer of a neural network before the softmax activation