Federated Learning (FL): A machine learning approach where multiple clients collaboratively train a model without sharing their local data, typically by aggregating local updates.
Prompt Tuning: A Parameter-Efficient Fine-Tuning (PEFT) method where only a small continuous 'soft prompt' matrix is optimized while the large language model remains frozen.
FedAvg: Federated Averaging—an algorithm where clients perform multiple local update steps before sending model weights to the server for aggregation.
FedSGD: Federated Stochastic Gradient Descent—an algorithm where clients compute gradients on a single batch and send them to the server for a single update step.
Personalization: Adapting a global model to perform well on a specific client's local data distribution.
Robustness: In this context, the ability of the personalized model to retain performance on the global distribution (not forgetting general knowledge).
ROUGE-L: A metric for evaluating text generation that measures the longest common subsequence between the generated text and the reference text.
Catastrophic Forgetting: The tendency of a neural network to completely and abruptly forget previously learned information upon learning new information.
SNI: Super-NaturalInstructions—a large-scale benchmark dataset of NLP tasks used to construct the federated partitions.
Soft Prompt: A learnable matrix of continuous embeddings prepended to the input text embeddings, used to steer the frozen LLM.