Machine Unlearning: The process of removing the influence of specific training data from a machine learning model without retraining from scratch
Contrastive Learning: A learning technique that creates a meaningful latent space by pulling similar samples close together and pushing dissimilar samples apart
Latent Space: A vector space where the model represents input data; semantically similar inputs should be close together in this space
Anchor Sample: The specific data point currently being processed (or unlearned) in a contrastive learning framework
SISA: Sharding, Isolation, Slicing, and Aggregation—an exact unlearning method that trains multiple sub-models on disjoint data shards
Exact Unlearning: Guarantees the model is mathematically identical to one retrained from scratch
Approximate Unlearning: Updates model parameters to statistically approximate the state of a retrained model, often faster but with weaker theoretical guarantees
NER: Named Entity Recognition—identifying and classifying key information (names, organizations, locations) in text