MMoE: Multi-gate Mixture-of-Experts—an MTL architecture using gating networks to combine shared experts for different tasks
PLE: Progressive Layered Extraction—an MTL architecture that separates shared and task-specific experts to avoid negative transfer
Cold Start: The challenge of recommending items or serving users with little to no prior interaction history
PVCTR: Page View Click-Through Rate—the number of clicks divided by the number of page views
Jensen-Shannon Divergence: A symmetric measure of similarity between two probability distributions, used here to align domain embeddings
LLM: Large Language Model—used here (e.g., BERT, ChatGLM) to extract semantic features from text
S&R: Search and Recommendation—distinct but related tasks often modeled separately in industrial systems
AUC: Area Under the ROC Curve—a performance metric for classification tasks indicating the model's ability to distinguish between classes