LRM: Large Recommendation Models—industrial-scale systems designed to handle massive user-item interaction data
MLLM: Multimodal Large Language Models—AI models capable of processing and generating information from multiple modalities like text and images
CTR: Click-Through Rate—the ratio of users who click on a specific link to the total number of users who view a page, advertisement, or email
MHSA: Multi-Head Self Attention—a mechanism in Transformers that allows the model to jointly attend to information from different representation subspaces
LBS: Location-Based Services—services offered through a mobile device that take into account the device's geographical location
HSTU: Hierarchical Sequential Transduction Units—a specific transformer-based component used for modeling long sequences efficiently in recommendation systems
RPM: Revenue per Mile—revenue generated per 1,000 impressions (often used interchangeably with RPM or eCPM in advertising contexts)
CLIP: Contrastive Language-Image Pre-training—a model trained to predict which caption goes with which image, learning aligned multimodal representations