Term IDs (TIDs): Structured item identifiers, each consisting of a set of semantically rich, standardized textual keywords drawn from the LLM's native vocabulary
Semantic IDs (SIDs): Discrete codes generated by quantizing item embeddings (e.g., via RQ-VAE), used in prior work like TIGER to represent items
Context-aware Term Generation (CTG): A process to generate TIDs by prompting an LLM with both the target item's metadata and the metadata of its nearest neighbors to ensure consistency and discriminability
Integrative Instruction Fine-tuning (IIFT): A multi-task training paradigm enabling the LLM to learn both 'Generative Term Internalization' (mapping metadata to TIDs) and 'User Behavior Sequence Prediction'
Elastic Identifier Grounding (EIG): A retrieval mechanism during inference that attempts exact string matching first, then falls back to a structural score based on term overlap to map generated tokens to real items
Direct Hit Rate (DHR): The proportion of successful retrievals handled by the Direct Mapping track within EIG (exact matches)
Valid Rate (VR): The proportion of generated identifiers that correspond to an item in the candidate library
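
The two-track lookup described under EIG, and the two rates defined after it, can be illustrated with a minimal Python sketch. Everything here is an assumption for illustration: the toy catalog, the comma-separated TID format, the function names, the use of Jaccard overlap as the "structural score", and the reading of VR as the share of generated identifiers that exactly match a library entry. The glossary does not specify any of these details.

```python
from typing import Dict, FrozenSet, List, Optional, Tuple

# Hypothetical catalog: item id -> TID, written as a comma-separated term string.
CATALOG: Dict[str, str] = {
    "item_1": "wireless, earbuds, noise-cancelling",
    "item_2": "wireless, speaker, waterproof",
    "item_3": "running, shoes, lightweight",
}


def parse_terms(tid: str) -> FrozenSet[str]:
    """Split a TID string into a normalized set of terms."""
    return frozenset(t.strip().lower() for t in tid.split(",") if t.strip())


def ground(generated: str, catalog: Dict[str, str]) -> Tuple[Optional[str], str]:
    """Map a generated identifier to an item and report which track matched."""
    # Track 1 (direct mapping): exact string match against the library.
    direct_index = {tid: item_id for item_id, tid in catalog.items()}
    key = generated.strip()
    if key in direct_index:
        return direct_index[key], "direct"

    # Track 2 (fallback): score every item by term overlap.
    # Jaccard similarity is an assumed stand-in for the structural score.
    terms = parse_terms(generated)
    best_id, best_score = None, 0.0
    for item_id, tid in catalog.items():
        item_terms = parse_terms(tid)
        union = terms | item_terms
        score = len(terms & item_terms) / len(union) if union else 0.0
        if score > best_score:
            best_id, best_score = item_id, score
    if best_id is None:
        return None, "none"  # no term overlap with any item
    return best_id, "structural"


def direct_hit_rate(tracks: List[str]) -> float:
    """Share of successful retrievals that came from the direct track."""
    successful = [t for t in tracks if t != "none"]
    if not successful:
        return 0.0
    return sum(t == "direct" for t in successful) / len(successful)


def valid_rate(outputs: List[str], catalog: Dict[str, str]) -> float:
    """Share of generated identifiers that exactly match a library TID
    (one possible reading of VR, assumed here)."""
    if not outputs:
        return 0.0
    library = set(catalog.values())
    return sum(o.strip() in library for o in outputs) / len(outputs)


OUTPUTS = [
    "wireless, earbuds, noise-cancelling",  # exact match
    "wireless, speaker",                    # partial match, needs fallback
    "garden, hose",                         # no overlap with any item
]
results = [ground(o, CATALOG) for o in OUTPUTS]
dhr = direct_hit_rate([track for _, track in results])
vr = valid_rate(OUTPUTS, CATALOG)
```

On this toy data the first output is a direct hit on item_1, the second falls back to item_2 through term overlap, and the third stays unmatched, so DHR is 0.5 (one direct hit out of two successful retrievals) and VR, under the assumed reading, is 1/3.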