Persona Grounding: Labels indicating which specific persona sentence or attribute a dialogue utterance is based on, allowing models to learn explicit associations
PersonaChat: A benchmark dataset of 10.9K English dialogues where paired crowdworkers chat while adopting specific persona descriptions
CVAE: Conditional Variational Autoencoder, a generative model used here to infer implicit persona information as latent variables from the dialogue history when explicit profiles are missing
NLI: Natural Language Inference—a classification task determining if one sentence entails, contradicts, or is neutral to another; used here to check if a generated response contradicts the agent's persona
Multi-Session Chat (MSC): A dataset that extends PersonaChat so that the same speakers converse over multiple sessions, requiring the agent to recall information from previous interactions
Zero-shot / Few-shot: Evaluating a model's ability to perform a task (here, personalized generation) after seeing no or only a few task-specific examples
Out-of-Distribution (OOD): Scenarios where the model encounters personas or topics during testing that were not present in the training data
BoB: BERT-over-BERT, a cited model architecture that uses NLI to improve consistency between the response and the persona
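
The NLI-based consistency check described in the NLI and BoB entries can be sketched as a simple filter. This is a minimal illustration, not the BoB architecture or any specific paper's method: the `NLIClassifier` interface, the function `contradicted_persona`, and the rule-based `toy_nli` stand-in are all hypothetical names introduced here, and a real system would plug in a trained NLI model in place of the stub.

```python
from typing import Callable, List

# Labels follow the usual three-way NLI scheme.
ENTAILMENT, NEUTRAL, CONTRADICTION = "entailment", "neutral", "contradiction"

# Hypothetical interface: (premise, hypothesis) -> one of the three labels.
NLIClassifier = Callable[[str, str], str]


def contradicted_persona(
    persona: List[str], response: str, nli: NLIClassifier
) -> List[str]:
    """Return the persona sentences that the response contradicts."""
    return [p for p in persona if nli(p, response) == CONTRADICTION]


def _norm(text: str) -> str:
    # Lowercase, trim surrounding punctuation, and collapse whitespace.
    return " ".join(text.lower().strip(" .!?").split())


def toy_nli(premise: str, hypothesis: str) -> str:
    """Placeholder classifier for illustration only; NOT a real NLI model.

    It recognizes identical sentences as entailment and a sentence paired
    with its "do not" negation as a contradiction; everything else is neutral.
    """
    p, h = _norm(premise), _norm(hypothesis)
    if p == h:
        return ENTAILMENT
    if h.replace(" do not", "") == p or p.replace(" do not", "") == h:
        return CONTRADICTION
    return NEUTRAL


persona = ["I have a dog.", "I like hiking."]
conflicts = contradicted_persona(persona, "I do not have a dog.", toy_nli)
```

A candidate response with a non-empty `conflicts` list could be rejected or re-ranked; how the signal is actually used (filtering, re-ranking, or as a training objective) varies by system and is left open here.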