persona cue: The specific method used to introduce a sociodemographic profile to the model (e.g., a name, a system prompt, or a chat history)
external validity: The extent to which research findings (here, bias measurements) generalize to real-world settings (real user interactions)
implicit identity markers: Subtle cues in language use or conversation history that signal demographics without explicitly stating them
Spearman correlation: A statistical measure of rank correlation, used here to check if different cues lead to similar ranking of model outputs
LLM-as-a-judge: Using an LLM (here Llama-3.3-70B) to evaluate the quality or stance of text generated by another LLM
Tukey-Kramer test: A post-hoc statistical test used after ANOVA to determine exactly which means differ significantly from each other, accounting for unequal sample sizes
AITA: Am I The Asshole? โ a dataset based on Reddit posts where users describe a conflict and ask for moral judgment