RLHF: Reinforcement Learning from Human Feedback—fine-tuning AI models using human preferences
Mechanism Design: A field of economics and game theory essentially about designing rules of a game to achieve a specific outcome (like truthfulness) from strategic players
Social Choice Theory: A theoretical framework for analyzing how to combine individual opinions/preferences into a collective decision
Probabilistic Opinion: A feedback format where users assign a probability distribution over answers (indicating intensity of preference) rather than just selecting one
DSIC: Dominant Strategic Incentive-Compatible—a property of a mechanism where telling the truth is always the best strategy for a participant, regardless of what others do
Social Welfare: A measure of the overall well-being or satisfaction of a group, often defined as the sum of individual utilities
Sample Complexity: The number of training samples required to learn a model to a desired level of accuracy
Homogeneous vs. Heterogeneous: Homogeneous assumes all users are the same; Heterogeneous acknowledges users have different, distinct preference functions