Foundation Models: Large-scale AI models trained on broad data that can be adapted to a wide range of downstream tasks
Tetradic Alignment: A proposed alignment framework involving four parties: the AI agent, the user, the developer, and society
Anthropomorphism: The attribution of human traits, emotions, or intentions to non-human entities like AI assistants
Sociotechnical Speculative Ethics: An approach combining empirical knowledge of current tech with foresight methods to ethically evaluate future technologies
Red Teaming: The practice of rigorously challenging a system to identify vulnerabilities, safety flaws, or harmful outputs
Narrow AI: AI systems designed to perform a specific task (e.g., speech recognition) rather than general reasoning
Alignment: The process of ensuring AI systems behave in accordance with intended goals, values, and ethical principles