SFT: Supervised Fine-Tuning—training a pre-trained model on labeled data to improve performance on specific tasks
RLHF: Reinforcement Learning from Human Feedback—a method to align LLMs with human intent by using rewards derived from human preferences
NLG: Natural Language Generation—the subfield of AI focused on generating natural language text
hallucination: The generation of content by an LLM that deviates from user input, contradicts previously generated context, or misaligns with established world knowledge
input-conflicting hallucination: When LLM generated content deviates from the source input provided by users (e.g., misinterpreting instructions or summarizing incorrectly)
context-conflicting hallucination: When LLM generated content conflicts with information it previously generated within the same conversation
fact-conflicting hallucination: When LLM generated content contradicts established world knowledge or cannot be verified by it