atomic fact: A short sentence conveying exactly one piece of information, used as the fundamental unit of evaluation
InstructGPT: An OpenAI model (text-davinci-003) trained to follow instructions
PerplexityAI: A commercial conversational search engine that generates answers with citations based on live web search results
FActScore: Factual precision in Atomicity Score, a metric that reports the percentage of atomic facts in a generation that are supported by a given knowledge source
NLI: Natural Language Inference, the task of deciding whether a hypothesis is true (entailed), false (contradicted), or undetermined (neutral) given a premise
LMsubj: The Language Model acting as the 'Subject' being evaluated (e.g., ChatGPT, Vicuna)
Recall: The fraction of relevant instances that were retrieved; FActScore explicitly measures precision (correctness), not recall (completeness)
abstain: To decline to answer a prompt (e.g., 'I don't know'), which avoids generating false information
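
The FActScore, atomic fact, and abstain entries above can be tied together in a short sketch. It assumes each response has already been split into atomic facts, that an abstained response is represented as `None` and left out of the average, and that support is decided by a caller-supplied function. The names `response_precision`, `factscore`, and `is_supported` are hypothetical, and the set lookup below is a toy stand-in for a real knowledge source, not the metric's actual validation procedure.

```python
from typing import Callable, Optional, Sequence


def response_precision(
    atomic_facts: Sequence[str],
    is_supported: Callable[[str], bool],
) -> float:
    """Fraction of one response's atomic facts that the knowledge source supports."""
    if not atomic_facts:
        raise ValueError("a non-abstained response needs at least one atomic fact")
    supported = sum(1 for fact in atomic_facts if is_supported(fact))
    return supported / len(atomic_facts)


def factscore(
    responses: Sequence[Optional[Sequence[str]]],
    is_supported: Callable[[str], bool],
) -> Optional[float]:
    """Mean per-response precision over the responses that did not abstain.

    Assumption: an abstention is encoded as None and excluded from the mean.
    Returns None when every response abstained.
    """
    answered = [facts for facts in responses if facts is not None]
    if not answered:
        return None
    total = sum(response_precision(facts, is_supported) for facts in answered)
    return total / len(answered)


# Toy knowledge source: a fact counts as supported only if it appears verbatim.
knowledge = {"a", "c", "d", "e"}
responses = [
    ["a", "b"],            # 1 of 2 facts supported
    None,                  # abstained, excluded from the average
    ["c", "d", "e", "f"],  # 3 of 4 facts supported
]
score = factscore(responses, lambda fact: fact in knowledge)
print(score)  # mean of 0.5 and 0.75
```

Because the score is a precision, a response that states few but correct facts scores as well as a long correct one; nothing here measures recall.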