Decompose-then-verify: An evaluation strategy where long text is broken into individual claims ('atoms') which are then independently checked for truthfulness
Atomic facts: Simple, indivisible statements (e.g., 'Obama was born in Hawaii') used in standard evaluation
Molecular facts: Complex facts retaining conditionals, modifiers, or dependency structures (e.g., 'If X happens, do Y') essential for medical advice
FActScore: A popular framework for evaluating factuality by breaking text into atomic claims, originally designed for biographies
AskDocsAI: A new dataset introduced in this paper containing medical Q&A pairs from Reddit's r/AskDocs with LLM-augmented doctor answers
MedCorp: A medical retrieval corpus compiled in this paper consisting of PubMed, StatPearls, and Textbooks
0-claim rate: The percentage of generated responses for which the evaluation system fails to extract a single verifiable claim
Internal Knowledge: Verification using the model's own pre-trained parameters (parametric knowledge) without external retrieval
MedRAG: A specific retrieval-augmented generation toolkit used here to fetch evidence from medical corpora
MedCPT: A medical-specific dense retriever model used to rank passages from MedCorp