Decompose-then-verify: A framework where long text is broken into atomic claims, each is verified individually, and scores are aggregated
Atomic claim: A single, indivisible statement of fact extracted from a longer text
Chunking: Breaking text into segments (chunks) of a specific size (e.g., number of sentences) for processing
Pre-verification: Using the LLM's internal knowledge to verify claims immediately after extraction, skipping external search for high-confidence cases
BM2.5: A probabilistic information retrieval function used to rank documents based on query term frequency
Logprobs: Log-probabilities of tokens generated by an LLM, used here to measure the model's confidence in its pre-verification label
Snippets: Short text previews (20-40 words) returned by search engines like Google, often used as insufficient evidence in prior work