Atomicity: A metric measuring whether a sub-claim represents a single factual unit (one relation between subject and object) rather than multiple facts.
Sufficiency: A metric measuring whether a sub-claim is unambiguous and retains enough context from the original claim to be verified independently.
Fabrication: A metric checking if the decomposition process introduced new, made-up information not present in the original claim.
Coverage: A metric assessing whether the list of sub-claims captures all factual assertions present in the original complex claim.
Redundancy: A metric checking if the generated sub-claims contain repetitive information, which wastes compute and skews error rates.
CoverBench: The source dataset (Jacovi et al., 2024) providing the original complex claims used to build FactLens.
GPT-4o-mini: The specific LLM used as the 'verifier' model in this paper to check the truthfulness of sub-claims against evidence.