| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| VeriFact shows high agreement with human ground truth, particularly on LLM-generated summaries. | ||||
| VeriFact-BHC (LLM-written) | Agreement with Ground Truth (Sentence propositions) | 84.7 | 92.7 | +8.0 |
| VeriFact-BHC (LLM-written) | Agreement with Ground Truth (Atomic propositions) | 88.5 | 88.8 | +0.3 |
| VeriFact-BHC (Human-written) | Agreement with Ground Truth (Sentence propositions) | 66.6 | 66.0 | -0.6 |