| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| CEC-Zero outperforms both supervised BERT baselines and LLM fine-tunes across aggregate benchmarks. | ||||
| Average across 9 benchmarks | F1 | Not reported as single aggregate number in text | Not reported as single aggregate number in text | +5-8 (range reported) |