| Benchmark | Metric | Baseline | This Paper | Δ (This Paper - Baseline) |
|---|---|---|---|---|
| SeCom demonstrates superior segmentation performance compared to existing unsupervised and supervised baselines. | ||||
| DialSeg711 | WindowDiff (lower is better) | 0.468 | 0.222 | -0.246 |
| DialSeg711 | WindowDiff (lower is better) | 0.252 | 0.222 | -0.030 |
| Downstream QA performance on LOCOMO shows SeCom's advantage over other memory granularities. | ||||
| LOCOMO | ROUGE-L | 20.1 | 22.3 | +2.2 |
| LOCOMO | ROUGE-L | 19.8 | 22.3 | +2.5 |
| LOCOMO | ROUGE-L | 20.5 | 22.3 | +1.8 |
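
For readers unfamiliar with the WindowDiff metric reported for DialSeg711, the following is a minimal sketch of how it is commonly computed, not the paper's evaluation code. It assumes segmentations are encoded as boundary strings (`"1"` where a segment boundary follows an utterance, `"0"` otherwise); the function name `window_diff`, the encoding, and the window size `k` are illustrative assumptions. The score is the fraction of sliding windows in which the reference and the hypothesis disagree on the number of boundaries, which is why lower values are better.

```python
def window_diff(reference: str, hypothesis: str, k: int) -> float:
    """Fraction of length-k windows whose boundary counts differ.

    Assumed encoding (hypothetical, for illustration): one character per
    utterance, "1" if a segment boundary follows it and "0" otherwise.
    """
    if len(reference) != len(hypothesis):
        raise ValueError("segmentations must have the same length")
    n = len(reference)
    if not 0 < k < n:
        raise ValueError("window size k must satisfy 0 < k < len(reference)")

    windows = n - k + 1  # number of window positions; conventions vary slightly
    errors = 0
    for i in range(windows):
        ref_count = reference[i:i + k].count("1")
        hyp_count = hypothesis[i:i + k].count("1")
        # A window is penalised once if the boundary counts disagree at all.
        if ref_count != hyp_count:
            errors += 1
    return errors / windows


if __name__ == "__main__":
    gold = "0100100000"
    print(window_diff(gold, gold, k=3))          # 0.0, identical segmentations
    print(window_diff(gold, "0000000000", k=3))  # 0.625, both boundaries missed
```

The window size `k` is usually set to about half the average reference segment length; the value used above is chosen only to keep the example small.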