| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Perplexity results confirming kNN-LM improves overall performance, though less so on the long-tail-heavy resplit data. | ||||
| WikiText-103 (Original) | Perplexity | 11.36 | 10.42 | -0.94 |
| WikiText-103 (Resplit) | Perplexity | 11.66 | 11.13 | -0.53 |