| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Initial analysis shows that naive parallel encoding causes massive failures, particularly in synthetic recall tasks. | ||||
| Synthetic Recall (Average) | Accuracy | 99.0 | 0.5 | -98.5 |
| Correlation analysis establishes the link between attention entropy and model failure. | ||||
| Across Tasks | Pearson Correlation (R) | 0.0 | 0.95 | +0.95 |
| Mitigation strategies (Sinks and Selective Attention) significantly recover performance. | ||||
| Synthetic Recall (Average) | Accuracy | 0.5 | 95.0 | +94.5 |