| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Comparative performance on Llama-3.1-8B when pruning 20% of layers (approx 6-7 layers). EntroDrop consistently outperforms similarity-based baselines. | ||||
| MMLU | Accuracy | 58.2 | 64.3 | +6.1 |
| MMLU | Accuracy | 63.7 | 64.3 | +0.6 |
| ARC-C | Accuracy | 46.2 | 50.1 | +3.9 |
| Comparative performance on Mistral-7B-v0.3 when pruning 20% of layers. | ||||
| MMLU | Accuracy | 56.8 | 59.2 | +2.4 |