| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Main comparison showing OSCAR's speed-up and performance retention against uncompressed baselines and hard compression methods. | ||||
| Average across 6 datasets | LLM Score | 43.3 | 44.6 | +1.3 |
| Average across 6 datasets | Speed-up | 1.0 | 3.3 | +2.3x |
| Average across 6 datasets | LLM Score | 42.7 | 44.6 | +1.9 |
| Scaling results demonstrating OSCAR's effectiveness on larger backbones (Mistral-24B). | ||||
| Average across 6 datasets | Speed-up | 1.0 | 5.0 | +4.0x |
| Average across 6 datasets | LLM Score | 50.1 | 51.1 | +1.0 |