| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Performance on NLU tasks using BERT-6L student and Stable Diffusion teacher. | ||||
| GLUE/SuperGLUE (Avg) | Average Score | Not reported in the paper | Not reported in the paper | +3.4% |
| Generative Reasoning | Task-specific Accuracy | Not reported in the paper | Not reported in the paper | +2.6% |
| NLU Tasks | Average Score | Not reported in the paper | Not reported in the paper | +1.4% |
| NLU Tasks | Average Score | Not reported in the paper | Not reported in the paper | +1.5% |