| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| SQL Generation: ITERGEN consistently outperforms baselines in execution accuracy across various model sizes. | ||||
| Spider | Execution Accuracy (Overall) | 32.7 | 50.7 | +18.0 |
| Spider | Execution Accuracy (Overall) | 46.4 | 47.6 | +1.2 |
| Privacy Leakage: ITERGEN eliminates privacy leaks completely compared to standard decoding. | ||||
| DecodingTrust (Enron) | Leaks (Count) | 67 | 0 | -67 |
| DecodingTrust (Enron) | Leaks (Count) | 45 | 0 | -45 |
| Vega-Lite Generation: ITERGEN improves accuracy and execution rates for data visualization code. | ||||
| NLV Corpus | Accuracy (%) | 24.69 | 30.47 | +5.78 |
| NLV Corpus | Execute (%) | 89.56 | 92.51 | +2.95 |