| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Proprietary FAQ Dataset | Top-1 Accuracy (English) | 0.77 | 0.85 | +0.08 |
| Proprietary FAQ Dataset | Top-1 Accuracy (Hinglish) | 0.68 | 0.86 | +0.18 |
| Proprietary FAQ Dataset | Similarity Score Gap (In-domain vs OOD) | 0.06 | 0.55 | +0.49 |
| Test Chat Session (91 queries) | Token Savings | 0 | 31 | +31 |