| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| User study results demonstrating the effectiveness of CoS in controlling personalization. | ||||
| User Study (Movie Summarization) | Spearman's Correlation ($\rho$) | 0 | 0.67 | +0.67 |
| Implicit Hate Classification results comparing CoS to baselines across different target groups. | ||||
| Implicit Hate Dataset (Group: Black) | Accuracy | 50 | 82 | +32 |
| Implicit Hate Dataset (Group: Immigrant) | Accuracy | 37 | 47 | +10 |
| Implicit Hate Dataset (Group: Muslim) | Accuracy | 62 | 60.5 | -1.5 |
| Ablation study on factuality to ensure CoS doesn't degrade model knowledge. | ||||
| OpenBookQA | Factuality Accuracy Drop | 0 | 4.6 | +4.6 |
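
The Δ column appears to be the plain difference "This Paper" minus "Baseline". The sketch below recomputes it from the values in the table as a consistency check. The names `rows` and `delta` are illustrative and not from the paper, and rounding to two decimals is an assumption made only to avoid floating-point noise. Note that for OpenBookQA the metric is an accuracy drop, so a positive Δ there indicates a loss rather than a gain.

```python
# Values copied from the table above: (benchmark, baseline, this_paper).
rows = [
    ("User Study (Movie Summarization)", 0.0, 0.67),
    ("Implicit Hate Dataset (Group: Black)", 50.0, 82.0),
    ("Implicit Hate Dataset (Group: Immigrant)", 37.0, 47.0),
    ("Implicit Hate Dataset (Group: Muslim)", 62.0, 60.5),
    ("OpenBookQA", 0.0, 4.6),
]


def delta(baseline: float, reported: float) -> float:
    """Return reported minus baseline, rounded to two decimals (assumed precision)."""
    return round(reported - baseline, 2)


# Map each benchmark name to its recomputed delta.
deltas = {name: delta(baseline, reported) for name, baseline, reported in rows}

if __name__ == "__main__":
    for name, d in deltas.items():
        # "+g" prints an explicit sign and drops trailing zeros.
        print(f"{name}: {d:+g}")
```

The recomputed values can then be compared against the Δ column row by row; the only negative entry is the Muslim target group, where the reported accuracy falls below the baseline.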