| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Probing results demonstrate that standard LLMs (P5) implicitly encode sensitive attributes. | ||||
| MovieLens-1M | AUC (Gender Prediction) | - | 0.6865 | +0.1865 |
| Performance on Direct Recommendation (MovieLens-1M) showing UP5 improves utility while ensuring fairness. | ||||
| MovieLens-1M | Hit@1 | 0.1554 | 0.1762 | +0.0208 |
| MovieLens-1M | AUC (Gender Fairness) | 0.6865 | 0.5186 | -0.1679 |
| Multi-attribute fairness results using Prompt Mixture. | ||||
| MovieLens-1M | Hit@1 | 0.1554 | 0.1654 | +0.0100 |
| MovieLens-1M | AUC (Age Fairness) | 0.6093 | 0.5154 | -0.0939 |