| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Main comparison using LLaMA3-8B backbone shows RecPO outperforming baselines across all datasets. | ||||
| MovieLens-1M | HR@1 | 0.2902 | 0.3451 | +0.0549 |
| Amazon-Books | HR@1 | 0.5065 | 0.5802 | +0.0737 |
| LastFM | HR@1 | 0.5719 | 0.6830 | +0.1111 |
| Main comparison using Qwen-7B backbone confirms method generalizability. | ||||
| MovieLens-1M | HR@1 | 0.2706 | 0.3446 | +0.0740 |
| Valid Ratio analysis showing RecPO maintains instruction following better than SimPO. | ||||
| Amazon-Books | Valid Ratio | 0.9564 | 0.9851 | +0.0287 |