| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Zero-shot performance comparing RecBase-1.5B against general-purpose LLMs (Llama-3, Qwen-2) and rec-specific LLMs (P5, RecGPT). | ||||
| H&M | AUC | 0.6287 | 0.6761 | +0.0474 |
| Steam | AUC | 0.8102 | 0.8343 | +0.0241 |
| Average (8 datasets) | AUC | 0.5843 | 0.6063 | +0.0220 |
| Effect of fine-tuning on specific domains. | ||||
| Steam | AUC | 0.8343 | 1.0066 | +0.1723 |
| Ablation of CL-VAE components. | ||||
| Average (All) | AUC | 0.5891 | 0.6063 | +0.0172 |
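
The Δ column appears to be the plain difference between the "This Paper" and "Baseline" columns. The following is a minimal sketch, under that assumption, for recomputing Δ from the rows above and for checking that each reported AUC lies in [0, 1], the valid range for that metric. The names `ROWS`, `delta_mismatches`, and `out_of_range` are illustrative and do not come from the paper; the section labels are shorthand for the table's caption rows.

```python
from decimal import Decimal

# Rows transcribed from the table above:
# (section, benchmark, baseline, this_paper, reported_delta).
# Values are kept as strings so Decimal parses them exactly.
ROWS = [
    ("zero-shot", "H&M", "0.6287", "0.6761", "+0.0474"),
    ("zero-shot", "Steam", "0.8102", "0.8343", "+0.0241"),
    ("zero-shot", "Average (8 datasets)", "0.5843", "0.6063", "+0.0220"),
    ("fine-tuning", "Steam", "0.8343", "1.0066", "+0.1723"),
    ("ablation", "Average (All)", "0.5891", "0.6063", "+0.0172"),
]


def delta_mismatches(rows):
    """Return (section, benchmark) pairs whose reported delta differs
    from this_paper - baseline (assumed definition of the delta column)."""
    bad = []
    for section, name, base, ours, reported in rows:
        if Decimal(ours) - Decimal(base) != Decimal(reported):
            bad.append((section, name))
    return bad


def out_of_range(rows):
    """Return (section, benchmark) pairs with an AUC outside [0, 1]."""
    bad = []
    for section, name, base, ours, _ in rows:
        if not all(Decimal(0) <= Decimal(v) <= Decimal(1) for v in (base, ours)):
            bad.append((section, name))
    return bad


if __name__ == "__main__":
    print("delta mismatches:", delta_mismatches(ROWS))
    print("AUC out of range:", out_of_range(ROWS))
```

Using `Decimal` rather than `float` avoids spurious mismatches from binary rounding when comparing four-decimal values.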