| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Performance on ML-100K dataset, comparing Llama4Rec against baselines across different backbones. | | | | |
| ML-100K | Hit@3 | 0.0537 | 0.0647 | +0.0110 |
| ML-100K | NDCG@3 | 0.0381 | 0.0476 | +0.0095 |
| Performance on ML-1M dataset, showing improvements on larger, sparser data. | | | | |
| ML-1M | Hit@3 | 0.0846 | 0.0967 | +0.0121 |
| ML-1M | NDCG@3 | 0.0507 | 0.0608 | +0.0101 |
| Performance on BookCrossing dataset, which typically has higher sparsity. | | | | |
| BookCrossing | Hit@3 | 0.0275 | 0.0308 | +0.0033 |
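
The Δ column is the absolute difference between the "This Paper" and "Baseline" columns. As a minimal sketch for checking that arithmetic, the snippet below recomputes Δ from the table values and also derives a relative improvement in percent. The relative figure is not reported in the table and is shown only for illustration; the names `ROWS` and `improvements` are hypothetical helpers, not part of any released code.

```python
# Rows copied from the table above: (benchmark, metric, baseline, this_paper).
ROWS = [
    ("ML-100K", "Hit@3", 0.0537, 0.0647),
    ("ML-100K", "NDCG@3", 0.0381, 0.0476),
    ("ML-1M", "Hit@3", 0.0846, 0.0967),
    ("ML-1M", "NDCG@3", 0.0507, 0.0608),
    ("BookCrossing", "Hit@3", 0.0275, 0.0308),
]


def improvements(rows):
    """Return (benchmark, metric, absolute delta, relative gain in %) per row."""
    results = []
    for benchmark, metric, baseline, ours in rows:
        # Absolute gain, rounded to the table's four decimal places.
        delta = round(ours - baseline, 4)
        # Relative gain over the baseline (an added, illustrative quantity).
        relative_pct = round(100.0 * (ours - baseline) / baseline, 1)
        results.append((benchmark, metric, delta, relative_pct))
    return results


for benchmark, metric, delta, relative_pct in improvements(ROWS):
    print(f"{benchmark:<13}{metric:<8}{delta:+.4f}  {relative_pct:+.1f}%")
```

Absolute gains on metrics this small can look negligible, so the relative figure can help when comparing datasets whose baselines differ in scale.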