Evaluation Setup
Sequential recommendation on three real-world datasets (Beauty, Sports, Toys)
Benchmarks:
- Amazon Beauty (Sequential Recommendation)
- Amazon Sports (Sequential Recommendation)
- Amazon Toys (Sequential Recommendation)
Metrics:
- HR@10 (Hit Ratio)
- NDCG@10
- Statistical methodology: Not explicitly reported in the paper
Key Results
| Benchmark |
Metric |
Baseline |
This Paper |
Δ |
| TransRec demonstrates superior performance across all datasets compared to baselines, validating the multi-facet approach. |
| Amazon Beauty |
NDCG@10 |
0.0353 |
0.0682 |
+0.0329
|
Main Takeaways
- Combining IDs (distinctiveness) and Titles/Attributes (semantics) yields better performance than using either in isolation.
- Constrained generation effectively solves the 'hallucinated item' problem without needing expensive post-hoc vector matching.
- The method generalizes better to cold-start items because the LLM can leverage the semantic meaning of titles even with few user interactions.