| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| BinLLM consistently outperforms baselines on the ML-1M dataset across different model backbones. | ||||
| ML-1M | NDCG@10 | 0.0757 | 0.0805 | +0.0048 |
| ML-1M | HR@10 | 0.1378 | 0.1465 | +0.0087 |
| Ablation studies show that binary encoding is superior to decimal for short lengths, but decimal is effective for compression. | ||||
| ML-1M | NDCG@10 | 0.0792 | 0.0805 | +0.0013 |
| BinLLM shows robustness in sparse data scenarios (cold start). | ||||
| Games | NDCG@10 | 0.0241 | 0.0336 | +0.0095 |