| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| RTFM significantly improves TabPFN performance on the TabPertNet benchmark, surpassing all baselines. | ||||
| TabPertNet | Mean Rank AUC | 3.2 | 2.7 | -0.5 |
| TabPertNet | Mean Norm. AUC | 0.7483 | 0.8167 | +0.0684 |
| TabPertNet | Mean Norm. AUC | 0.6481 | 0.8167 | +0.1686 |
| RTFM also dominates on the TabArena benchmark, achieving the best rank and AUC. | ||||
| TabArena | Mean rank AUC OVO | 2.2 | 1.9 | -0.3 |
| TabArena | Mean Norm. AUC OVO | 0.9031 | 0.9298 | +0.0267 |
| TabArena | Mean Norm. AUC OVO | 0.7749 | 0.9298 | +0.1549 |