Evaluation Setup
Full-data supervised learning on diverse tabular datasets.
Benchmarks:
- Various tabular benchmarks (Binary Classification, Multiclass Classification, Regression)
Metrics:
- Accuracy (Classification)
- RMSE (Regression)
- ROC-AUC (Binary Classification)
- Statistical methodology: Not explicitly reported in the paper
Key Results
| Benchmark |
Metric |
Baseline |
This Paper |
Δ |
| The paper claims state-of-the-art performance but the provided text excerpt does not contain specific result tables with extractable numbers. It refers to 'extensive experiments' and 'Figure 2' for intra-node distance but lacks a results table in the excerpt. |
Main Takeaways
- DeLTa achieves state-of-the-art performance across diverse benchmarks (classification and regression).
- Qualitative analysis shows LLM-refined rules produce leaf nodes with lower intra-node sample distance than original RF rules, indicating better clustering of similar samples.
- The method successfully avoids the privacy and modality gap issues of data serialization methods.
- DeLTa is applicable to full-data settings without the need for computationally expensive LLM fine-tuning.