Evaluation Setup
Retrieval of relevant financial datasets (CSVs) based on complex user queries.
Benchmarks:
- Custom Financial Dataset (Tabular Data Retrieval) [New]
Metrics:
- Performance relative to SOTA (exact metric names like Recall@K not explicitly named, but comparative performance is described)
- Statistical methodology: Not explicitly reported in the paper
Key Results
| Benchmark |
Metric |
Baseline |
This Paper |
Δ |
| Custom Financial Dataset |
Performance comparison |
Not reported in the paper |
Not reported in the paper |
Not reported in the paper
|
Main Takeaways
- Fine-tuned lightweight models (TEM) can outperform massive general-purpose models (SOTA) in domain-specific tabular retrieval.
- File-level retrieval combined with a code-generation agent is a more scalable approach for heavy tabular analysis than row-level chunking.
- Vocabulary expansion (New Word Embeddings) helps stabilize training when introducing domain-specific terms.