Evaluation Setup
Law article recommendation on a constructed dataset of criminal judgments
Benchmarks:
- Custom Chinese Criminal Law Dataset (Law article recommendation (predicting article ID from case facts)) [New]
Metrics:
- Accuracy
- Statistical methodology: Not explicitly reported in the paper
Key Results
| Benchmark |
Metric |
Baseline |
This Paper |
Δ |
| Chinese Criminal Law Dataset |
Accuracy |
0.549 |
0.694 |
+0.145
|
Main Takeaways
- Integrating a Knowledge Graph (CLAKG) with LLMs significantly outperforms using LLMs alone for law article recommendation (+14.5% accuracy)
- The method outperforms strong baselines including BERT, DPCNN, and other RAG variants (TFIDF-RAG, Graph-RAG, Light-RAG), though specific numeric scores for these baselines are not in the snippet
- The closed-loop system allows the knowledge base to evolve via expert feedback, addressing the static nature of traditional training sets