| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Hallucination rates for LLama3.3-70B-Instruct vary significantly based on the input context provided. | ||||
| Clinical Trials Benchmark | Hallucination Rate | 31 | 0.3 | -30.7 |
| Clinical Trials Benchmark | Factuality Rate | 40 | 97 | +57 |
| Clinical Trials Benchmark | AUC | 0.50 | 0.95 | +0.45 |
| UMLS Disorders Benchmark | AUC | 0.50 | 0.96 | +0.46 |
| MedQA (USMLE) | Passing Rate | 87.1 | 92.1 | +5.0 |