| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Performance on Mixed-Quality Data (Expert + Medium): CLARE outperforms the baseline on Walker2d and Half-Cheetah, but falls below it on Hopper, when the data is a mix of expert and sub-optimal demonstrations. | ||||
| Walker2d (Exp & Med) | Average Return | 1674.2 | 3613.4 | +1939.2 |
| Hopper (Exp & Med) | Average Return | 2135.0 | 1422.7 | -712.3 |
| Half-Cheetah (Exp & Med) | Average Return | 2375.0 | 4667.8 | +2292.8 |
| Performance on Expert Data Only: CLARE matches or exceeds baselines even with limited high-quality data. | ||||
| Walker2d (Expert) | Average Return | 4990.5 | 5010.4 | +19.9 |
| Ant (Expert) | Average Return | 3940.3 | 5172.8 | +1232.5 |