| Benchmark | Metric | Baseline | This Paper | Δ (This Paper − Baseline) |
|---|---|---|---|---|
| **Real-world experiments demonstrating robustness of PLD-generated data compared to human demonstrations.** | ||||
| Franka Cube Pick-up | Success Rate | 0.33 | 1.00 | +0.67 |
| Franka Peg Insertion | Success Rate | 1.00 | 1.00 | 0.00 |
| **Simulation results showing scaling and architectural generalization.** | ||||
| LIBERO-90 | Success Rate | Not reported in the paper | 0.99 | Not reported in the paper |
| SimplerEnv | Success Rate | Not reported in the paper | Not reported in the paper | +0.50 |