| Benchmark | Metric | Baseline | This Paper | Δ |
|---|---|---|---|---|
| Comparative analysis on Franka Kitchen and Meta-World benchmarks demonstrating EmbodiedGPT's superiority over baselines. | ||||
| Franka Kitchen (10 demos) | Success Rate | 28.7 | 50.8 | +22.1 |
| Franka Kitchen (10 demos) | Success Rate | 45.3 | 50.8 | +5.5 |
| Meta-World (10 demos) | Success Rate | 53.9 | 76.4 | +22.5 |
| Meta-World (10 demos) | Success Rate | 72.2 | 76.4 | +4.2 |
| Ablation studies validating the contributions of the closed-loop design and Chain-of-Thought (CoT) training. | ||||
| Franka Kitchen (10 demos) | Success Rate | 38.6 | 50.8 | +12.2 |
| Meta-World (10 demos) | Success Rate | 62.7 | 76.4 | +13.7 |
| Franka Kitchen (10 demos) | Success Rate | 26.2 | 50.8 | +24.6 |