Evaluation Setup
Robotic manipulation in simulation and real-world
Benchmarks:
- 10 Simulated Tasks (Robotic Manipulation) [New]
- 8 Real-World Tasks (Robotic Manipulation) [New]
Metrics:
- Success Rate
- Reward Accuracy
- Value-Order Consistency (VOC)
- Statistical methodology: Not explicitly reported in the paper
Key Results
| Benchmark |
Metric |
Baseline |
This Paper |
Δ |
| Reward Accuracy Benchmark |
Progress Accuracy |
Not reported in the paper |
92.8 |
Not reported in the paper
|
| Rank-correlation Benchmark |
Value-Order Consistency (VOC) |
Not reported in the paper |
0.953 |
Not reported in the paper
|
| Real-world manipulation tasks |
Success Rate |
0 |
95 |
+95
|
Main Takeaways
- GRM provides state-of-the-art accuracy in progress assessment, enabling reliable dense rewards.
- Dopamine-RL enables extremely sample-efficient learning (1 hour on real robot) by leveraging dense rewards without biasing the policy.
- The multi-view, hop-based approach generalizes well to unseen layouts and visual variations.