Evaluation Setup
Simulation-based training with PPO across multiple dexterous manipulation tasks
Benchmarks:
- Cluttered Object Singulation (Dexterous Manipulation)
- Constrained Object Retrieval (Dexterous Manipulation)
- In-hand Reorientation (Dexterous Manipulation)
- Bimanual Manipulation (Dexterous Manipulation)
Metrics:
- Success Rate
- Training Efficiency (Convergence Speed)
- Statistical methodology: Not explicitly reported in the provided text
Main Takeaways
- CCGE substantially improves training efficiency and success rates compared to existing exploration methods (State/Dynamics novelty) across all tested tasks
- The method successfully mitigates 'cross-state interference' by using state-conditioned counters, allowing agents to reuse contact patterns in different task phases
- Qualitative results suggest the policies learned with CCGE transfer robustly to real-world systems, implying the discovered contact strategies are physically realistic