MARL: Multi-Agent Reinforcement Learning—training multiple agents (here, human and robot) to interact in a shared environment
Co-adaptation: The process where two agents (human and exoskeleton) mutually adjust their behaviors in response to each other
Musculoskeletal simulation: A physics-based computational model simulating human bones, joints, and muscle dynamics
Non-stationary: A learning environment where the state distribution changes over time (e.g., because the other agent is changing its policy)
PPO: Proximal Policy Optimization—a reinforcement learning algorithm that updates policies with clipped constraints to ensure stability
RMS torque: Root Mean Square torque—a measure of the average magnitude of torque applied over a period
Sim-to-real: Transferring a policy learned in a computer simulation to physical hardware
Gait cycle: One complete sequence of walking functions, typically measured from heel strike to heel strike (0-100%)
Ablation: Removing a component of the system (e.g., a training stage) to test its specific contribution to the result