Inverse Kinematics (IK): The mathematical process of calculating the variable joint parameters (angles) needed to place the end of a robot arm at a given point in space.
ROS: Robotic Operating System—a set of software libraries and tools for building robot applications.
MuJoCo: Multi-Joint dynamics with Contact—a physics engine used for simulating robotics.
PPO: Proximal Policy Optimization—a reinforcement learning algorithm that improves stability by limiting how much the policy can change in each update.
DDPG: Deep Deterministic Policy Gradient—an actor-critic algorithm for continuous action spaces.
TRPO: Trust Region Policy Optimization—an RL algorithm that ensures monotonic improvement by enforcing a KL divergence constraint.
A2C: Advantage Actor-Critic—a synchronous version of the A3C algorithm that uses a baseline to reduce variance.
HSV: Hue, Saturation, Value—a color model often used in computer vision for color-based object detection.
quaternion: A mathematical notation for representing orientations and rotations of objects in three dimensions.