Egocentric: First-person viewpoint (camera worn on head/glasses)
Exocentric: Third-person viewpoint (static external cameras)
MPJPE: Mean Per Joint Position Error—average Euclidean distance between predicted and ground truth joint coordinates (mm)
PCK-AUC: Percentage of Correct Keypoints Area Under Curve—a metric where higher is better, measuring robustness of pose estimation at various error thresholds
Volumetric Convolution: 3D convolution operations performed on a voxel grid (height x width x depth) rather than a 2D image plane
Soft-argmax: A differentiable operation that extracts the numerical coordinate of the maximum value in a heatmap
SVEgoNet: Single-View Egocentric Network—the baseline pose estimator trained on the new dataset
MVExoNet: Multi-View Exocentric Network—the proposed model used to generate ground truth annotations
Triangulation: Geometric method to find 3D points by intersecting lines from 2D points in multiple camera views
Heatmap: A probability map where high values indicate the likely position of a keypoint