Egocentric: First-person perspective data captured from the point of view of the user (e.g., from glasses), as opposed to 'allocentric' or third-person views.
SLAM: Simultaneous Localization and Mapping—algorithms that construct a map of an unknown environment while simultaneously keeping track of an agent's location within it.
VIO: Visual Inertial Odometry—estimating a device's position and orientation by combining camera images with inertial sensor (IMU) data.
MPS: Machine Perception Services—Meta's cloud-based pipeline that processes raw Aria data to generate derived signals like trajectories and eye gaze.
VRS: Vision Replay System—an optimized file format used to record and playback streams of time-aligned sensor data.
6DoF: Six Degrees of Freedom—refers to the freedom of movement of a rigid body in three-dimensional space (forward/backward, up/down, left/right) combined with rotation about three perpendicular axes (yaw, pitch, roll).
Cyclopean Eye Frame: A coordinate frame defined at the midpoint between the left and right pupils, used as the origin for the combined gaze ray.
IMU: Inertial Measurement Unit—a device that measures specific force (accelerometer) and angular rate (gyroscope).
Epipolar Geometry: The geometry of stereo vision; used here to triangulate 3D points from 2D camera observations.