RePo: Context Re-Positioning—the proposed method that learns to assign continuous position values to tokens based on their content
RoPE: Rotary Position Embedding—a method encoding position information by rotating query and key vectors in embedding space
Extraneous Load: In Cognitive Load Theory, the mental effort imposed by the way information is presented or organized, which distracts from the actual learning or reasoning task
Germane Load: Mental effort dedicated to processing information and constructing schemas (useful reasoning), which RePo aims to maximize by reducing extraneous load
NoPE: No Position Encoding—a baseline where explicit position information is removed
NIAH: Needle-In-A-Haystack—a benchmark task testing a model's ability to retrieve a specific piece of information ('needle') buried in a long context ('haystack')
SwiGLU: A gated activation unit combining Swish activation and Gated Linear Units, used here to extract position representations
YaRN: Yet another RoPE for Nontraditional context—a method to extend the context window of RoPE-based models by modifying frequency components