Prompt Highlighting: Techniques that force an LLM to pay more attention to specific user-marked parts of the input prompt
Differential Cross-Covariance: A statistical method used here to isolate the change in signal between positive and negative conditions, subtracting out shared correlations
Routing Channel: The Key (K) vectors in attention, which determine 'where' the model attends based on similarity with Queries
Content Channel: The Value (V) vectors in attention, which determine 'what' information is actually passed forward to the next layer
Softplus: A smooth activation function, f(x) = log(1 + exp(x)), used here to assign continuous importance weights to attention heads
FlashAttention: An algorithm that speeds up attention computation by reducing memory reads/writes; compatible with Prism-Δ because the method edits inputs to attention rather than the attention matrix itself
SEKA: The primary baseline (Spectral Editing of Key Activations), which edits only Key vectors via spectral decomposition