Larimar: A specific LLM architecture augmented with an external episodic memory controller, allowing read/write access to latent representations.
readout vector: The vector retrieved from the external memory in Larimar, acting as a compressed KV cache to condition the decoder.
GRACE: Generalized RAdius-based Context Editingβa model editing method that adds a codebook adapter to specific layers to fix errors without changing base weights.
RougeL: A metric measuring the longest common subsequence between reference and generated text, used here to assess factual overlap.
Jaccard similarity: A statistic used for gauging the similarity and diversity of sample sets (tokens in this case).
hallucination: Generated content that is nonsensical or unfaithful to the provided source content/facts.
KV cache: Key-Value cache; stored intermediate states in Transformer models used to speed up generation.
SOTA: State-of-the-Art; the current best performance for a specific task.