CRUD: Create, Read, Update, Delete—the four fundamental atomic operations for persistent storage management
GRPO: Group Relative Policy Optimization—a reinforcement learning algorithm used here to refine the memory policy by optimizing task-level success
SFT: Supervised Fine-Tuning—initial training phase using labeled examples to teach the model the API schema and basic behaviors
Scratchpad: A centralized memory entry that is mandatorily retrieved at every step to maintain global task state, complementing the selective vector storage
POMDP: Partially Observable Markov Decision Process—a mathematical framework for decision-making where the agent cannot directly observe the full state of the environment
Needle-in-a-Haystack: An evaluation setting where a small piece of critical information ('needle') is hidden within a large amount of irrelevant text ('haystack')
Vector Database: A storage system that indexes data via embedding vectors, used here to implement the 'Read' operation via semantic similarity