memory fragmentation: The dispersion of relevant information across different storage formats (raw logs vs. summaries), making it hard to retrieve a complete picture
turn: A single exchange (user input + assistant response) in a dialogue
round: A grouping of turns representing a coherent exchange or topic within a session
generated memory: Information synthesized by an LLM from raw logs, such as summaries, extracted facts, or reflective insights
KNN graph: k-Nearest Neighbors graph, where nodes are connected to their k most similar peers based on vector embedding similarity
NLTK: Natural Language Toolkit—a standard Python library used here for splitting text into sentences without using a heavy neural model
Sentence-BERT: A modification of the BERT network that uses siamese networks to derive semantically meaningful sentence embeddings that can be compared using cosine-similarity
RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
LLM-as-a-Judge: An evaluation method where a strong LLM (like GPT-4) is prompted to score the accuracy of a model's response against a reference answer