VLMs: Vision-Language Models—AI models capable of processing both text and images.
LongCodeZip: A baseline textual compression method that filters code chunks based on Approximate Mutual Information (AMI) and knapsack optimization.
dependency closure: The property that all necessary definitions, variables, and constraints required to understand a piece of code are present in the context.
semantic fragmentation: The loss of meaning or logic when related code parts are separated or removed during compression.
CompScore: A metric for code summarization where a judge model (GPT-4o) compares generated summaries against a reference.
AMI: Approximate Mutual Information—a metric used to rank the relevance of code chunks by estimating their contribution to reducing perplexity.
Exact Match (EM): A metric checking if the generated code is identical to the ground truth.
Edit Similarity (ES): A metric measuring the textual similarity between generated code and ground truth based on edit distance.
RAG: Retrieval-Augmented Generation—retrieving relevant snippets to augment the model's context.
Glyph: A specialized 9B parameter Vision-Language Model designed for reading text from images (visual compression).