Prefix-tuning: A method to optimize a continuous vector (embedding) appended to the system message, rather than updating model weights.
System message: High-level instructions given to an LLM (e.g., 'You are a helpful AI assistant') that govern its general behavior.
Abstractive summarization: Generating a summary that captures the main ideas of a text, often using new phrasing rather than extracting sentences.
Ungrounded entities: Named entities (e.g., people, places, medical terms) appearing in the output that do not exist in the source context.
Spurious attributes: Features of the synthetic task (like lack of newlines) that the model might overfit to, hurting performance on real tasks.
KL divergence: A statistical measure used here to ensure the optimized model does not drift too far from the original model's behavior on general reference data.