LoRA: Low-Rank Adaptation—a parameter-efficient fine-tuning technique that freezes the main model weights and trains small rank-decomposition matrices instead
SFT: Supervised Fine-Tuning—training a pre-trained model on labeled instruction-output pairs to learn specific tasks
Knowledge LoRA: An adapter module in Prereq-Tune trained specifically to memorize facts/statements (prerequisite knowledge) relevant to the downstream task
Skill LoRA: An adapter module in Prereq-Tune trained on the task format (e.g., Q&A) while the Knowledge LoRA is active; it learns the task 'skill' rather than the facts
Knowledge Inconsistency: The mismatch between facts present in fine-tuning data and the facts (or lack thereof) in the model's pre-training corpus
Fictitious Synthetic Data: Training data generated by an LLM concerning non-existent entities (e.g., fake people), used here to teach the model to handle unknown information without hallucinating
RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents