DKG: Domain Knowledge Graph—a structured representation of knowledge (entities and relations) specific to a particular field like medicine
RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents or data
Triple: The fundamental unit of a knowledge graph, consisting of (Subject, Relation, Object)
Zero-shot: The ability of a model to perform a task without seeing any specific training examples for that task
Vector Database: A database that stores data as high-dimensional vectors (embeddings), enabling fast similarity search
Cosine Distance: A metric used to measure how different two vectors are; used here to find semantically similar knowledge triples
Pruning: The process of removing irrelevant or low-quality retrieved information to prevent confusing the LLM
Apprenticeship Phase: A phase where the system learns from 'gold' (correct) answers provided by an expert/dataset to build its initial knowledge graph
Mastership Phase: A phase where the system operates autonomously, using user feedback to decide which self-generated answers are high-quality enough to extract knowledge from