Knowledge Graph (KG): A structured representation of facts, typically as triples (entity, relation, entity), used here as a source of ground truth
Judge Model: A lightweight classifier (neural network) trained to predict whether an LLM knows a fact based on the LLM's internal hidden states
Negative Sampling: The process of creating false statements by corrupting a true triple (e.g., swapping the tail entity) to test if the model can identify falsehoods
P-tuning: A method to optimize continuous prompt embeddings instead of discrete text tokens, used here to compress instructions
Declarative Statement: A simple sentence stating a fact (e.g., 'Sky is blue') rather than a question (e.g., 'What color is the sky?'), used to simplify evaluation
Hidden States: The internal numerical representations (vectors) within an LLM that capture the model's processing of the input before generating output tokens
Substitute Model: Using a smaller version of an LLM (e.g., 7B) to compute hidden states for the judge, assuming representation alignment with larger versions (e.g., 70B)