Knowledge Profiling: A framework characterizing facts by whether they are encoded (stored) and how accessible they are (recallable with/without thinking)
Encoding: Operationalized as the ability of an LLM to correctly complete a factual proposition given a context mimicking its pre-training data
Recall: The ability of an LLM to correctly answer questions about an encoded fact across different contexts (phrasings, direct/reverse directions)
Thinking: Inference-time computation (like Chain-of-Thought or internal reasoning traces) used before generating a final answer
Reversal Curse: The phenomenon where an LLM knows 'A is B' but fails to answer 'What is B?' (e.g., knows Oasis -> Boardwalk, but not Boardwalk -> Oasis)
WikiProfile: The new benchmark dataset introduced in this paper, containing 2,150 facts extracted from Wikipedia with associated probing questions
Lost Keys: Metaphor for facts that are encoded in the model parameters but inaccessible during standard inference
Empty Shelves: Metaphor for facts that were never learned or encoded by the model during pre-training
Direct Question: A question asking for the object given the subject (matches training order)
Reverse Question: A question asking for the subject given the object (reverses training order)
Autorater: An LLM-based grader used to evaluate the correctness of model responses against gold answers
Fact: Defined as a proposition involving an ordered pair of entities (subject and object) extracted from a source text