ZP-LKE: Zero-Prompt Latent Knowledge Estimator—the proposed method that uses only lists of example facts to probe knowledge, without instructional text
LKE: Latent Knowledge Estimator—methods attempting to determine what facts are stored inside an LLM's parameters
meta-linguistic judgment: The ability of an LLM to understand instructions about language usage (e.g., 'answer in JSON format'), which often interferes with pure knowledge retrieval
side-channel: Information unintentionally leaked to the model via the prompt template (e.g., using 'elected' implies a political role), making the task easier than intended
T-Rex: A large-scale dataset of facts aligned with Wikipedia triplets, used here for benchmarking knowledge estimation
Wikidata: A structured knowledge base used as the source of ground truth facts
ICL: In-Context Learning—the ability of LLMs to learn tasks from examples provided in the prompt without parameter updates
HGP: Human-Generated Prompts—manual templates written by humans to query the model (e.g., 'X was born in Y')
MMP: Machine-Mined Prompts—templates automatically discovered from large corpora (e.g., Wikipedia) that frequently co-occur with the fact triplets
FS-LKE: Few-Shot Latent Knowledge Estimator—using prompt templates combined with a few examples
ZS-LKE: Zero-Shot Latent Knowledge Estimator—using prompt templates without any examples