_comment: REQUIRED: Define ALL technical terms, acronyms, and method names used ANYWHERE in the entire summary. After drafting the summary, perform a MANDATORY POST-DRAFT SCAN: check every section individually (Core.one_sentence_thesis, evaluation_highlights, core_problem, Technical_details, Experiments.key_results notes, Figures descriptions and key_insights). HIGH-VISIBILITY RULE: Terms appearing in one_sentence_thesis, evaluation_highlights, or figure key_insights MUST be defined—these are the first things readers see. COMMONLY MISSED: PPO, DPO, MARL, dense retrieval, silver labels, cosine schedule, clipped surrogate objective, Top-k, greedy decoding, beam search, logit, ViT, CLIP, Pareto improvement, BLEU, ROUGE, perplexity, attention heads, parameter sharing, warm start, convex combination, sawtooth profile, length-normalized attention ratio, NTP. If in doubt, define it.
TKGQA: Temporal Knowledge Graph Question Answering—answering questions using facts stored as quadruples (subject, predicate, object, timestamp)
TKG: Temporal Knowledge Graph—a knowledge graph where every edge (fact) has an associated timestamp
FKS: Facts Knowledge Store—a vector index of all facts in the graph embedded using a standard language model
TKS: Temporal Knowledge Store—a vector index of all facts embedded using a fine-tuned time-aware encoder
Hits@1: A metric measuring the percentage of questions where the top-1 predicted answer is correct
SentenceBERT: A modification of the BERT network that uses siamese networks to derive semantically meaningful sentence embeddings
Contrastive Learning: A learning paradigm where the model learns to pull positive pairs closer and push negative pairs apart in vector space
Quadruple: A format for temporal facts: (subject, predicate, object, time)