UMLS: Unified Medical Language System—a comprehensive repository of health and biomedical vocabularies and standards
Triplets (also called triples): The fundamental units of data in a knowledge graph, each consisting of (Subject, Relation, Object)
MMR: Maximal Marginal Relevance—a ranking method that balances relevance to the query with diversity (novelty) relative to already selected items
Cross-encoder: A model architecture that processes two inputs (query and document) simultaneously to output a relevance score, typically more accurate but slower than bi-encoders
ROUGE-L: A metric based on the longest common subsequence (LCS) between a generated summary and a reference; it rewards words that appear in the same order in both texts, even when they are not contiguous
UmlsBERT: A BERT model whose pre-training incorporates knowledge from the UMLS Metathesaurus so that it better represents medical concepts and the relations between them
MedCPT: A biomedical retrieval model trained on PubMed search logs; its cross-encoder re-ranker is used here for re-ranking retrieved triples
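
To make the MMR entry above concrete, here is a minimal sketch of greedy MMR selection. It is illustrative only: the function names (`cosine`, `mmr_select`), the use of cosine similarity, and the trade-off weight `lam` are assumptions for this example, not details taken from the source.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def mmr_select(query_vec, doc_vecs, k, lam=0.5):
    """Greedily pick up to k document indices by Maximal Marginal Relevance.

    Each step scores a candidate as
        lam * sim(candidate, query) - (1 - lam) * max sim(candidate, already selected)
    so lam=1.0 ranks purely by relevance and lower values favour diversity.
    """
    relevance = [cosine(query_vec, d) for d in doc_vecs]
    candidates = list(range(len(doc_vecs)))
    selected = []

    while candidates and len(selected) < k:
        def mmr_score(i):
            # Redundancy is the similarity to the closest item already chosen.
            redundancy = max(
                (cosine(doc_vecs[i], doc_vecs[j]) for j in selected),
                default=0.0,
            )
            return lam * relevance[i] - (1 - lam) * redundancy

        best = max(candidates, key=mmr_score)
        selected.append(best)
        candidates.remove(best)

    return selected


# Toy embeddings (hypothetical): items 0 and 1 are near-duplicates, item 2 is different.
query = [1.0, 0.0]
docs = [[1.0, 0.1], [1.0, 0.15], [0.6, -0.8]]
balanced = mmr_select(query, docs, k=2, lam=0.5)        # [0, 2]
relevance_only = mmr_select(query, docs, k=2, lam=1.0)  # [0, 1]
```

With `lam=0.5` the near-duplicate item 1 is skipped in favour of the less relevant but novel item 2; with `lam=1.0` the selection reduces to a plain relevance ranking.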