GNN: Graph Neural Network—a deep learning model that processes graph-structured data by aggregating information from neighboring nodes.
GIN: Graph Isomorphism Network—a specific GNN architecture designed to be as powerful as the Weisfeiler-Lehman graph isomorphism test.
InfoNCE: Information Noise Contrastive Estimation—a loss function used to learn representations by pulling positive pairs together and pushing negative pairs apart.
Principal Subgraph Mining: An algorithm that extracts a vocabulary of frequent, large subgraphs from a dataset to serve as fragments.
Fragment Graph: A coarse-grained graph where nodes represent chemical fragments (e.g., rings) and edges represent bonds connecting them.
Scaffold Split: A dataset splitting method that separates molecules based on their core structural framework (scaffold) to test generalization to new chemical spaces.
ROC-AUC: Receiver Operating Characteristic - Area Under Curve—a performance metric for classification problems at various threshold settings.
MAE: Mean Absolute Error—a measure of errors between paired observations expressing the same phenomenon.
GatedGCN: Gated Graph Convolutional Network—a GNN variant using gate mechanisms to control information flow.