SMILES: Simplified Molecular Input Line Entry System—a text notation for describing the structure of chemical molecules
ADMET: Absorption, Distribution, Metabolism, Excretion, and Toxicity—key pharmacokinetic properties determining a drug's viability
HTS: High-Throughput Screening—automated testing of large numbers of chemical and biological compounds for a specific biological target
DTI: Drug-Target Interaction—prediction of the binding affinity between a drug molecule and a protein target
ROC-AUC: Receiver Operating Characteristic - Area Under the Curve—a performance metric for classification problems at various threshold settings
ReAct: Reasoning + Acting—a prompting paradigm where LLMs generate reasoning traces and task-specific actions in an interleaved manner
ESM: Evolutionary Scale Modeling—large language models trained on protein sequences to predict structure and function
ChemBERTa: A BERT-like transformer model pre-trained on chemical structures (SMILES) for molecular property prediction