SMILES: Simplified Molecular Input Line Entry System—a text notation for representing chemical structures
TDC: Therapeutics Data Commons—a large benchmark suite of datasets for drug discovery tasks
ReAct: Reason+Act—a paradigm where agents generate reasoning traces and task-specific actions (tool calls) in an interleaved manner
IC50: Half maximal inhibitory concentration—a measure of the potency of a substance in inhibiting a specific biological or biochemical function
AUROC: Area Under the Receiver Operating Characteristic curve—a performance metric for classification problems at various threshold settings
MMLU: Massive Multitask Language Understanding—a benchmark designed to measure knowledge acquired during pretraining
MAE: Mean Absolute Error—a measure of errors between paired observations expressing the same phenomenon
Fingerprints: Bit-vector representations of molecular structure (e.g., Morgan fingerprints) used for similarity search
Instruction Tuning: Fine-tuning a pre-trained language model on a collection of formatted datasets (instruction, input, output) to improve its ability to follow tasks
Agentic-Tx: The proposed system that uses an LLM as a controller to orchestrate tools (like property predictors and web search) to solve complex therapeutic problems