Sci-LLMs: Scientific Large Language Models—models adapted for science via fine-tuning on scientific data or integration with scientific tools
Agentic Science: A stage of AI where systems act as autonomous partners capable of the full discovery cycle (hypothesis to analysis) with minimal human guidance
Level 1 (Computational Oracle): AI as a specialized tool/function approximator for specific tasks (e.g., protein folding prediction) without autonomy
Level 2 (Automated Research Assistant): AI that executes pre-defined sub-goals or workflows (e.g., running a simulation pipeline) but lacks high-level strategic direction
Level 3 (Autonomous Scientific Partner): AI that independently conducts the discovery loop, including hypothesis generation and experimental design
Level 4 (Generative Architect): Hypothetical future AI capable of inventing new scientific frameworks, instruments, or paradigms
RAG: Retrieval-Augmented Generation—enhancing model outputs by retrieving relevant information from external knowledge bases
CoT: Chain-of-Thought—a prompting technique where models generate intermediate reasoning steps
SMILES: Simplified Molecular Input Line Entry System—a text notation for representing chemical structures
PDEs: Partial Differential Equations—mathematical equations describing continuous physical phenomena like fluid dynamics