LLM: Large Language Model—AI models trained on vast text data capable of generating human-like text and code
PDB: Protein Data Bank—a database of 3D structural data of large biological molecules, such as proteins
Normal Mode Analysis (NMA): A computational method to simulate the vibrational movements (natural frequencies) of a protein structure
Chroma: A generative AI model used for de novo protein design
OmegaFold: A deep learning model for predicting the 3D folded structure of a protein from its amino acid sequence
CATH: Class, Architecture, Topology, Homology—a hierarchical classification system for protein structures
RAG: Retrieval-Augmented Generation—AI systems that answer questions by first searching for relevant documents
PDE: Partial Differential Equation—mathematical equations used to describe physical phenomena like vibrations
ForceGPT: A fine-tuned transformer model used in this paper to predict mechanical unfolding properties of proteins
AA: Amino Acid—the fundamental building blocks of proteins