← Back to Paper List

The Virtual Lab: AI Agents Design New SARS-CoV-2 Nanobodies with Experimental Validation

Kyle Swanson, Wesley Wu, Nash L. Bulaong, J. Pak, James Y. Zou
Department of Computer Science, Stanford University, Chan Zuckerberg Biohub-San Francisco
bioRxiv (2024)
Agent Reasoning Benchmark

📝 Paper Summary

Multi-agent scientific discovery Protein design / Nanobody engineering
The Virtual Lab is an AI system where a team of specialized LLM agents collaboratively designs, implements, and executes a computational pipeline to create effective new nanobodies for COVID-19.
Core Problem
Interdisciplinary research requires coordinating diverse experts (e.g., virologists, ML engineers), which is resource-intensive and slow, hindering rapid responses to evolving threats like SARS-CoV-2 variants.
Why it matters:
  • Rapidly evolving viruses develop resistance to existing therapies, creating an urgent need for fast, automated design of binders for new variants like KP.3
  • Most scientists lack immediate access to large, diverse teams of experts, limiting the complexity of research they can undertake alone
  • Current LLM tools (e.g., ChemCrow) handle narrow tasks but struggle with open-ended, multi-step research design requiring high-level reasoning across fields
Concrete Example: A human biologist wants to design nanobodies for a new variant but doesn't know how to code the latest ML models (like ESM or AlphaFold). Currently, they must hire a computational specialist or learn it themselves. The Virtual Lab automatically spawns a 'Computational Biologist' agent to write the code and an 'Immunologist' agent to guide the biological strategy.
Key Novelty
Virtual Lab: Collaborative AI-Human Research Framework
  • Simulates a research group where a Principal Investigator (PI) agent spawns specialized scientist agents (e.g., Immunologist, ML Specialist) based on the project description
  • Agents hold structured meetings (Team vs. Individual) to debate agendas, write code, and critique each other's work under human supervision
  • Integrates high-level reasoning (LLMs) with specialized computational tools (AlphaFold, Rosetta) to perform end-to-end research from ideation to execution
Evaluation Highlights
  • 92 AI-designed nanobodies were experimentally synthesized; 90% expressed soluble protein, showing the designs were biologically viable
  • Two novel nanobodies showed improved binding to the recent JN.1 or KP.3 variants while retaining binding to the ancestral strain, validating the design pipeline
  • Human intervention was minimal: The human researcher wrote only ~1.3% of the text (1,596 words) while AI agents generated 98.7% (122,462 words) including all code
Breakthrough Assessment
9/10
Demonstrates a complete loop from high-level AI research planning to wet-lab experimental validation with successful hits. Moves beyond 'LLMs writing code' to 'LLMs conducting science'.
×