genetic perturbation: Modifying a gene (e.g., repressing or activating it) to observe the resulting change in biological traits (phenotype)
phenotype: The observable physical properties of an organism or cell, such as cell growth or protein levels
hit: A gene that, when perturbed, produces a phenotypic response exceeding a specific threshold
hit ratio: The fraction of true hits discovered cumulatively over the course of the experiment relative to the total number of true hits
cold start problem: The difficulty of making good predictions at the beginning of an experiment when no data has yet been collected
essential genes: Genes critical for cell survival; perturbing them almost always causes a strong effect, making them 'easy' but often uninformative targets
Reactome: A curated database of biological pathways and processes used here to find genes with similar functions
Bayesian optimization: A strategy for global optimization of black-box functions, typically used here as a baseline for selecting experiments
acquisition function: In Bayesian optimization, a function that guides the search by balancing exploration (finding new info) and exploitation (using known info)
CRISPR: A technology used to selectively modify the DNA of living organisms, used here to perform the perturbations
Coreset: A specific active learning or batch selection strategy used as a baseline, focusing on selecting a diverse set of representative samples