Different experiments of differing fidelities are commonly used in the search for new drug molecules. In classic experimental funnels, libraries of molecules undergo sequential rounds of virtual, coarse, and refined experimental screenings, with each level balanced between the cost of experiments and the number of molecules screened. Bayesian optimization offers an alternative approach, using iterative experiments to locate optimal molecules with fewer experiments than large-scale screening, but without the ability to weigh the costs and benefits of different types of experiments. In this work, we combine the multifidelity approach of the experimental funnel with Bayesian optimization to search for drug molecules iteratively, taking full advantage of different types of experiments, their costs, and the quality of the data they produce. We first demonstrate the utility of the multifidelity Bayesian optimization (MF-BO) approach on a series of drug targets with data reported in ChEMBL, emphasizing what properties of the chemical search space result in substantial acceleration with MF-BO. Then we integrate the MF-BO experiment selection algorithm into an autonomous molecular discovery platform to illustrate the prospective search for new histone deacetylase inhibitors using docking scores, single-point percent inhibitions, and dose-response IC50 values as low-, medium-, and high-fidelity experiments. A chemical search space with appropriate diversity and fidelity correlation for use with MF-BO was constructed with a genetic generative algorithm. The MF-BO integrated platform then docked more than 3,500 molecules, automatically synthesized and screened more than 120 molecules for percent inhibition, and selected a handful of molecules for manual evaluation at the highest fidelity. Many of the molecules screened have never been reported in any capacity. At the end of the search, several new histone deacetylase inhibitors were found with submicromolar inhibition, free of problematic hydroxamate moieties that constrain the use of current inhibitors.
Journal article
Bayesian Optimization over Multiple Experimental Fidelities Accelerates Automated Discovery of Drug Molecules
ACS central science, v 11(2), pp 346-356
05 Feb 2025
Featured in Collection : UN Sustainable Development Goals @ Drexel
Abstract
Metrics
Details
- Title
- Bayesian Optimization over Multiple Experimental Fidelities Accelerates Automated Discovery of Drug Molecules
- Creators
- Matthew A. McDonald - Massachusetts Institute of TechnologyBrent A. Koscher - Massachusetts Institute of TechnologyRichard B. Canty - Massachusetts Institute of TechnologyJason Zhang - Massachusetts Institute of TechnologyAngelina NingKlavs F. Jensen - Massachusetts Institute of Technology
- Publication Details
- ACS central science, v 11(2), pp 346-356
- Publisher
- ACS Publications
- Number of pages
- 11
- Grant note
- DARPA Accelerated Molecular Discovery (AMD) program: HR00111920025 MIT Consortium, Machine Learning for Pharmaceutical Discovery and Synthesis consortium (MLPDS)
Funding was provided by DARPA Accelerated Molecular Discovery (AMD) program under contract HR00111920025 and the MIT Consortium, Machine Learning for Pharmaceutical Discovery and Synthesis consortium (MLPDS).
- Resource Type
- Journal article
- Language
- English
- Academic Unit
- Chemical and Biological Engineering
- Web of Science ID
- WOS:001415307100001
- Scopus ID
- 2-s2.0-85217125678
- Other Identifier
- 991022027423304721
UN Sustainable Development Goals (SDGs)
This publication has contributed to the advancement of the following goals:
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Collaboration types
- Domestic collaboration
- Web of Science research areas
- Chemistry, Multidisciplinary