J Comput Chem. 2021 Dec 4. doi: 10.1002/jcc.26791. Online ahead of print.
ABSTRACT
Buchwald-Hartwig amination reaction catalyzed by palladium plays an important role in drug synthesis. In the last few years, machine learning-assisted strategies emerged and quickly gained attention. In this article, an importance and relevance-based integrated feature screening method is proposed to effectively filter high-dimensional feature descriptor data. Then, a regularized machine learning boosting tree model, eXtreme Gradient Boosting, is introduced to intelligently predict reaction performance in multidimensional chemistry space. Furthermore, convergence, interpretability, generalization, and the internal association between reaction conditions and yields are excavated, which provides intelligent assistance for the optimal design of coupling reaction system and evaluating the reaction conditions. Compared with recently published results, the proposed method requires fewer feature descriptors, takes less time, and achieves more accurate prediction accuracy.
PMID:34862652 | DOI:10.1002/jcc.26791