A round-robin exercise for the precise prediction of aqueous solubility of organic chemicals using chemometric, machine learning, and stacking ensemble of deep learning models

J Comput Aided Mol Des. 2026 Jun 8;40(1):143. doi: 10.1007/s10822-026-00854-x.

ABSTRACT

Aqueous solubility is an important property for assessing the druggability and ecotoxicological effects of molecules. Successful drug candidates should have optimal aqueous solubility to improve bioavailability to target tissues. To effectively screen molecules in a short period of time, reliable predictive models are highly useful. In the present study, we conducted a round-robin exercise using a large, curated dataset of over 6000 compounds to predict aqueous solubility quantitatively. The six participating groups used an array of Machine Learning and Deep Learning algorithms to develop models with strong robustness and external predictive performance. All the models underwent rigorous Leave-One-Out and tenfold cross-validation. The diversity of training sets and descriptor types used by different groups paved the way for exploring the mechanistic basis for the efficient identification of contributing features. The best-performing model was selected using the statistical Sum of Ranking Differences (SRD) approach, considering the performances on training, cross-validation, and test, as well as the performance difference between the training and test sets. Additionally, a curated, true external set was screened by the six different models. Here, the best-performing model was selected using a consensus ranking strategy based on Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and [Formula: see text]. In both approaches, i.e., the inherent model performance in terms of training, test, and cross-validation statistics, and the ability of the model to efficiently predict true external data, the Stacking Ensemble of Deep q-RASPR models emerged as the winner. This model showed comparable predictive performance to the previously reported model, which apparently lacked a proper data curation workflow and contained a significant number of duplicates and mixtures in its dataset, which can inflate model statistics. The insights from the different feature contributions from the different groups identified the useful structural and physicochemical aspects, which can help synthetic chemists to optimize molecules.

PMID:42258020 | DOI:10.1007/s10822-026-00854-x

By Nevin Manimala