J Comput Aided Mol Des. 2025 Jul 13;39(1):48. doi: 10.1007/s10822-025-00629-w.
ABSTRACT
Identifying orthosteric binding sites and predicting small molecule affinities remains a key challenge in virtual screening. While blind docking explores the entire protein surface, its precision is hindered by the vast search space. Cavity detection-guided docking improves accuracy by narrowing focus to predicted pockets, but its effectiveness depends heavily on the quality of cavity detection tools. To overcome these limitations, we developed Consensus Blind Dock (CoBDock), a machine learning-based blind docking method that integrates molecular docking and cavity detection results to enhance binding site and pose prediction. Building on this, CoBDock-2 replaces traditional docking tools by extracting 1D numerical representations from protein, ligand, and interaction structural features, and applying advanced ensemble feature selection techniques. By evaluating 21 feature selection methods across 9,598 features, CoBDock-2 identifies key molecular characteristics of orthosteric binding sites. CoBDock-2 demonstrates consistent improvements over the original CoBDock across benchmark datasets (PDBBind v2020-general, MTi, ADS, DUD-E, CASF-2016), achieving 77% binding site identification accuracy (within 8 Å), 55% ligand pose prediction accuracy (RMSD ≤ 2 Å), a 19% reduction in the mean distance to ground truth ligands within the binding site, and an 18.5% decrease in the mean pose RMSD. Statistical analysis across the combined benchmark set confirms the significance of these improvements ( p < 0.05 ). Notably, the Weighted Hybrid Feature Selection variant in CoBDock-2 further increases binding site accuracy to 79.8%, demonstrating the benefit of combining multimodel and ensemble feature selection strategies. Variability in predictions also decreased significantly, highlighting enhanced reliability and generalizability. Also, a low-bias hypothetical comparison with a state-of-the-art DiffDock + NMDN method was conducted to position CoBDock-2 relative to modern deep learning-based docking strategies.
PMID:40652425 | DOI:10.1007/s10822-025-00629-w