Machine-learning models based on histological images from healthy donors identify imageQTLs and predict chronological age

Proc Natl Acad Sci U S A. 2025 Nov 18;122(46):e2423469122. doi: 10.1073/pnas.2423469122. Epub 2025 Nov 11.

ABSTRACT

Histological images offer a wealth of data. Mining these data holds significant potential for enhancing disease diagnosis and prognosis, though challenges remain, especially in noncancer contexts. In this study, we developed a statistical framework that links raw histological images and their derived features to the genotype, transcriptome, and chronological age of the samples. We first demonstrated an association between image features and genotypes, identifying 906 image quantitative trait loci (imageQTLs) significantly associated with image features. Next, we identified differentially expressed (DE) genes by stratifying samples into image-similar groups based on image features and performing DE comparisons between groups. Additionally, we developed a deep-learning model that accurately predicts gene expression in specific tissues from raw images and their features, highlighting gene sets associated with observed morphological changes. Finally, we constructed another deep-learning model to predict chronological age directly from raw images and their features, revealing relationships between age and tissue morphology, especially aspects derived from nucleus features. Both models are supported by a computational approach that greatly compresses gigapixel whole-slide images and extracts interpretable nucleus features, integrating both large-scale tissue morphology and smaller local structures. We have made all interpretable nucleus features, imageQTLs, DE genes, and deep-learning models available as online resources for further research.
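To make the imageQTL idea concrete, the sketch below tests one variant against one image feature by regressing the feature on genotype dosage (0, 1, or 2 copies of an allele). It is an illustrative assumption, not the authors' pipeline: the abstract does not state the model, covariates, or multiple-testing correction used. The function name `association_test`, the simulated data, and the normal approximation for the p-value are all hypothetical choices made for this example.

```python
import math
import random
from statistics import NormalDist, mean


def association_test(dosages, feature):
    """Regress an image feature on genotype dosage with simple OLS.

    Returns (slope, t_statistic, approximate two-sided p-value).
    Hypothetical helper for illustration; a real analysis would add
    covariates and correct for multiple testing.
    """
    n = len(dosages)
    if n != len(feature) or n < 3:
        raise ValueError("need two equal-length sequences with at least 3 samples")
    x_bar, y_bar = mean(dosages), mean(feature)
    sxx = sum((x - x_bar) ** 2 for x in dosages)
    if sxx == 0:
        raise ValueError("genotype has no variation across samples")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(dosages, feature))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    # Residual sum of squares gives the standard error of the slope.
    rss = sum((y - intercept - slope * x) ** 2 for x, y in zip(dosages, feature))
    se = math.sqrt(rss / (n - 2) / sxx)
    t_stat = slope / se if se > 0 else math.copysign(math.inf, slope)
    # Normal approximation to the t distribution; reasonable for large n.
    p_value = 2 * (1 - NormalDist().cdf(abs(t_stat)))
    return slope, t_stat, p_value


# Simulated data (assumption): 500 samples, true effect of 0.5 per allele.
random.seed(0)
dosages = [random.choice([0, 1, 2]) for _ in range(500)]
feature = [0.5 * g + random.gauss(0, 1) for g in dosages]

slope, t_stat, p_value = association_test(dosages, feature)
print(f"slope={slope:.3f}, t={t_stat:.2f}, p~{p_value:.2e}")
```

In a genome-wide setting this test would be repeated for every variant and image feature pair, which is why a significance threshold adjusted for the number of tests is needed before calling a locus an imageQTL.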

PMID:41218125 | DOI:10.1073/pnas.2423469122

By Nevin Manimala
