CLCNet: a contrastive learning and chromosome-aware network for genomic prediction in plants

Brief Bioinform. 2026 May 4;27(3):bbag270. doi: 10.1093/bib/bbag270.

ABSTRACT

Genomic selection leverages genome-wide markers and phenotypes to predict breeding values, and its effectiveness depends largely on the accuracy of genomic prediction (GP) models. However, GP methods often struggle to capture inter-individual variability and are limited by the curse of dimensionality, where the number of single-nucleotide polymorphisms (SNPs) far exceeds the sample size. To address these challenges, we present CLCNet (Contrastive Learning and Chromosome-aware Network), a novel deep learning framework that integrates contrastive learning and chromosome-aware feature modeling. CLCNet comprises two key components: (i) a contrastive learning module that enhances the model’s ability to capture fine-grained, genotype-dependent phenotypic differences among individuals, and (ii) a chromosome-aware module that performs structured feature selection at both the chromosome and genome levels, thereby distilling the most informative SNPs. We evaluated CLCNet across 4 crop species, covering 10 agronomically important traits, and compared it with a diverse set of classical linear, machine learning, and deep learning models. CLCNet achieved superior prediction performance, with statistically significant improvements in Pearson correlation coefficient ranging from 0.34% to 12.19% over baseline models, together with reduced mean squared error. Performance gains were more pronounced for traits with moderate linkage disequilibrium (LD; r² = 0.21–0.36) and high heritability (h² > 0.66), such as those in maize, rapeseed, and soybean. For cotton traits characterized by high LD (r² = 0.74) and lower heritability (h² < 0.50), CLCNet maintained robust performance without degradation. Overall, these results demonstrate that CLCNet is an effective framework for improving genomic prediction accuracy and holds strong potential for practical applications in plant breeding.
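The abstract reports results in terms of Pearson correlation coefficient, mean squared error, and LD r². As a rough illustration of what these quantities measure, the sketch below computes them in plain Python. It is not the CLCNet evaluation code: the function names are hypothetical, the abstract does not describe the cross-validation protocol, and estimating LD as the squared correlation of 0/1/2 genotype dosages is an assumed, commonly used approximation for unphased data.

```python
import math


def pearson_r(y_true, y_pred):
    """Pearson correlation between observed and predicted phenotypes."""
    n = len(y_true)
    if n != len(y_pred) or n < 2:
        raise ValueError("inputs must have equal length of at least 2")
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    if var_t == 0 or var_p == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_t * var_p)


def mean_squared_error(y_true, y_pred):
    """Average squared difference between observed and predicted values."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def ld_r2(snp_a, snp_b):
    """Approximate LD r² between two SNPs from genotype dosages (0/1/2).

    Assumption: squared dosage correlation stands in for haplotype-based r².
    """
    return pearson_r(snp_a, snp_b) ** 2


if __name__ == "__main__":
    # Toy values for illustration only; not data from the paper.
    observed = [1.0, 2.0, 3.0, 4.0]
    predicted = [1.1, 1.9, 3.2, 3.8]
    print("PCC:", pearson_r(observed, predicted))
    print("MSE:", mean_squared_error(observed, predicted))
    print("LD r2:", ld_r2([0, 1, 2, 0], [0, 1, 2, 1]))
```

In a typical GP benchmark these metrics would be computed on held-out individuals for each trait and then compared across models; how the reported percentage improvements were derived from them is not specified in the abstract.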

PMID:42218721 | DOI:10.1093/bib/bbag270

By Nevin Manimala
