
Predicting disease-specific histone modifications and functional effects of non-coding variants by leveraging DNA language models

Genome Biol. 2026 Feb 14. doi: 10.1186/s13059-026-04003-3. Online ahead of print.

ABSTRACT

BACKGROUND: Epigenetic modifications play a vital role in the pathogenesis of human diseases, particularly neurodegenerative disorders such as Alzheimer’s disease, where dysregulated histone modifications are strongly implicated in disease mechanisms. While recent advances underscore the importance of accurately identifying these modifications to elucidate their contribution to Alzheimer’s disease pathology, existing computational methods remain limited by their generic approaches that overlook disease-specific epigenetic signatures.

RESULTS: To bridge this gap, we develop a novel large language model-based deep learning framework tailored for disease-contextual prediction of histone modifications and variant effects. Focusing on Alzheimer’s disease as a case study, we integrate epigenomic data from multiple patient samples to construct a comprehensive, disease-specific histone modification dataset, enabling our model to learn Alzheimer’s disease-associated molecular signatures. A key innovation of our approach is the incorporation of a Mixture of Experts architecture, which effectively distinguishes between disease and healthy epigenetic states, allowing for precise identification of Alzheimer’s disease-relevant epigenetic modification patterns. Our model demonstrates robust performance in disease-specific histone modification prediction, significantly outperforming existing state-of-the-art methods that lack disease context. Beyond accurate modification site prediction, our framework provides important biological insights by successfully prioritizing Alzheimer’s disease-associated genetic variants, which show significant enrichment in disease-relevant pathways.
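
The abstract does not say how the Mixture of Experts layer is built, so the following is only a generic illustration of the idea and not the authors' architecture. It assumes a soft gate: a softmax over per-expert scores weights the outputs of two toy experts. Every name here (`moe_predict`, `disease_expert`, `healthy_expert`, the feature vector, and the gate weights) is hypothetical and chosen for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def moe_predict(features, gate_weights, experts):
    """Soft Mixture of Experts: gate-weighted sum of expert outputs.

    gate_weights holds one weight vector per expert; the gate score for an
    expert is the dot product of its weight vector with the features.
    """
    gate_scores = [
        sum(w * x for w, x in zip(weights, features)) for weights in gate_weights
    ]
    gate_probs = softmax(gate_scores)
    expert_outputs = [expert(features) for expert in experts]
    mixed = sum(p * o for p, o in zip(gate_probs, expert_outputs))
    return mixed, gate_probs


# Hypothetical toy experts; a real model would use learned sub-networks
# operating on DNA sequence embeddings.
def disease_expert(x):
    return sigmoid(2.0 * x[0] - 1.0)


def healthy_expert(x):
    return sigmoid(-1.0 * x[0] + 0.5)


features = [0.8, 0.2]                      # assumed 2-dimensional input
gate_weights = [[1.0, 0.0], [0.0, 1.0]]    # assumed gate parameters
mixed, probs = moe_predict(features, gate_weights, [disease_expert, healthy_expert])
print(mixed, probs)
```

In this sketch the gate favors the first expert because its score (0.8) is higher than the second's (0.2), and the combined prediction always lies between the two expert outputs. Sparse top-k routing is a common alternative to the soft gate shown here.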

CONCLUSIONS: Our framework establishes a powerful new paradigm for epigenetic research that can be extended to other complex diseases, offering both a valuable tool for variant effect interpretation and a promising strategy for uncovering novel disease mechanisms through epigenetic profiling.

PMID:41691336 | DOI:10.1186/s13059-026-04003-3

By Nevin Manimala
