Large language model consensus substantially improves the cell type annotation accuracy for scRNA-seq data

Commun Biol. 2026 Jun 8. doi: 10.1038/s42003-026-10420-8. Online ahead of print.

ABSTRACT

The rapid expansion of single-cell RNA sequencing (scRNA-seq) has made accurate cell type annotation a critical bottleneck for biological discovery. Existing computational methods are often limited by reference data dependency, while emerging single Large Language Model (LLM) approaches are susceptible to model-specific biases and provide insufficient uncertainty quantification. To address these limitations, we introduce mLLMCelltype, a framework that harnesses collective intelligence-the emergent problem-solving capacity arising when multiple independent agents interact through structured deliberation to produce solutions exceeding individual capabilities-of multiple LLMs through an iterative deliberation process. Across 49 diverse datasets, our framework achieves a mean accuracy of 77.2%, a 15.7-percentage-point improvement over the best-performing single-LLM baseline (61.5%). The consensus mechanism demonstrates high robustness to noisy input and generalizes to datasets released after the LLMs’ training. By providing transparent reasoning chains and robust consensus-based confidence metrics, mLLMCelltype minimizes manual annotation effort and enables reliable interpretation of complex cellular landscapes. The framework is available as an open-source package and an accessible web server.

PMID:42260142 | DOI:10.1038/s42003-026-10420-8

By Nevin Manimala