JAMA Netw Open. 2026 May 1;9(5):e2614909. doi: 10.1001/jamanetworkopen.2026.14909.
ABSTRACT
IMPORTANCE: Surveillance of postacute sequelae of SARS-CoV-2 infection (PASC) depends on diagnostic coding systems that capture fewer than one-half of affected individuals, rendering millions invisible to health systems and policymakers.
OBJECTIVE: To quantify the gap between true PASC burden and diagnostic code-based estimates, determine the proportion representing chronic disease, and characterize organ system heterogeneity and temporal trends across diverse populations.
DESIGN, SETTING, AND PARTICIPANTS: This retrospective cohort study used electronic health record data from 58 hospitals and affiliated clinics in 4 US regions, from 2017 to 2025. Adults (aged ≥18 years) with laboratory-confirmed SARS-CoV-2 infection or a COVID-19 diagnosis code were included. A custom artificial intelligence algorithm, the Precision Phenotyping for Research Cohorts (P2RC), was implemented using federated infrastructure.
EXPOSURE: Laboratory-confirmed SARS-CoV-2 infection or COVID-19 diagnosis code.
MAIN OUTCOMES AND MEASURES: The primary outcomes were PASC prevalence, the proportion classified as chronic conditions, organ system distribution, and temporal trends from 2020 to 2024. χ2 Tests were used to assess organ system heterogeneity across regions, and negative binomial regression was used to model quarterly temporal trends, yielding incidence rate ratios (IRRs) with 95% CIs.
RESULTS: In this cohort study of 457 950 COVID-19 cases (mean age, 52.05 years; 275 107 [60.07%] female), the P2RC algorithm identified 74 560 PASC cases (16.28% overall; 28 585 [18.58%] in New England, 978 [19.55%] in Southeast Texas, 10 534 [22.69%] in Southern California, and 34 463 [13.64%] in Western Pennsylvania), more than 2-fold higher than the proportion identified by code-based surveillance (<7%). Of 883 International Statistical Classification of Diseases, Tenth Revision, Clinical Modification codes associated with PASC, 594 (67.27%) represented chronic or potentially chronic conditions. Of 74 560 patients with PASC, 66 587 (89.31%) developed chronic conditions requiring ongoing clinical management; this represents 14.54% of the total number of 457 950 patients with COVID-19. Substantial organ system heterogeneity was observed (χ2 = 2504.73; P < .001): New England demonstrated thyroid-predominant endocrine patterns, while Southeast Texas, Southern California, and Western Pennsylvania showed metabolic-predominant profiles. Negative binomial regression revealed increasing PASC prevalence through mid-2024 (IRR per quarter, 1.01 [95% CI, 1.00-1.01; P < .001] in New England; 1.00 [95% CI, 1.00-1.01; P < .001] in Southern California; and 1.02 [95% CI, 1.01-1.02; P < .001] in Western Pennsylvania), indicating an accumulating rather than resolving burden.
CONCLUSIONS AND RELEVANCE: In this cohort study, approximately 1 in 6 patients with COVID-19 developed PASC, and 89.31% of these patients had at least 1 chronic condition. Current diagnostic coding captured fewer than one-half of the cases, obscuring a substantial chronic disease burden. The persistently increasing prevalence through 2024 indicated an accumulating health care burden requiring investment in surveillance infrastructure and integrated care pathways.
PMID:42201733 | DOI:10.1001/jamanetworkopen.2026.14909