Categories
Nevin Manimala Statistics

Biomarker discovery studies for patient stratification using machine learning analysis of omics data: a scoping review

BMJ Open. 2021 Dec 6;11(12):e053674. doi: 10.1136/bmjopen-2021-053674.

ABSTRACT

OBJECTIVE: To review biomarker discovery studies using omics data for patient stratification which led to clinically validated FDA-cleared tests or laboratory developed tests, in order to identify common characteristics and derive recommendations for future biomarker projects.

DESIGN: Scoping review.

METHODS: We searched PubMed, EMBASE and Web of Science to obtain a comprehensive list of articles from the biomedical literature published between January 2000 and July 2021, describing clinically validated biomarker signatures for patient stratification, derived using statistical learning approaches. All documents were screened to retain only peer-reviewed research articles, review articles or opinion articles, covering supervised and unsupervised machine learning applications for omics-based patient stratification. Two reviewers independently confirmed eligibility. Disagreements were resolved by consensus. We focused the final analysis on omics-based biomarkers which achieved the highest level of validation, that is, clinical approval of the developed molecular signature as a laboratory developed test or an FDA-approved test.

RESULTS: Overall, 352 articles fulfilled the eligibility criteria. The analysis of validated biomarker signatures identified multiple common methodological and practical features that may explain the successful test development and guide future biomarker projects. These include study design choices to ensure sufficient statistical power for model building and external testing, suitable combinations of non-targeted and targeted measurement technologies, the integration of prior biological knowledge, strict filtering and inclusion/exclusion criteria, and the adequacy of statistical and machine learning methods for discovery and validation.

CONCLUSIONS: While most clinically validated biomarker models derived from omics data have been developed for personalised oncology, first applications for non-cancer diseases show the potential of multivariate omics biomarker design for other complex disorders. Distinctive characteristics of prior success stories, such as early filtering and robust discovery approaches, continuous improvements in assay design and experimental measurement technology, and rigorous multicohort validation approaches, enable the derivation of specific recommendations for future studies.

PMID:34873011 | DOI:10.1136/bmjopen-2021-053674

Causes of infective endocarditis in the Western Cape, South Africa: a prospective cohort study using a set protocol for organism detection and central decision making by an endocarditis team

BMJ Open. 2021 Dec 6;11(12):e053169. doi: 10.1136/bmjopen-2021-053169.

ABSTRACT

BACKGROUND: Blood culture negative infective endocarditis (BCNIE) poses both a diagnostic and therapeutic challenge. High rates of BCNIE reported in South Africa have been attributed to antibiotic use prior to blood culture sampling.

OBJECTIVES: To assess the impact of a systematic approach to organism detection and identify the causes of infective endocarditis (IE), in particular causes of BCNIE.

DESIGN: Prospective cohort study.

METHODS: The Tygerberg Endocarditis Cohort study prospectively enrolled patients with IE between November 2019 and February 2021. A set protocol for organism detection with management of patients by an endocarditis team was employed. This prospective cohort was compared with a retrospective cohort of patients with IE admitted between January 2017 and December 2018.

RESULTS: One hundred and forty patients with IE were included, with 75 and 65 patients in the retrospective and prospective cohorts, respectively. Baseline demographic characteristics were similar with a mean age of 39.6 years and male predominance (male sex=67.1%). The rate of BCNIE was lower in the prospective group (28/65 or 43.1%) compared with the retrospective group (47/75 or 62.7%; p=0.039). The BCNIE in-hospital mortality rate in the retrospective cohort was 23.4% compared with 14.2% in the prospective cohort (p=0.35). A cause was identified (including non-culture techniques) in 86.2% of patients in the prospective cohort, with Staphylococcus aureus (26.2%), Bartonella species (20%) and the viridans streptococci (15.3%) being most common.

CONCLUSION: The introduction of a set protocol for organism detection, managed by an endocarditis team, has identified Staphylococcus aureus as the most common cause of IE and identified non-culturable organisms, in particular Bartonella quintana, as an important cause of BCNIE. A reduction in in-hospital mortality in patients with BCNIE was observed, but did not reach statistical significance.

PMID:34873007 | DOI:10.1136/bmjopen-2021-053169
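
As an illustration of the BCNIE rate comparison reported in the RESULTS above (28/65 in the prospective cohort versus 47/75 in the retrospective cohort), the sketch below runs a Pearson chi-square test on the 2x2 table using only the Python standard library. The abstract does not state which test produced p=0.039, so the uncorrected chi-square test used here is an assumption, and the p value it yields need not match the published one. The function name is illustrative.

```python
import math

def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    chi2 = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # With 1 degree of freedom, the upper-tail probability is erfc(sqrt(chi2 / 2)).
    p = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p

# Counts from the abstract: (BCNIE, other IE) in each cohort.
prospective = (28, 65 - 28)
retrospective = (47, 75 - 47)

chi2, p = chi_square_2x2(*prospective, *retrospective)
print(f"chi2 = {chi2:.2f}, p = {p:.3f}")
```

A continuity-corrected or exact test would give a somewhat larger p value on the same counts, which is one reason a recomputed figure can differ from a published one.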

Validity and reliability of the diagnostic codes for hypochondriasis and dysmorphophobia in the Swedish National Patient Register: a retrospective chart review

BMJ Open. 2021 Dec 6;11(12):e051853. doi: 10.1136/bmjopen-2021-051853.

ABSTRACT

OBJECTIVES: In the International Classification of Diseases, Tenth Edition (ICD-10), hypochondriasis (illness anxiety disorder) and dysmorphophobia (body dysmorphic disorder) share the same diagnostic code (F45.2). However, the Swedish ICD-10 allows for these disorders to be coded separately (F45.2 and F45.2A, respectively), potentially offering unique opportunities for register-based research on these conditions. We assessed the validity and reliability of their ICD-10 codes in the Swedish National Patient Register (NPR).

DESIGN: Retrospective chart review.

METHODS: Six hundred individuals with a diagnosis of hypochondriasis or dysmorphophobia (300 each) were randomly selected from the NPR. Their medical files were requested from the corresponding clinics, located anywhere in Sweden. Two independent raters assessed each file according to ICD-10 definitions and Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision and Fifth Edition criteria. Raters also completed the Clinical Global Impression-Severity (CGI-S) and the Global Assessment of Functioning (GAF).

PRIMARY OUTCOME MEASURE: Per cent between-rater agreement and positive predictive value (PPV). Intraclass correlation coefficients for the CGI-S and the GAF.

RESULTS: Eighty-four hypochondriasis and 122 dysmorphophobia files were received and analysed. The inter-rater agreement rate regarding the presence or absence of a diagnosis was 95.2% for hypochondriasis and 92.6% for dysmorphophobia. Sixty-seven hypochondriasis files (79.8%) and 111 dysmorphophobia files (91.0%) were considered ‘true positive’ cases (PPV=0.80 and PPV=0.91, respectively). CGI-S scores indicated that symptoms were moderately to markedly severe, while GAF scores suggested moderate impairment for hypochondriasis cases and moderate to serious impairment for dysmorphophobia cases. CGI-S and GAF inter-rater agreement were good for hypochondriasis and moderate for dysmorphophobia.

CONCLUSIONS: The Swedish ICD-10 codes for hypochondriasis and dysmorphophobia are sufficiently valid and reliable for register-based studies. The results of such studies should be interpreted in the context of a possible over-representation of severe and highly impaired cases in the register, particularly for dysmorphophobia.

PMID:34873001 | DOI:10.1136/bmjopen-2021-051853
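
The positive predictive values in the RESULTS above follow directly from the reported counts (67 of 84 and 111 of 122 files). The sketch below recomputes them and adds an approximate 95% Wilson score interval. The interval is not reported in the abstract and is included only to show how the precision of such an estimate could be gauged; the function name and the choice of the Wilson method are assumptions.

```python
import math

def ppv_with_wilson_ci(true_pos, total, z=1.96):
    """Return the PPV and an approximate 95% Wilson score interval."""
    p = true_pos / total
    denom = 1 + z**2 / total
    centre = (p + z**2 / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z**2 / (4 * total**2)) / denom
    return p, centre - half, centre + half

# Counts from the abstract: files judged 'true positive' out of files analysed.
for label, tp, n in [("hypochondriasis", 67, 84), ("dysmorphophobia", 111, 122)]:
    ppv, low, high = ppv_with_wilson_ci(tp, n)
    print(f"{label}: PPV = {ppv:.2f} (approx. 95% CI {low:.2f} to {high:.2f})")
```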

Relationship between renal function and prognosis of Chinese proliferative diabetic retinopathy patients undergoing the first vitrectomy: protocol for a prospective cohort study

BMJ Open. 2021 Dec 6;11(12):e052417. doi: 10.1136/bmjopen-2021-052417.

ABSTRACT

INTRODUCTION: China had the largest number of adults with diabetes aged 20-79 years (116.4 million) in 2019. Owing to socioeconomic conditions or a lack of awareness of diabetic complications, many adults with diabetes have proliferative diabetic retinopathy (PDR) or renal function impairment at their first visit to the clinic for a sudden loss of vision, and pars plana vitrectomy (PPV) is required for their treatment. Risk factors for the outcomes and complications of PPV surgery in PDR patients have been widely explored in many epidemiological studies and clinical trials. However, few prospective studies have analysed the association between renal function and surgical outcomes in PDR.

METHODS AND ANALYSIS: This is a single-centre, prospective cohort study of PDR patients with type 2 diabetes mellitus who have definite indications for PPV surgery with or without renal function impairment. We will consecutively enrol PDR patients who meet the inclusion and exclusion criteria from November 2020 to December 2023. Each participant will be followed up for at least 6 months after surgery. Clinical data from medical records and vitreous fluid will be collected. Demographic characteristics and study outcomes will be summarised using descriptive statistics. The variation will be described and evaluated using the χ² test or Kruskal-Wallis test. Generalised additive mixed models will be used to explore the association between the renal profile and surgical outcomes including best-corrected visual acuity (BCVA), and retinal and choroidal microvasculature/microstructure. Multivariate ordinal regression analysis will be used to detect the independent association between renal profile and BCVA changes, and smooth curve fitting will be employed to briefly present the tendency.

ETHICS AND DISSEMINATION: The trial has received ethical approval from the West China Hospital of Sichuan University. Results of this trial will be disseminated through publication in peer-reviewed journals and presentations at local and international meetings.

TRIAL REGISTRATION NUMBER: ChiCTR2000039698.

PMID:34873003 | DOI:10.1136/bmjopen-2021-052417

Patients’ choice of healthcare providers and predictors of modern healthcare utilisation in Bangladesh: Household Income and Expenditure Survey (HIES) 2016-2017 (BBS)

BMJ Open. 2021 Dec 6;11(12):e051434. doi: 10.1136/bmjopen-2021-051434.

ABSTRACT

OBJECTIVES: The number of modern healthcare providers in Bangladesh has increased and they are well equipped with modern medical instruments and infrastructure. Despite this development, patients continue to seek treatment from alternative healthcare providers. Hence, this study aims to determine the underlying predictors of patients’ choice of modern healthcare providers and health facilities for treatment.

SETTING: Data from the nationally representative Household Income and Expenditure Survey 2016-2017 conducted by the Bangladesh Bureau of Statistics were used.

PARTICIPANTS: 34 512 respondents sought treatment for their illnesses from different types of available healthcare providers.

PRIMARY AND SECONDARY OUTCOME MEASURE: Patients’ choice of healthcare providers (primary) and predictors of patients’ choice of modern healthcare providers (secondary).

RESULTS: The study found that 40% of the patients visit modern healthcare providers primarily on having symptoms of illness, and the remainder go to alternative healthcare providers. Living in an urban area (adjusted OR (AOR)=1.11, 95% CI 1.05 to 1.17, p<0.01) and a travel time of between 1 and 2 hours (AOR=1.11, 95% CI 1.00 to 1.22, p<0.05), compared with a travel time of less than 1 hour, were positively associated with utilisation of modern healthcare facilities for the first consultation. The statistical models show that the predisposing and need factors do not significantly impact patients’ choice of modern healthcare providers.

CONCLUSIONS: The distribution of modern healthcare providers should be even across the country to eliminate the rural-urban divide in modern healthcare utilisation. Enhancing the digital provision of modern healthcare services could reduce travel time, eliminate transportation costs and shorten waiting times for treatment by modern healthcare providers. Policymakers could consider introducing a national health insurance programme in Bangladesh as a potential policy instrument.

PMID:34873000 | DOI:10.1136/bmjopen-2021-051434
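
The adjusted odds ratios in the RESULTS above are reported with 95% confidence intervals and p-value thresholds. As a rough consistency check, the sketch below back-calculates a standard error on the log-odds scale from each interval and derives an approximate two-sided p value. It assumes a symmetric Wald interval on the log scale, which the abstract does not confirm, and it works from rounded published figures, so the results are approximate. The function name is illustrative.

```python
import math

def z_and_p_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.96):
    """Approximate Wald z and two-sided p from an odds ratio and its 95% CI."""
    # The CI width on the log scale equals 2 * z_crit * SE under the Wald assumption.
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    return se, z, p

# Values from the abstract.
_, z_urban, p_urban = z_and_p_from_or_ci(1.11, 1.05, 1.17)
_, z_travel, p_travel = z_and_p_from_or_ci(1.11, 1.00, 1.22)

print(f"urban residence: z = {z_urban:.2f}, p = {p_urban:.4f}")
print(f"travel time 1-2 h: z = {z_travel:.2f}, p = {p_travel:.4f}")
```

Under these assumptions both recomputed p values fall within the thresholds stated in the abstract (below 0.01 and below 0.05, respectively).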

Impact of the COVID-19 pandemic on ongoing health research: an ad hoc survey among investigators in Germany

BMJ Open. 2021 Dec 6;11(12):e049086. doi: 10.1136/bmjopen-2021-049086.

ABSTRACT

OBJECTIVES: To gain insights into the impact of the COVID-19 pandemic on ongoing health research projects, using projects from a selected funding programme in Germany as an example.

DESIGN: Online survey and validation workshop.

SETTING: Lockdowns and social distancing policies impact on clinical and public health research in various forms, especially if unrelated to COVID-19. Research institutions have reduced onsite activities, data are often collected remotely, and during the height of the crisis, clinical researchers were partially forced to abandon their projects in favour of front-line care.

PARTICIPANTS: Survey: 120 investigators of health research projects across Germany, performed between 15 and 25 May 2020. Workshop: 32 investigators, performed on 28 May 2020.

RESULTS: The response rate (78%) showed that the survey generated significant interest among investigators. Eighty-five responses were included for analysis. The majority of investigators (93%) reported that their projects were affected by the pandemic, with many stating that data collection was not possible as planned (80%) and that they could not carry out interventions as intended (67%). Other impacts were caused by staff being unavailable, for example, through child or elder care commitments or because of COVID-19 quarantine or illness. Investigators also reported that publications were delayed or not feasible at all (56%), and some experienced problems with PhD or Master's theses (18%). The majority of investigators had mitigation strategies in place, such as adjustment of data collection methods using digital tools (46%) or of project implementation in general (46%); others made changes in research design or research questions (27%).

CONCLUSIONS: The COVID-19 pandemic has severely affected health research projects. The main challenge is now to mitigate negative effects and to improve long-term resilience in health research. The pandemic has also acted as a driver of innovation and change, for example, by accelerating the use of digital methods.

PMID:34872995 | DOI:10.1136/bmjopen-2021-049086

Effective and scalable single-cell data alignment with non-linear canonical correlation analysis

Nucleic Acids Res. 2021 Dec 6:gkab1147. doi: 10.1093/nar/gkab1147. Online ahead of print.

ABSTRACT

Data alignment is one of the first key steps in single cell analysis for integrating multiple datasets and performing joint analysis across studies. Data alignment is challenging in extremely large datasets, however, as the majority of current single cell data alignment methods are not computationally efficient. Here, we present VIPCCA, a computational framework based on non-linear canonical correlation analysis for effective and scalable single cell data alignment. VIPCCA leverages both deep learning for effective single cell data modeling and variational inference for scalable computation, thus enabling powerful data alignment across multiple samples, multiple data platforms, and multiple data types. VIPCCA is accurate for a range of alignment tasks including alignment between single cell RNAseq and ATACseq datasets and can easily accommodate millions of cells, thereby providing researchers with unique opportunities to tackle challenges emerging from large-scale single-cell atlases.

PMID:34871454 | DOI:10.1093/nar/gkab1147

DNA methylation variation along the cancer epigenome and the identification of novel epigenetic driver events

Nucleic Acids Res. 2021 Dec 6:gkab1167. doi: 10.1093/nar/gkab1167. Online ahead of print.

ABSTRACT

While large-scale studies applying various statistical approaches have identified hundreds of mutated driver genes across various cancer types, the contribution of epigenetic changes to cancer remains more enigmatic. This is partly due to the fact that certain regions of the cancer genome, due to their genomic and epigenomic properties, are more prone to dysregulated DNA methylation than others. Thus, it has been difficult to distinguish which promoter methylation changes are really driving carcinogenesis from those that are mostly just a reflection of their genomic location. By developing a novel method that corrects for epigenetic covariates, we reveal a small, concise set of potential epigenetic driver events. Interestingly, those changes suggest different modes of epigenetic carcinogenesis: first, we observe recurrent inactivation of known cancer genes across tumour types suggesting a higher convergence on common tumour suppressor pathways than previously anticipated. Second, in prostate cancer, a cancer type with few recurrently mutated genes, we demonstrate how the epigenome primes tumours towards higher tolerance of other aberrations.

PMID:34871444 | DOI:10.1093/nar/gkab1167

Spatial Variation in Australian Neonicotinoid Usage and Priorities for Resistance Monitoring

J Econ Entomol. 2021 Dec 6;114(6):2524-2533. doi: 10.1093/jee/toab192.

ABSTRACT

Australia is the third largest exporting country of cereals and a leader in other major commodity crops, yet little data exist on pesticide usage patterns in agriculture. This knowledge gap limits the management of off-target chemical impacts, such as the evolution of pesticide resistance. Here, for the first time, we quantify spatial patterns in neonicotinoid applications in Australia by coalescing land use data with sales and market research data contributed by agrichemical and agribusiness companies. An example application to resistance management is explored through the development of recommendations for the cosmopolitan crop pest, Myzus persicae (Sulzer) (Hemiptera: Aphididae), utilizing spatial statistical models. This novel dataset identified Australian neonicotinoid usage patterns, with most neonicotinoid products in Australia applied as cereal, canola, cotton and legume seed treatments and soil applications in sugarcane. Importantly, there were strong regional differences in pesticide applications, which will require regionally specific strategies to manage off-target impacts. Indeed, the estimated spatial grid of neonicotinoid usage demonstrated a statistically significant influence on the distribution of M. persicae neonicotinoid resistance, indicating off-target impacts are unevenly distributed in space. Future research on neonicotinoid usage will be supported by the spatial grids generated and made available through this study. Overall, neonicotinoid pesticides are widely relied upon throughout Australia’s plant production systems but will face increasing pressure from resistance evolution, emerging research on off-target impacts, and stricter regulatory pressures.

PMID:34871446 | DOI:10.1093/jee/toab192

Habitual Intake of Marine-derived n-3 Polyunsaturated Fatty Acids is Inversely Associated with a Cardiometabolic Inflammatory Profile in Yup’ik Alaska Native People

J Nutr. 2021 Dec 6:nxab412. doi: 10.1093/jn/nxab412. Online ahead of print.

ABSTRACT

BACKGROUND: The relationship between dietary n-3 PUFAs and the prevention of cardiometabolic diseases, including type 2 diabetes, is unresolved. Examination of the association between n-3 PUFAs and chronic low-grade inflammation in a population where many individuals have had an extremely high intake of marine mammals and fish throughout their lifespan may provide important clues regarding the impact of n-3 PUFAs on health.

OBJECTIVE: The aim of this study was to explore associations between concentrations of n-3 PUFAs resulting from habitual intake of natural food sources high in fish and marine mammals with immune biomarkers of metabolic inflammation and parameters of glucose regulation.

DESIGN: A total of 569 Yup’ik Alaska Native adults (18-87 years) were enrolled in this cross-sectional study between December 2016 and November 2019. The red blood cell (RBC) nitrogen isotope ratio (15N/14N, or NIR) was used as a validated measure of n-3 PUFA intake to select 165 participant samples from the first and fourth quartiles of n-3 PUFA intake. Outcomes included 38 pro- and anti-inflammatory cytokines and eight measures of glucose homeostasis associated with type 2 diabetes risk. These outcomes were evaluated for their association with direct measurements of EPA, DHA and arachidonic acid in RBCs.

ANALYSIS: Linear regression was used to detect significant relationships between cytokines and n-3 PUFAs, adiposity, and glucose-related variables.

RESULTS: DHA concentration in RBC membranes was inversely associated with IL-6 (β = -0.0066, P < 0.001); EPA was inversely associated with TNFα (β = -0.4925, P < 0.001); and the NIR was inversely associated with MCP-1 (β = -0.8345, P < 0.001) and IL-10 (β = -1.2868, P < 0.001).

CONCLUSIONS: Habitual intake of marine mammals and fish rich in n-3 PUFAs in this study population of Yup’ik Alaska Native adults is associated with reduced systemic inflammation, which may contribute to the low prevalence of diseases in which inflammation plays an important role.

PMID:34871429 | DOI:10.1093/jn/nxab412