Categories
Nevin Manimala Statistics

Developing Sampling Weights for Statistical Analysis of Parent-Child Pair Data From the National Health Interview Survey

Vital Health Stat 1. 2024 Apr;(207):1-31.

ABSTRACT

The National Health Interview Survey (NHIS), conducted by the National Center for Health Statistics since 1957, is the principal source of information on the health of the U.S. civilian noninstitutionalized population. NHIS selects one adult (Sample Adult) and, when applicable, one child (Sample Child) randomly within a family (through 2018) or a household (2019 and forward). Sampling weights for the separate analysis of data from Sample Adults and Sample Children are provided annually by the National Center for Health Statistics. A growing interest in analysis of parent-child pair data using NHIS has been observed, which necessitated the development of appropriate analytic weights. Objective This report explains how dyad weights were created such that data users can analyze NHIS data from both Sample Children and their mothers or fathers, respectively. Methods Using data from the 2019 NHIS, adult-child pair-level sampling weights were developed by combining each pair’s conditional selection probability with their household-level sampling weight. The calculated pair weights were then adjusted for pair-level nonresponse, and large sampling weights were trimmed at the 99th percentile of the derived sampling weights. Examples of analyzing parent-child pair data by means of domain estimation methods (that is, statistical analysis for subpopulations or subgroups) are included in this report. Conclusions The National Center for Health Statistics has created dyad or pair weights that can be used for studies using parent-child pairs in NHIS. This method could potentially be adapted to other surveys with similar sampling design and statistical needs.

PMID:38630839

Categories
Nevin Manimala Statistics

Encoding surprise by retinal ganglion cells

PLoS Comput Biol. 2024 Apr 17;20(4):e1011965. doi: 10.1371/journal.pcbi.1011965. Online ahead of print.

ABSTRACT

The efficient coding hypothesis posits that early sensory neurons transmit maximal information about sensory stimuli, given internal constraints. A central prediction of this theory is that neurons should preferentially encode stimuli that are most surprising. Previous studies suggest this may be the case in early visual areas, where many neurons respond strongly to rare or surprising stimuli. For example, previous research showed that when presented with a rhythmic sequence of full-field flashes, many retinal ganglion cells (RGCs) respond strongly at the instance the flash sequence stops, and when another flash would be expected. This phenomenon is called the ‘omitted stimulus response’. However, it is not known whether the responses of these cells varies in a graded way depending on the level of stimulus surprise. To investigate this, we presented retinal neurons with extended sequences of stochastic flashes. With this stimulus, the surprise associated with a particular flash/silence, could be quantified analytically, and varied in a graded manner depending on the previous sequences of flashes and silences. Interestingly, we found that RGC responses could be well explained by a simple normative model, which described how they optimally combined their prior expectations and recent stimulus history, so as to encode surprise. Further, much of the diversity in RGC responses could be explained by the model, due to the different prior expectations that different neurons had about the stimulus statistics. These results suggest that even as early as the retina many cells encode surprise, relative to their own, internally generated expectations.

PMID:38630835 | DOI:10.1371/journal.pcbi.1011965

Categories
Nevin Manimala Statistics

Decoding triancestral origins, archaic introgression, and natural selection in the Japanese population by whole-genome sequencing

Sci Adv. 2024 Apr 19;10(16):eadi8419. doi: 10.1126/sciadv.adi8419. Epub 2024 Apr 17.

ABSTRACT

We generated Japanese Encyclopedia of Whole-Genome/Exome Sequencing Library (JEWEL), a high-depth whole-genome sequencing dataset comprising 3256 individuals from across Japan. Analysis of JEWEL revealed genetic characteristics of the Japanese population that were not discernible using microarray data. First, rare variant-based analysis revealed an unprecedented fine-scale genetic structure. Together with population genetics analysis, the present-day Japanese can be decomposed into three ancestral components. Second, we identified unreported loss-of-function (LoF) variants and observed that for specific genes, LoF variants appeared to be restricted to a more limited set of transcripts than would be expected by chance, with PTPRD as a notable example. Third, we identified 44 archaic segments linked to complex traits, including a Denisovan-derived segment at NKX6-1 associated with type 2 diabetes. Most of these segments are specific to East Asians. Fourth, we identified candidate genetic loci under recent natural selection. Overall, our work provided insights into genetic characteristics of the Japanese population.

PMID:38630824 | DOI:10.1126/sciadv.adi8419

Categories
Nevin Manimala Statistics

CoVar: A generalizable machine learning approach to identify the coordinated regulators driving variational gene expression

PLoS Comput Biol. 2024 Apr 17;20(4):e1012016. doi: 10.1371/journal.pcbi.1012016. Online ahead of print.

ABSTRACT

Network inference is used to model transcriptional, signaling, and metabolic interactions among genes, proteins, and metabolites that identify biological pathways influencing disease pathogenesis. Advances in machine learning (ML)-based inference models exhibit the predictive capabilities of capturing latent patterns in genomic data. Such models are emerging as an alternative to the statistical models identifying causative factors driving complex diseases. We present CoVar, an ML-based framework that builds upon the properties of existing inference models, to find the central genes driving perturbed gene expression across biological states. Unlike differentially expressed genes (DEGs) that capture changes in individual gene expression across conditions, CoVar focuses on identifying variational genes that undergo changes in their expression network interaction profiles, providing insights into changes in the regulatory dynamics, such as in disease pathogenesis. Subsequently, it finds core genes from among the nearest neighbors of these variational genes, which are central to the variational activity and influence the coordinated regulatory processes underlying the observed changes in gene expression. Through the analysis of simulated as well as yeast expression data perturbed by the deletion of the mitochondrial genome, we show that CoVar captures the intrinsic variationality and modularity in the expression data, identifying key driver genes not found through existing differential analysis methodologies.

PMID:38630807 | DOI:10.1371/journal.pcbi.1012016

Categories
Nevin Manimala Statistics

Bayesian approach to assessing population differences in genetic risk of disease with application to prostate cancer

PLoS Genet. 2024 Apr 17;20(4):e1011212. doi: 10.1371/journal.pgen.1011212. eCollection 2024 Apr.

ABSTRACT

Population differences in risk of disease are common, but the potential genetic basis for these differences is not well understood. A standard approach is to compare genetic risk across populations by testing for mean differences in polygenic scores, but existing studies that use this approach do not account for statistical noise in effect estimates (i.e., the GWAS betas) that arise due to the finite sample size of GWAS training data. Here, we show using Bayesian polygenic score methods that the level of uncertainty in estimates of genetic risk differences across populations is highly dependent on the GWAS training sample size, the polygenicity (number of causal variants), and genetic distance (FST) between the populations considered. We derive a Wald test for formally assessing the difference in genetic risk across populations, which we show to have calibrated type 1 error rates under a simplified assumption that all SNPs are independent, which we achieve in practise using linkage disequilibrium (LD) pruning. We further provide closed-form expressions for assessing the uncertainty in estimates of relative genetic risk across populations under the special case of an infinitesimal genetic architecture. We suggest that for many complex traits and diseases, particularly those with more polygenic architectures, current GWAS sample sizes are insufficient to detect moderate differences in genetic risk across populations, though more substantial differences in relative genetic risk (relative risk > 1.5) can be detected. We show that conventional approaches that do not account for sampling error from the training sample, such as using a simple t-test, have very high type 1 error rates. When applying our approach to prostate cancer, we demonstrate a higher genetic risk in African Ancestry men, with lower risk in men of European followed by East Asian ancestry.

PMID:38630784 | DOI:10.1371/journal.pgen.1011212

Categories
Nevin Manimala Statistics

COVID-19 vaccine hesitancy among adults in Liberia, April-May 2021

PLoS One. 2024 Apr 17;19(4):e0297089. doi: 10.1371/journal.pone.0297089. eCollection 2024.

ABSTRACT

BACKGROUND: Vaccination is one of the most cost-effective public health interventions used to prevent diseases in susceptible populations. Despite the established efficacy of vaccines, there are many reasons people are hesitant about vaccination, and these reasons could be complex. This rapid survey estimated the prevalence of COVID-19 vaccine hesitancy and potentially contributing factors in Montserrado and Nimba counties in Liberia.

METHODS: A cross-sectional study was conducted among adults living in Liberia. The relationship between vaccine non-acceptance and sociodemographic characteristics was examined using chi-square statistics. The variables with a p-value less than 0.2 at the bivariate analysis were modelled in a binary logistic regression at a 5% level of significance. The adjusted odds ratio and 95% confidence interval are reported.

RESULTS: There were 877 participants in the study. Majority were 25-34 years of age (30.4%, 272/877), females (54.05%, 474/877), and Christians (85.2%, 747/877). Most of the participants were aware of the COVID-19 vaccine (75%, 656/877), single (41.4%, 363/877), self-employed (37.51%, 329/877), and live-in rural communities (56.1%, 492/877). Vaccine hesitancy was (29.1%, 255/877; 95% CI:26.2-32.2). Vaccine hesitancy was greater among adults living in urban areas (41%) compared to persons living in rural communities (59%) (aOR; 1.5, 95% CI: 1.1-2.1) and respondents aged 45-54 years (aOR:0.5; 95% CI: 0.2-0.9; p = 0.043) were 50% less likely to be hesitant to COVID-19 vaccination compared to those more than 55 years. The most common source of information was the media (53%, 492/877) and the main reason for being hesitant was a need for more information about the vaccine and its safety (84%, 215/255).

CONCLUSIONS: The majority of study participants were aware of the COVID-19 vaccines and their most common source of information was the media (television, radio). Vaccine hesitancy was moderate. This could pose a challenge to efforts to control the spread of the COVID-19 pandemic. Therefore, the health authorities should provide more health education on the importance of vaccines and their safety to the populace.

PMID:38630778 | DOI:10.1371/journal.pone.0297089

Categories
Nevin Manimala Statistics

Arabic translation and cultural adaptation of a training load and player monitoring in high-level football questionnaire: A cognitive interview study

PLoS One. 2024 Apr 17;19(4):e0302006. doi: 10.1371/journal.pone.0302006. eCollection 2024.

ABSTRACT

BACKGROUND: Understanding the current practice and the associated challenges in applying monitoring tools is essential to improving football performance in the Middle East, thus the purpose was to translate and culturally adapt a published questionnaire that assessed the practice and perception of High-Level football teams toward Training Load and Player Monitoring to be used in the Arabic context, aiming to contribute to the enhancement of football performance, player welfare, and training quality in the region.

METHOD: A total of 15 Arabic-speaking coaches (mean age 42.6 ± 9.9 years; mean experience 10.9 ± 5.7 years; 53.3% football coaches and 46.7% strength & conditioning coaches) were conveniently selected to participate in this study. The current research followed a systematic cross-cultural adaptation process, which included forward translation, synthesis, back-translation, expert panel review, and pre-testing through cognitive interviewing. Three rounds of cognitive interviews were conducted with the 15 participants. Descriptive statistics, including means with standard deviations and frequencies with percentages, were reported for the participants’ characteristics.

RESULT: With some minor linguistic modifications to the questionnaire by the expert committee (i.e., adjustments such as Sport Scientist to Sport Science Specialist), the instrument was translated and culturally adapted into Arabic. All participants confirmed that the resulting Arabic versions of the training load and player monitoring in high-level football questionnaires were appropriate and fully understandable for Arabic speakers in conveying the intended meanings of the items in each.

CONCLUSION: The training load and player monitoring in the high-level football questionnaire was successfully translated and culturally adapted into Arabic and are now ready for use in the Arabic context, offering an opportunity for comprehensive research and enabling tailored performance optimization strategies, which could ultimately lead to advancements in player development and welfare within Arabic-speaking football communities.

PMID:38630762 | DOI:10.1371/journal.pone.0302006

Categories
Nevin Manimala Statistics

Vitamin B12 deficiency and neuropsychiatric symptoms in Lebanon: A cross-sectional study of vegans, vegetarians, and omnivores

PLoS One. 2024 Apr 17;19(4):e0297976. doi: 10.1371/journal.pone.0297976. eCollection 2024.

ABSTRACT

BACKGROUND: Vitamin B12 deficiency is responsible for a variety of complications, particularly neurological/neuropsychiatric complications, including depression, irritability, paresthesia and insomnia. Since vitamin B12 is found in animal-derived products, vegans/vegetarians are at a greater risk for developing vitamin B12 deficiency.

AIMS: This study aims to investigate the occurrence of vitamin B12 deficiency among a sample of adult Lebanese population, with a particular emphasis on assessing the severity of its neurological/neuropsychiatric signs and symptoms, especially among vegans/vegetarians.

METHODOLOGY: A cross-sectional study was conducted among a sample of 483 Lebanese adults. Data was collected through a standardized questionnaire that included socio-demographic characteristics, the Patient Health Questionnaire-9 (PHQ-9), Generalized anxiety disorders-7 (GAD-7), and the Insomnia Severity Index (ISI) scales.

RESULTS: Among the participants, 11.4% were in the vegan/vegetarian group, and about 43.1% had vitamin B12 deficiency. After analyzing the PHQ-9, GAD-7 and ISI total scores, higher scores were reported in participants with vitamin B12 deficiency, compared to individuals with normal vitamin B12 serum levels (p < 0.001). Regarding the diet type, vegans/vegetarians were more susceptible to developing depression compared to omnivores (mean scores of 11.92 vs 8.02 on the PHQ-9 scale, respectively, with p < 0.001). Of the patients with vitamin B12 deficiency, 81.1% reported having paresthesia compared to 43.7% of individuals with no vitamin B12 deficiency (p < 0.001).

CONCLUSION: Vitamin B12 deficiency in Lebanon is notably high and is linked to an increased risk of developing depression, generalized anxiety disorder, insomnia, and paresthesia. Vegans/vegetarians exhibit a higher susceptibility to developing depression compared to omnivores, whereas the risk of developing insomnia, generalized anxiety disorder and paresthesia was statistically insignificant when comparing vegans/vegetarians to omnivores.

PMID:38630748 | DOI:10.1371/journal.pone.0297976

Categories
Nevin Manimala Statistics

Perinatal mortality in German dairy cattle: Unveiling the importance of cow-level risk factors and their interactions using a multifaceted modelling approach

PLoS One. 2024 Apr 17;19(4):e0302004. doi: 10.1371/journal.pone.0302004. eCollection 2024.

ABSTRACT

Perinatal mortality (PM) is a common issue on dairy farms, leading to calf losses and increased farming costs. The current knowledge about PM in dairy cattle is, however, limited and previous studies lack comparability. The topic has also primarily been studied in Holstein-Friesian cows and closely related breeds, while other dairy breeds have been largely ignored. Different data collection techniques, definitions of PM, studied variables and statistical approaches further limit the comparability and interpretation of previous studies. This article aims to investigate the factors contributing to PM in two underexplored breeds, Simmental (SIM) and Brown Swiss (BS), while comparing them to German Holstein on German farms, and to employ various modelling techniques to enhance comparability to other studies, and to determine if different statistical methods yield consistent results. A total of 133,942 calving records from 131,657 cows on 721 German farms were analyzed. Amongst these, the proportion of PM (defined as stillbirth or death up to 48 hours of age) was 6.1%. Univariable and multivariable mixed-effects logistic regressions, random forest and multimodel inference via brute-force model selection approaches were used to evaluate risk factors on the individual animal level. Although the balanced random forest did not incorporate the random effect, it yielded results similar to those of the mixed-effect model. The brute-force approach surpassed the widely adopted backwards variable selection method and represented a combination of strengths: it accounted for the random effect similar to mixed-effects regression and generated a variable importance plot similar to random forest. The difficulty of calving, breed and parity of the cow were found to be the most important factors, followed by farm size and season. Additionally, four significant interactions amongst predictors were identified: breed-calving ease, breed-season, parity-season and calving ease-farm size. The combination of factors, such as secondiparous SIM breed on small farms and experiencing easy calving in summer, showed the lowest probability of PM. Conversely, primiparous GH cows on large farms with difficult calving in winter exhibited the highest probability of PM. In order to reduce PM, appropriate management of dystocia, optimal heifer management and a wider use of SIM in dairy production are possible ways forward. It is also important that future studies are conducted to identify farm-specific contributors to higher PM on large farms.

PMID:38630747 | DOI:10.1371/journal.pone.0302004

Categories
Nevin Manimala Statistics

Perceptions of risk and coping strategies during the COVID-19 pandemic among women and older adults

PLoS One. 2024 Apr 17;19(4):e0301009. doi: 10.1371/journal.pone.0301009. eCollection 2024.

ABSTRACT

The world’s health, economic, and social systems have been adversely impacted by the COVID-19 pandemic. With lockdown measures being a common response strategy in most countries, many individuals were faced with financial and mental health challenges. The current study explored the effect of the COVID-19 pandemic on the psychological well-being, perception of risk factors and coping strategies of two vulnerable groups in Malaysia, namely women and older adults from low-income households (USD592). A purposive sample of 30 women and 30 older adults was interviewed via telephone during Malaysia’s Movement Control Order (MCO) regarding the challenges they faced throughout the pandemic. Thematic analysis was subsequently conducted to identify key themes. The themes identified from the thematic analysis indicated a degree of overlap between both groups. For women, seven themes emerged: 1) Psychological challenges due to COVID-19 pandemic, 2) Family violence, 3) Finance and employment related stress and anxiety, 4) Women’s inequality and prejudice, 5) Coping strategies, 6) Professional support, and 7) Women’s empowerment. Similarly, there were six themes for the older adults: 1) Adverse emotional experiences from COVID-19, 2) Threats to health security, 3) Loss of social connections, 4) Government aid to improve older adults’ psychological well-being, 5) Psychological support from family members and pets, and 6) Self-reliance, religion, and spirituality. The findings provide valuable information on the specific burdens faced by these groups, and support psychological interventions and mitigations that would be appropriate to improve well-being during the recovery phase.

PMID:38630742 | DOI:10.1371/journal.pone.0301009