Categories
Nevin Manimala Statistics

imputomics: web server and R package for missing values imputation in metabolomics data

Bioinformatics. 2024 Feb 20:btae098. doi: 10.1093/bioinformatics/btae098. Online ahead of print.

ABSTRACT

MOTIVATION: Missing values are commonly observed in metabolomics data from mass spectrometry (MS). Imputing them is crucial because it assures data completeness, increases the statistical power of analyses, prevents inaccurate results, and improves the quality of exploratory analysis, statistical modeling, and machine learning. Numerous Missing Value Imputation Algorithms (MVIAs) employ heuristics or statistical models to replace missing information with estimates. In the context of metabolomics data, we identified 52 MVIAs implemented across 70 R functions. Nevertheless, the usage of those 52 established methods poses challenges due to package dependency issues, lack of documentation and their instability.

RESULTS: Our R package, imputomics, provides a convenient wrapper around 41 (plus random imputation as a baseline model) out of 52 MVIAs in the form of a command-line tool and a web application. In addition, we propose a novel functionality for selecting MVIAs recommended for metabolomics data with the best performance or execution time.

AVAILABILITY: imputomics is freely available as an R package (github.com/BioGenies/imputomics) and a Shiny web application (biogenies.info/imputomics-ws). The documentation is available at biogenies.info/imputomics.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

PMID:38377398 | DOI:10.1093/bioinformatics/btae098

By Nevin Manimala

Portfolio Website for Nevin Manimala