Categories
Nevin Manimala Statistics

Robust statistical methods for high-dimensional data, with applications in tribology

Anal Chim Acta. 2023 Oct 23;1279:341762. doi: 10.1016/j.aca.2023.341762. Epub 2023 Sep 5.

ABSTRACT

Data sets derived from practical experiments often pose challenges for (robust) statistical methods. In high-dimensional data sets, more variables than observations are recorded and often, there are also data present that do not follow the structure of the data majority. In order to handle such data with outlying observations, a variety of robust regression and classification methods have been developed for low-dimensional data. The high-dimensional case, however, is more challenging, and the variety of robust methods is much more limited. The choice of the method depends on the specific data structure, and numerical problems are more likely to occur. We give an overview of selected robust methods as well as implementations and demonstrate the application with two high-dimensional data sets from tribology. We show that robust statistical methods combined with appropriate pre-processing and sampling strategies yield increased prediction performance and insight into data differing from the majority.

PMID:37827663 | DOI:10.1016/j.aca.2023.341762

By Nevin Manimala

Portfolio Website for Nevin Manimala