Categories
Nevin Manimala Statistics

Machine learning and advanced statistical modeling can identify key quality management practices that affect post-pasteurization contamination of fluid milk

J Food Prot. 2021 Mar 26. doi: 10.4315/JFP-20-431. Online ahead of print.

ABSTRACT

Spoilage of HTST- (high-temperature, short-time) and vat- pasteurized fluid milk due to introduction of Gram-negative bacteria post-pasteurization remains a challenge for the dairy industry. While processing facility level practices (e.g., sanitation practices) are known to impact the frequency of post-pasteurization contamination (PPC), the relative importance of different practices is not well defined, affecting the ability of facilities to select intervention targets that reduce PPC and provide the greatest return on investment. Thus, the goal of this study was to use an existing longitudinal dataset of bacterial spoilage indicators obtained for pasteurized fluid milk samples collected from 23 processing facilities between July 2015 and November 2017 (with 3 to 5 samplings per facility) and data from a survey on fluid milk quality management practices, to identify factors associated with PPC and rank their relative importance, using two separate approaches: (i) multimodel inference and (ii) conditional random forest. Data pre-processing for multimodel inference analysis showed (i) nearly all factors were significantly associated with PPC when assessed individually using univariable logistic regression and (ii) numerous pairs of factors were strongly associated with each other (Cramer’s V ³0.80). Multimodel inference and conditional random forest analyses identified similar drivers associated with PPC; factors identified as most important based on these analyses included cleaning and sanitation practices, activities related to good manufacturing practices, container type (which is a proxy for different filling equipment), in-house finished product testing, and designation of a quality department, indicating potential targets for reducing PPC. In addition, this study illustrates how machine learning approaches can be used with highly correlated and unbalanced data, as typical for food safety and quality, to facilitate improved data analyses and decision-making.

PMID:33770185 | DOI:10.4315/JFP-20-431

By Nevin Manimala

Portfolio Website for Nevin Manimala