Categories
Nevin Manimala Statistics

Automating population dose survey processing-An Australian feasibility study

Med Phys. 2024 Jul 9. doi: 10.1002/mp.17289. Online ahead of print.

ABSTRACT

BACKGROUND: A comprehensive collection of data on doses in adult computed tomography procedures in Australia has not been undertaken for some time. This is largely due to the effort involved in collecting the data required for calculating the population dose. This data collection effort can be greatly reduced, and the coverage increased, if the process can be automated without major changes to the workflow of the imaging facilities providing the data. Success would provide a tool to determine a truly national assessment of the dose incurred through diagnostic imaging in Australia.

PURPOSE: The aims of this study were to develop an automated tool to categorize electronic records of imaging procedures into a standardized set of broad procedure types, to validate the tool by applying it to data collected from nine facilities, and to assess the feasibility of applying the automated tool to compute population dose and determine the data manipulations required.

METHODS: A rule-based classifier was implemented capitalizing on semantic and clinical rules. The keyword list was initially built from 609 unique study descriptions. It was then refined using an additional 414 unique study descriptions. The classifier was then tested on an additional 1198 unique study descriptions. Input from a radiologist provided the ground truth for the refinement of the classifier.

RESULTS: From a sample of 238 139 studies containing 2794 unique study descriptions, the classifier correctly classified 2789 study types with only five misclassifications, demonstrating the feasibility of automating the process and the need for data pre-processing. Dose statistics for 21 categories were compiled using the 238 139 studies.

CONCLUSION: The classifier achieved excellent classification results using the testing data supplied by the facilities. However, since all data supplied were from public facilities, the performance of the classifier may be biased. The performance of the classifier is yet to be tested on a more representative mix of private and public facilities.

PMID:38981056 | DOI:10.1002/mp.17289

By Nevin Manimala

Portfolio Website for Nevin Manimala