Categories
Nevin Manimala Statistics

Fetal Echocardiographic Z-Score Pilot Project: Study Design and Impact of Gestational Age and Variable Type on Reproducibility of Measurements Within and Across Investigators

J Am Soc Echocardiogr. 2023 Jun 9:S0894-7317(23)00300-0. doi: 10.1016/j.echo.2023.05.010. Online ahead of print.

ABSTRACT

BACKGROUND: Fetal echocardiography is widely available but normative data are not robust. In this pilot study, we evaluated 1) the feasibility of pre-specified measurements in a normal fetal echocardiogram to inform study design and 2) measurement variability to assign thresholds of clinical significance and guide analyses in larger fetal echocardiography Z-score initiatives.

METHODS: Images from pre-defined gestational age groups (16-20 weeks; >20-24 weeks; >24-28 weeks; and >28-32 weeks) were retrospectively analyzed. Fetal echocardiography expert raters attended online group training then independently analyzed 73 fetal studies (18 per age group) in a fully crossed design of 53 variables; each observer repeated measures for 12 fetuses. Kruskal-Wallis tests were used to compare measurements across centers and age groups. Coefficients of variation (CoV) were calculated at the subject level for each measurement as the ratio of standard deviation (SD) to mean. Intraclass correlation coefficients (ICC) were used to show inter- and intrarater reliabilities. Cohen’s d>0.8 was used to define clinically important differences. Measurements were plotted against gestational age, biparietal diameter, and femur length.

RESULTS: Expert raters completed each set of measurements in a mean (SD) of 23(9) minutes per fetus. Missingness ranged from 0% to 29%. CoV was similar across age groups for all variables (P<0.05), except ductus arteriosus mean velocity and left ventricular ejection time which were both higher at older gestational age. CoV was >15% for right ventricular systolic and diastolic widths despite fair to good repeatability (ICC>0.5); ductal velocities and 2D measures, left ventricular short axis dimensions, and isovolumic times all had high CoV and high interobserver variability despite good to excellent intraobserver agreement (ICC>0.6). CoV did not improve when ratios (e.g. tricuspid:mitral annulus) were used instead of linear measurements. Overall, 27 variables had acceptable inter- and intraobserver repeatability, while 14 had excessive variability between readers despite good intraobserver agreement.

CONCLUSIONS: There is considerable variability in fetal echocardiographic quantification in clinical practice that may affect the design of multicenter fetal echocardiography Z-score studies, and not all measurements may be feasible for standard normalization. As missingness was substantial, a prospective design will be needed. Data from this pilot study may aid in the calculation of sample sizes and inform thresholds for distinguishing clinically significant from statistically significant effects.

PMID:37302438 | DOI:10.1016/j.echo.2023.05.010

By Nevin Manimala

Portfolio Website for Nevin Manimala