Objective
The objective of the study was to determine whether an individualized growth standard (IS) improves the identification of preterm small-for-gestational-age (SGA) neonates at risk of developing moderate/severe cerebral palsy (CP) or death.
Study Design
This study was a secondary analysis of data from a randomized trial of MgSO 4 for the prevention of CP or death among anticipated preterm births. Singleton nonanomalous liveborns delivered before 34 weeks’ were classified as SGA (less than the 10th percentile for their gestational age) by a population standard (PS) or an IS (incorporating maternal age, height, weight, parity, race/ethnicity, and neonatal sex). The primary outcome was the prediction of moderate or severe CP or death by age 2 years.
Results
Of 1588 eligible newborns, 143 (9.4%) experienced CP (n = 33) or death (n = 110). Forty-four (2.8%) were SGA by the PS and 364 (22.9%) by the IS. All PS-SGA newborns also were identified as IS-SGA. SGA newborns by either standard had a similarly increased risk of CP or death (PS: relative risk [RR], 2.4, 95% confidence interval [CI], 1.3–4.3 vs IS: RR, 1.8, 95% CI, 1.3–2.5, respectively). The similarity of RRs remained after stratification by the MgSO 4 treatment group. The IS was more sensitive (36% vs 6%, P < .001) but less specific (78% vs 98%, P < .001) for CP or death. The receiver operating characteristic curve analysis revealed a statistically lower area under the curve for the PS, although the ability of either method to predict which neonates would subsequently develop CP or death was poor (PS: 0.55, 95% CI, 0.49–0.60 vs IS: 0.59, 95% CI, 0.54–0.64, P < .001).
Conclusion
An individualized SGA growth standard does not improve the association with, or prediction of, CP or death by age 2 years.
Fetal growth restriction traditionally has been defined according to liveborn growth curves based on a population standard. Some investigators have noted, however, that such a definition is suboptimal because it includes birthweights that result from both normal physiological variation and pathological states. Consequently, the ability to identify those fetuses or newborns with truly abnormal growth, who are most at risk for both short-term and long-term adverse outcomes, is compromised by the use of such a standard.
As a result, a definition of growth restriction based on an individualized standard has been proposed. An individualized standard is based on the predicted growth potential of a given individual and therefore should improve the accuracy with which truly at-risk fetuses or newborns are identified. Indeed, several studies have suggested that certain perinatal morbidities are better identified using an individualized standard as opposed to a population standard. Gardosi and Francis, for example, found that threatened preterm labor, antepartum hemorrhage, pregnancy-induced hypertension, preeclampsia, stillbirth, and early neonatal death all were better identified using the individualized standard.
The potential benefit of using individualized birthweight standards to help predict long-term neurodevelopmental adverse outcomes, however, is not known. Although it is well established that adverse neurodevelopmental outcomes such as cerebral palsy (CP) are associated with fetal growth restriction, many of those who are diagnosed with adverse neurodevelopmental outcomes based on population standards do not have evident growth abnormalities. If individualized standards could improve identification of those newborn who are pathologically growth restricted and thereby the identification of those at risk for long-term disability, counseling and the targeting of early intervention could both be improved.
Accordingly, the primary aim of this study was to determine whether an individualized growth standard, compared with a population-based standard, would improve identification of small-for-gestational-age (SGA) neonates destined to develop moderate or severe CP or death by 2 years of age.
Materials and Methods
This is a secondary analysis of data from a randomized trial (ie, the Beneficial Effects of Antenatal Magnesium Sulfate [BEAM] study) of magnesium sulfate for the prevention of moderate/severe CP or death among infants born prematurely. Full details of this study have been described previously. In brief, women judged to be at high risk for preterm birth prior to 32 weeks of gestation were randomized to either a magnesium sulfate or placebo infusion. After delivery, liveborn infants were followed up for 2 years with detailed developmental assessments performed by trained examiners.
The present analysis includes all nonanomalous singleton liveborns in the BEAM study who delivered prior to 34 weeks of gestation. Each newborn’s birthweight was used to determine whether it was SGA, defined as less than the 10th percentile, according to a population standard or an individualized standard. The population standard used was that proposed by Alexander et al, which is stratified according to neonatal sex and ethnicity.
Because the continuous values of the population standard were not available, the following approach was used to approximate these values: an Arc-Tan based transformation of birthweights (ArcTan[(weight/1000)2]*2/π) was performed, and the corresponding continuous values of the population standard were calculated by a simple approximation method based on the linear connection of the third, 10th, 50th, and 90th percentiles, provided by Alexander et al, in the different race, infant sex, and gestational age groups.
The individualized standard used was that proposed by Gardosi et al, which takes into account maternal age, height, weight, parity, race/ethnicity, and neonatal sex. The software GROW (Gestation Related Optimal Weight; www.gestation.net ) was used to calculate the individualized standard.
The association of the primary outcome, moderate/severe CP or death by 2 years of age, with the diagnosis of SGA by each type of standard was presented as the relative risk with 95% confidence intervals. The ability of each type of standard to accurately classify newborns, based on their SGA status, according to whether they developed CP or death, was assessed using sensitivity, specificity, and the area under the receiver operating characteristic curve (AUC-ROC).
Cerebral palsy was diagnosed by an annually certified pediatrician or pediatric neurologist if 2 or more of the following 3 features were present: a delay of 30% or more in gross motor developmental milestones (eg, inability to sit without arm support by 9.5 months or walk by 17 months of corrected age); abnormality in muscle tone (eg, scissoring), 4 or more or absent deep-tendon reflexes, or movement abnormality (eg, posturing or gait asymmetry); or persistence of primitive reflexes or absence of protective reflexes. When CP was diagnosed, the Gross Motor Function Classification System was used to assess severity.
In addition, the associations of the SGA diagnosis with secondary outcomes, including respiratory distress syndrome, necrotizing enterocolitis, grade 3 or 4 intraventricular hemorrhage, retinopathy of prematurity, seizures, or sepsis, were determined.
All analyses were performed with R ( http://www.r-project.org/ ) and SAS (SAS Institute Inc, Cary, NC). P < .05 was used to define statistical significance, and all tests were 2 tailed. No adjustments were made for multiple comparisons. McNemar’s test for classification agreement was used for the comparison of sensitivity or specificity. A nonparametric statistical method was used for the comparison of the AUC-ROCs. All analyses were repeated by treatment subgroups (placebo vs magnesium sulfate). Institutional review board approval was obtained prior to the initiation of the study.
Results
One thousand five hundred eighty-eight newborns met the inclusion criteria. Characteristics of the study population are presented in Table 1 . Of these, 364 (22.9%) were identified as SGA according to the individualized standard (IS) and 44 (2.8%) were identified as SGA according to the population standard (PS). Forty-four newborns met SGA criteria by both standards (ie, all PS-SGA newborns also were identified as IS-SGA).
Characteristic | n = 1588 |
---|---|
Weeks of gestation at randomization | 28.2 ± 2.4 |
Maternal age, y | 26.3 ± 6.3 |
Maternal height, cm | 162.9 ± 7.5 |
Maternal prepregnancy weight, kg | 69.5 ± 18.9 |
Maternal prepregnancy BMI, kg/m 2 | 26.2 ± 6.8 |
Maternal race/ethnicity, n (%) | |
Black | 719 (45.3) |
White | 598 (37.7) |
Hispanic | 235 (14.8) |
Other | 36 (2.3) |
Married, n (%) a | 760 (48.0) |
Nulliparous, n (%) | 567 (35.7) |
Previous preterm delivery, n (%) | 450 (28.3) |
Cigarette use during pregnancy, n (%) | 475 (29.9) |
One hundred forty-three neonates (9.4%) developed either moderate or severe CP (n = 33, 2.2 %) or death (n = 110, 7.2%) by 2 years of age. Moderate or severe CP was seen in 14.4% of the infants defined as SGA by the IS and in 21.4% of the infants defined as SGA by the PS. Neonates defined as SGA by either the PS or by the IS were significantly more likely to develop moderate/severe CP or death, although the magnitudes of the associations were similar for both types of standards ( Table 2 ).
Variable | Individualized standard | Population standard | P value a |
---|---|---|---|
Relative risk (95% CI) b | 1.8 (1.3–2.5) | 2.4 (1.3–4.3) | — |
Sensitivity | 36% | 6% | < .001 |
Specificity | 78% | 98% | < .001 |
Area under the curve | 0.59 (0.54–0.64) | 0.55 (0.49–0.60) | < .001 |
a P value based on McNemar’s test for classification agreement (sensitivity/specificity) or nonparametric comparison of areas under the receiver operating characteristic curves
The test characteristics of the different growth standards for the prediction of the primary outcome also are presented in Table 2 . Although the individualized standard had a significantly greater sensitivity for the primary outcome, its specificity was significantly lower. Although the area under the curve (AUC) for the prediction of the primary outcome was greater for the individualized standard, this difference was small and unlikely to have clinical relevance. Moreover, the AUC for either the individualized or population standard demonstrates that either standard is poorly predictive of CP or death.
The results were similar for prediction of the various secondary outcomes. Although the IS appeared to be better at predicting these outcomes, the difference was small in magnitude. In addition, the ability of either growth standard to predict any secondary outcome was repeatedly poor ( Table 3 ). The AUC values indicate that prediction of these outcomes was at best barely better than chance.