The Contribution of Low Apgar Scores in Identifying Neonates with Short-term Morbidities in a Large Single Center Cohort

doi:10.21203/rs.3.rs-3334649/v1

Download PDF

Article

The Contribution of Low Apgar Scores in Identifying Neonates with Short-term Morbidities in a Large Single Center Cohort

https://doi.org/10.21203/rs.3.rs-3334649/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 28 Mar, 2024

Read the published version in Journal of Perinatology →

You are reading this latest preprint version

Objective

To evaluate the association and utility of low 1- and 5-minute Apgar scores to identify short-term morbidities in a large newborn cohort.

Methods

15,542 infants > 22 weeks gestation from a single center were included. Clinical data and low Apgar scores were analyzed for significance to 10 short-term outcomes and were used to construct Receiver Operating Characteristic Curves and the area under the curve (AUC) calculated for 10 outcomes.

Results

A low Apgar score related to all (1-minute) or most (5-minute) outcomes by univariate and multivariate logistic regression analysis. Including any of the 4 low Apgar scores only improved the clinical factor AUC by 0.9% ± 2.7% (± SD) and was significant in just 5 of the 40 score/outcome scenarios.

Conclusion

The contribution of a low Apgar score for identifying risk of short-term morbidity does not appear to be clinically significant.

Health sciences/Risk factors

Health sciences/Medical research/Translational research

The Apgar scoring system was developed by Dr. Virginia Apgar over seventy years ago as a tool to assess the condition of a newborn at birth based on five variables: heart rate, respiratory effort, muscle tone, reflex irritability, and color. Its original purpose was to allow for immediate observation and prompt identification of newborns who need resuscitative measures during transition to extrauterine life.¹ However, over the ensuing seven decades, despite the advancement made in evidence-based newborn resuscitation and the advent of Neonatal Resuscitation Program (NRP) that requires evaluation of the newborn infant without any role for the Apgar score², its international recognition and universal use continues.

Over those years, the use of the Apgar scoring tool has gone far beyond its original purpose to guide clinical management decisions and establish a correlation to long term infant health outcomes.^3–7 In a review of 501 papers published in 2018-19, the Apgar score was used as a prognostic factor for outcomes in 19%, more than half of these focused on short term morbidities⁸. Numerous studies have examined the association between low Apgar score and a variety of short-term neonatal morbidities^9–11 but the significance and value of a low Apgar score in identifying newborn infants likely to manifest these morbidities has not been systematically examined.

We conducted a retrospective study using data from the medical records of infants born at ≥ 23 weeks gestational age at a large regional academic medical center between 9/1/2013 and 3/30/2020. The Labor and Delivery service maintained a database of live births, including medical record number, mode of delivery, and Apgar scores. The electronic medical record (EMR) for the mothers and newborns of each delivery were then queried for demographic information and all discharge diagnoses. Information contained in both the database and the EMR (birthdate, medical record numbers) was used to confirm matching of each data set.

Ten short term outcomes, defined as occurring during the initial hospital stay, were selected. Each was counted using any of the ICD9 (before 2016) and ICD10 codes that can be applied. The specific codes for each are listed in Table 1. The selected common morbidities were as follows: Bronchopulmonary dysplasia (BPD); necrotizing enterocolitis (NEC); Intraventricular hemorrhage (IVH); retinopathy of prematurity (ROP); Hypoxic ischemic encephalopathy (HIE); respiratory distress syndrome (RDS); transient tachypnea of the newborn (TTNB); newborn sepsis, hypoglycemia (HypoG); and meconium aspiration syndrome (MAS).

Eight predictor variables that would be available during the initial hospitalization and that have been previously associated with one or more of the short-term outcomes were recorded for each mother-infant dyad. These included gestational age, birthweight, gender, race, mode of delivery, being small for gestational age (SGA) and 1- and 5-minute Apgar scores.

The study was approved as exempt by the Virginia Commonwealth University IRB.

Analysis

Because of the well-documented variability in Apgar scoring across countries¹², individuals¹³ and by newborn conditions¹⁴, we chose to use cut-off scores at both one and five minutes. The most common definition of significance in the 2018-19 review was a total score of less than 7 (Apgar One_06 and Apgar Five_06) ⁸. A score of less than 4 has been commonly used to identify possible asphyxia (Apgar One_03 and Apgar Five_03)¹⁵. We did not use any scores beyond five minutes because it is generally only done when the previous scores are low, which creates a likely selection bias for this data.

The sensitivity, specificity, negative (NPV) and positive (PPV) predictive value as well as odds ratio for the four low Apgar scores were calculated for each short-term morbidity. Multivariable logistic analysis was performed for each morbidity using each of the low Apgar scores and gestational age by week, gender, race, mode of delivery, and whether they were small for gestational age. Odds ratios, 95% confidence intervals, and p-values were calculated for all models. Receiver operator characteristic (ROC) curves for the multivariable models were calculated with and without each low Apgar score and the differences in the area under the curve (AUC) when each was included or omitted was calculated. For comparison, a similar analysis was done using the Apgar score first and then adding in the clinical factors. Statistical significance between these AUCs was calculated DeLong’s test¹⁶.

Figure 1 is a consort diagram of the study 17,135 mothers in the labor and delivery birth delivery database from September 1, 2013 to March 30, 2020. Of these 16,703 had complete EMR data for the newborns. There were 444 with missing discharge diagnosis codes, had major congenital anomalies that might impact the outcomes, were < 23 weeks gestation or died in the delivery room. The final cohort consisted of 15,542 (90.7%) infants. The median length of stay was 2 days (IQR 2–3 days).

Table 2 presents the incidence of each short-term outcome for the study cohort, ranging from 7.3% for MAS to 0.4% for NEC overall and the incidence of each potential risk factor for each outcome. By univariate analysis of clinical factors, gender was a significant risk for TTNB, RDS and HypoG, race for all outcomes except HIE, gestational age for all outcomes, mode of delivery for all outcomes except MAS, and SGA status significantly related to TTNB, HypoG and IVH. Focusing on the Apgar score, as noted in Table 3, the 4 different low Apgar scores were significantly associated with the ten outcomes in 38 of the 40 scenarios. The only exceptions were TTNB and HypoG for Apgar Five_03

To examine whether a low Apgar score remained a significant risk factor when other clinical risk factors were accounted for, multivariable logistic models were created with each of the four low Apgar scores for each outcome. Table 4 shows that in this analysis, there were 11 scenarios where a low score was not significant. For HypoG, only Apgar Five_03 remained significant, and for NEC, none of the Apgar scores remained significant. In addition, the Apgar One_06 was not significant for BPD or ROP, the Apgar Five_03 score was not significant for IVH and the Apgar Five_06 was not significant for TTNB.

To further define how much a low Apgar score contributes to the risk identification of newborn infants for the ten outcome diagnoses, ROC curves were created for each diagnosis using multivariate equations incorporating gestational age, gender, race, mode of delivery, and SGA status with and without each of the 4 low Apgar scores, and the significance of the difference in the AUC when the low Apgar was present or absent determined. Birthweight was not included as birthweight and SGA status were, in combination, stronger contributors to the final model. We also examined the AUC for the ROC curves created using the Apgar score first and then adding the clinical factors. In Tables 5 and 6, for each Apgar score, the upper rows are for the condition where the score is added to the ROC constructed from the clinical factors, while the second group of rows shows what happens to the AUC when the clinical factors are added to the model constructed first with the Apgar score. When the Apgar One score was added (Table 5), the AUC increased significantly for HIE at both the < 3 and < 6 levels and for RDS and MAS for a score < 6. There was no effect for the other 17 outcomes. The average change in AUC was 3.92% (CI 0.60 to 7.25). In contrast, adding the clinical factors to the Apgar score curve increased the AUC significantly for all the outcomes and far more substantially (average 27.64% CI 22.81 to 32.47). The results for the Apgar Five (Table 6) are similar. Adding Apgar Five scores to clinical factors changes the AUC of the ROCs by 1.836% (CI -0.3593 to 4.031) while adding clinical factors to Apgar Five increased the AUCs by 45.01% (CI 36.93 to 53.09). Figure 2 illustrates the difference in the ROC curves when the addition of a low Apgar is significant (HIE) and when it is not (sepsis).

The primary and predominant purpose of the Apgar score has been to assess the status of an infant in the first few minutes of life.^1,7 The rationale for such a scoring system is based on the understanding that having difficulties in the transition to extrauterine life is not good for the newborn, i.e. that such difficulties are associated with worse outcomes so identifying these babies could lead to interventions that could mitigate these outcomes. This is supported by the observation that the adoption of the Apgar score did not become widespread, and then universal, until there was evidence that low scores occurred far more frequently in babies who either died or had neurological deficits in the first year of life^{17, 18}.

Over the decades, the score has consistently been used as a risk factor in clinical studies.⁷ It has been associated not only with an increased incidence of long-term neurological conditions, including cerebral palsy and seizures,¹⁹, but also with a wide variety of conditions such as attention deficit disorder/hyperactivity²⁰, permanent dentition²¹, cancer²², food allergy²³, autism spectrum disorder²⁴, polycystic kidney disease²⁵ and amblyopia²⁶. The Apgar score is used as often for research into morbidities that manifest in the post-natal period, including all the discharge diagnoses used as short-term outcomes in this current study.^10,27–35 Short term outcomes have also been used for all studies that have examined modifications or replacements for the Apgar, and any future such efforts are likely to do the same.^36–38 It is noteworthy that the NRP does not use the one- or five-minute Apgar score. Rather, to identify which newborn infant might qualify for closer post-natal observation, it relies on one criterion, the need for respiratory support, but this could miss babies at risk for several of the diagnoses in the current study.²

This study used a range of morbidities occurring during the initial hospital stay to determine, first if a low Apgar score is more frequent in those babies who were given these diagnoses compared to those without the conditions and confirmed previous associations for the risk factors and the various short-term morbidities.

Our study also found that a low Apgar score at one or five minutes was found significantly more often for all but two of the outcome/Apgar analyses and the negative predictive value was generally strong across all outcomes (Table 3). When taking into account other risk factors (Table 4), at least one of the four low Apgar scores were statistically more frequently found in all but one of the ten outcomes. For NEC, when the other risk factors were added in, Apgar scores of ≤ 3 or ≤ 6 at one or five minutes was no longer significant.

The AUC value of the ROC is often used to assess the clinical value of a predictive model³⁹ with higher values above 0.5 indicating a better model. We have used this to further analyze how much the presence of a low Apgar score contributes to identifying newborn infants who will go on to have one of the short-term outcomes included in this study. This confirmed that low Apgar scores can make a major and significant contribution in predicting HIE, which is not surprising since low scores are often part of the diagnosis⁴ and supports the validity of this analytic method. Overall, the inclusion of a low Apgar score added little to the predictive model. It was only statistically significant for the Apgar One_06 for RDS and MAS. Otherwise, it improved the AUC by less than 3.5%, and in many cases by less than 1%. In contrast, the addition of clinical factors to ROCs constructed by Apgars scores alone increased by 14–86%, indicating that the Apgar score does not contribute as much to identifying newborns at risk for short term morbidities as clinical factors.

There are several significant limitations to this study that should be addressed. The study used retrospective data from a single center. The ten outcomes had a wide range of incidences, which can have an effect on predictive values, for example, and several are associated primarily with prematurity, such as RDS, IVH, and NEC, but previous studies have included Apgar scores in risk assessments for these conditions.^28,32,34 The accuracy of discharge diagnosis codes has been questioned.⁴⁰ As one example, we found several instances where codes for both TTNB and RDS were assigned to the same subject. Our goal was to ensure that all potential cases were captured, so we used a wide range of codes. As a result, for some short-term outcomes, such as MAS, there was a high incidence. We do note that the codes are commonly used in retrospective neonatal studies, and they were used consistently within this single center. Other risk factors such as maternal age, race, or maternal chorioamnionitis were not included. We chose to use the two most common⁸ cut-off values at one and five minutes rather than all the Apgar scores from 1 to 10 to account for some of the known variability in scoring and capture a sufficient number of subjects per outcome to analyze. Other investigators have used the complete Apgar scale, usually.in long term outcome studies involving over one million.¹⁰ Finally, Dr. Apgar designed her system to assess the status of the infant immediately after birth. Starting in 1966,¹⁷ however, it has been used as a risk factor hundreds of times.

Strengths include a larger number of subjects than most studies which have examined the Apgar score in relation to short term outcomes. While we looked at the common ways of assessing a risk factor, such as sensitivity, specificity, positive and negative predictive value, and the odds ratio within the context of a multivariable analysis as well as the AUC of the ROC graphs, adding the AUC analysis with and without the low Apgar is a way to directly answer the question of its utility.

The Apgar score has been assessed around the world to an estimated three billion or more newborn infants over the last seventy years. During that time, concerns have been repeatedly raised about it. Yet it remains an important tool in the delivery room for assessing the immediate condition of the newborn. It appears to have good utility for assessing risks of long-term outcomes when applied to large populations, but our findings suggest that it is not a significant contributor to identifying newborn infants who would benefit from a higher level of care because of the risk of short-term outcomes.

ADDITIONAL INFORMATION:

The authors have no competing interests for any aspect of this study. The work was done in accordance with the Declaration of Helsinki.

Funding: No funding was received in support of this work.

Authors contributions: Drs. Yitayew and Rozycki did the primary project design. All authors reviewed the raw data. Dr. Huang performed all the statistical analysis. All author contributed to writing and revising the manuscript.

Apgar V. A proposal for a new method of evaluation of the newborn infant. Curr Res Anesth Analg 1953; 32: 260-7.
Wyckoff MH, Wyllie J, Aziz K, de Almeida MF, Fabres J, Fawke J, et al. Neonatal Life Support Collaborators. Neonatal Life Support: 2020 International Consensus on Cardiopulmonary Resuscitation and Emergency Cardiovascular Care Science With Treatment Recommendations. Circulation. 2020 142: S185- S221.
Behnke M, Eyler FD, Carter RL, Hardt NS, Cruz AC, Resnick MB. Predictive value of Apgar scores for developmental outcome in premature infants. Am J Perinatol. 1989; 6: 18-21.
Mosalli R. Whole body cooling for infants with hypoxic-ischemic encephalopathy. J Clin Neonatol. 2012; 1:101-6.
Ehrenstein V, Pedersen L, Grijota M, Nielsen GL, Rothman KJ, Sørensen HT. Association of Apgar score at five minutes with long-term neurologic disability and cognitive function in a prevalence study of Danish conscripts. BMC Pregnancy Childbirth. 2009; 9: 14
Razaz N, Cnattingius S, Persson M, Tedroff K, Lisonkova S, Joseph KS. One-minute and five-minute Apgar scores and child developmental health at 5 years of age: a population-based cohort study in British Columbia, Canada. BMJ Open. 2019; 9: e027655.
Razaz N, Norman M, Alfvén T, Cnattingius S. Low Apgar score and asphyxia complications at birth and risk of longer-term cardiovascular disease: a nationwide population-based study of term infants. Lancet Reg Health Eur. 2022; 24: 100532.
Rozycki HJ, Yitayew M. The Apgar score in clinical research: for what, how and by whom it is used. J Perinat Med. 2022; 51: 580-585.
Roy B, Webb A, Walker K, Morgan C, Badawi N, Novak I. Risk factors for perinatal stroke in term infants: A case-control study in Australia. J Paediatr Child Health. 2023; 59: 673-679.
Razaz N, Cnattingius S, Joseph KS. Association between Apgar scores of 7 to 9 and neonatal mortality and morbidity: population based cohort study of term infants in Sweden. BMJ. 2019; 365: l1656.
Arpino C, Domizio S, Carrieri MP, Brescianini DS, Sabatino MG, Curatolo P. Prenatal and perinatal determinants of neonatal seizures occurring in the first week of life. J Child Neurol. 2001; 16: 651-6.
Siddiqui A, Cuttini M, Wood R, et al. Can the Apgar Score be Used for International Comparisons of Newborn Health?. Paediatr Perinat Epidemiol. 2017; 31: 338-345.
O'Donnell CP, Kamlin CO, Davis PG, Carlin JB, Morley CJ. Interobserver variability of the 5-minute Apgar score. J Pediatr 2006; 149: 486-489.
Lopriore E, van Burk GF, Walther FJ, de Beaufort AJ. Correct use of the Apgar score for resuscitated and intubated newborn babies: questionnaire study. BMJ. 2004; 329: 143-144.
Casey BM, McIntire DD, Leveno KJ. The continuing value of the Apgar score for the assessment of newborn infants. N Engl J Med. 2001; 344: 467-71.
DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics 1988; 44: 837–45.
Drage JS, Kennedy C, Schwarz BK. The Apgar Score as an index of neonatal mortality: A report from the Collaborative study of cerebral palsy. Obstet Gynecol. 1964; 24: 222-30.
Drage JS, Kennedy C, Berendes H, Schwarz BK, Weiss W. The Apgar score as an index of infant morbidity. A report from the collaborative study of cerebral palsy. Dev Med Child Neurol. 1966; 8: 141-8
Persson M, Razaz N, Tedroff K, Joseph KS, Cnattingius S. Five and 10 minute Apgar scores and risks of cerebral palsy and epilepsy: population based cohort study in Sweden. BMJ. 2018; 360: k207.
Jenabi E, Ayubi E, Farashi S, Bashirian S, Mehri F. The neonatal risk factors associated with attention-deficit/ hyperactivity disorder: an umbrella review. Clin Exp Pediatr. 2023 Jul 14. doi: 10.3345/cep.2022.01396
da Silva Júnior IF, Costa FDS, Correa MB, de Barros FCLF, Santos IDSD, Matijasevich A, Demarco FF, Azevedo MS. Pre-, Peri-, and Postnatal Risk for the Development of Enamel Defects in Permanent Dentition: A Birth Cohort in Southern Brazil. Pediatr Dent. 2023; 45: 328-335
Kampitsi CE, Nordgren A, Mogensen H, Pontén E, Feychting M, Tettamanti G. Neurocutaneous Syndromes, Perinatal Factors, and the Risk of Childhood Cancer in Sweden. JAMA Netw Open. 2023; 6: e2325482.
Mitselou N, Hallberg J, Stephansson O, Almqvist C, Melén E, Ludvigsson JF. Cesarean delivery, preterm birth, and risk of food allergy: Nationwide Swedish cohort study of more than 1 million children. J Allergy Clin Immunol. 2018; 142: 1510-1514.e2.
Modabbernia A, Sandin S, Gross R, Leonard H, Gissler M, Parner ET, et al. Apgar score and risk of autism. Eur J Epidemiol. 2019; 34: 105-114.
Burgmaier K, Kunzmann K, Ariceta G, Bergmann C, Buescher AK, Burgmaier M, et al. Risk Factors for Early Dialysis Dependency in Autosomal Recessive Polycystic Kidney Disease. J Pediatr. 2018; 199: 22-28.e6.
Mocanu V, Horhat R. Prevalence and Risk Factors of Amblyopia among Refractive Errors in an Eastern European Population. Medicina (Kaunas). 2018; 54: 6.
Moftian N, Samad Soltani T, Mirnia K, Esfandiari A, Tabib MS, Rezaei Hachesu P. Clinical Risk Factors for Early-Onset Sepsis in Neonates: An International Delphi Study. Iran J Med Sci. 2023; 48: 57-69.
Takaya A, Igarashi M, Nakajima M, Miyake H, Shima Y, Suzuki S. Risk factors for transient tachypnea of the newborn in infants delivered vaginally at 37 weeks or later. J Nippon Med Sch. 2008; 75: 269-73.
Altman M, Vanpée M, Cnattingius S, Norman M. Risk factors for acute respiratory morbidity in moderately preterm infants. Paediatr Perinat Epidemiol. 2013; 27: 172-81.
Oliveira CPL, Flôr-de-Lima F, Rocha GMD, Machado AP, Guimarães Pereira Areias MHF. Meconium aspiration syndrome: risk factors and predictors of severity. J Matern Fetal Neonatal Med. 2019; 32: 1492-1498.
Zhang J, Mu K, Wei L, Fan C, Zhang R, Wang L. A prediction nomogram for moderate-to-severe bronchopulmonary dysplasia in preterm infants < 32 weeks of gestation: A multicenter retrospective study. Front Pediatr. 2023; 11: 1102878
Schifrin BS, Ater S. Fetal hypoxic and ischemic injuries. Curr Opin Obstet Gynecol. 2006; 18: 112-22.
Kordasz M, Racine M, Szavay P, Lehner M, Krebs T, Luckert C, et al. Risk factors for mortality in preterm infants with necrotizing enterocolitis: a retrospective multicenter analysis. Eur J Pediatr. 2022; 181: 933-939.
Ying GS, Bell EF, Donohue P, Tomlinson LA, Binenbaum G; G-ROP Research Group. Perinatal Risk Factors for the Retinopathy of Prematurity in Postnatal Growth and Rop Study. Ophthalmic Epidemiol. 2019; 26: 270-278.
Szpecht D, Szymankiewicz M, Nowak I, Gadzinowski J. Intraventricular hemorrhage in neonates born before 32 weeks of gestation-retrospective analysis of risk factors. Childs Nerv Syst. 2016; 32: 1399-404.
Rüdiger M, Braun N, Aranda J, Aguar M, Bergert R, Bystricka A. et al. Neonatal assessment in the delivery room: Trial to Evaluate a Specified Type of Apgar (TEST-Apgar). BMC Pediatr. 2015; 15: 18
Dalili H, Sheikh M, Hardani AK, Nili F, Shariat M, Nayeri F. Comparison of the combined versus conventional Apgar scores in predicting adverse neonatal outcomes. PLoS ONE. 2016; 11: e0149464.
Witcher TJ, Jurdi S, Kumar V, Gupta A, Moores RR Jr, Khoury J, Rozycki HJ. Neonatal Resuscitation and Adaptation Score vs Apgar: newborn assessment and predictive ability. J Perinatol. 2018; 38: 1476-1482.
Schlattmann P. Statistics in diagnostic medicine. Clin Chem Lab Med. 2022; 60: 801-807.
O'Malley KJ, Cook KF, Price MD, Wildes KR, Hurdle JF, Ashton CM. Measuring diagnoses: ICD code accuracy. Health Serv Res. 2005; 40: 1620-39.

Tables 1 to 6 are available in the Supplementary Files section.

There is NO conflict of interest to disclose.

Table1.jpg
Table 1
Table2.jpg
Table 2
Table3.jpg
Table 3
Table4.jpg
Table 4
Table5.jpg
Table 5
Table6.jpg
Table 6

Download PDF

Journal Publication

published 28 Mar, 2024

Read the published version in Journal of Perinatology →

Editorial decision: revise
17 Nov, 2023
Review #2 received at journal
08 Nov, 2023
Reviewer #2 agreed at journal
25 Oct, 2023
Review #1 received at journal
12 Oct, 2023
Reviewer #1 agreed at journal
12 Oct, 2023
Reviewers invited by journal
21 Sep, 2023
Submission checks completed at journal
11 Sep, 2023
First submitted to journal
07 Sep, 2023
Editor assigned by journal
07 Sep, 2023

You are reading this latest preprint version

The Contribution of Low Apgar Scores in Identifying Neonates with Short-term Morbidities in a Large Single Center Cohort

Status:

Journal Publication

Version 1

Abstract

Objective

Methods

Results

Conclusion

Figures

INTRODUCTION

METHODS

Analysis

RESULTS

DISCUSSION

Declarations

References

Tables

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1