Data sources
ALSPAC is a geographically defined longitudinal birth cohort in the old administrative county of Avon in Southwest England. Pregnant women resident in Avon, UK with expected dates of delivery 1st April 1991 to 31st December 1992 were invited to take part in the study. The initial number of pregnancies enrolled was 14541. Of these initial pregnancies, there was a total of 14676 foetuses, resulting in 14062 live births and 13988 children who were alive at 1 year of age.[,]
When the oldest children were approximately 7 years of age, an attempt was made to bolster the initial sample with eligible cases who had failed to join the study originally. As a result, when considering variables collected from the age of 7 onwards (and potentially abstracted from obstetric notes) there are data available for more than the 14541 pregnancies mentioned above. The number of new pregnancies not in the initial sample (known as Phase I enrolment) that are currently represented on the built files and reflecting enrolment status at the age of 24 is 913 (456, 262 and 195 recruited during Phases II, III and IV respectively), resulting in an additional 913 children being enrolled. The phases of enrolment are described in more detail in the cohort profile paper and its update.[17] The total sample size for analyses using any data collected after the age of seven is therefore 15454 pregnancies, resulting in 15589 foetuses. Of these 14901 were alive at 1 year of age.
ALSPAC is a dataset which has frequent follow ups, and includes 34 child-completed questionnaires and 25 mother- or caregiver-completed questionnaires.[17] The ALSPAC study website contains details of all data available through a fully searchable data dictionary and variable search tool.[] Ethical approval for the study was obtained from the ALSPAC Ethics and Law Committee and the Local Research Ethics Committees. Informed consent for the use of data collected via questionnaires and clinics was obtained from participants following the recommendations of the ALSPAC Ethics and Law Committee at the time.
Data analysis
Descriptive analyses using linear, logistic and multinomial regression were conducted to assess relationships between maternal social class (exposure at baseline), total daily calories (mediator at age 7 years), BMI (outcome at age 11 years), and confounders. Total daily calories were not normally distributed; therefore, median daily calories were reported and used to inform simulations (Table 2). Prevalence of obesity at age 11 years by maternal social class was reported with relative and absolute inequalities (Table 3). Stata SE 15.1 was used to perform all analyses.[]
Logistic regression was used to model the association between maternal social class and BMI at 11 years within a marginal structural modelling (MSM) framework.[] Inverse probability weights (IPWs) were constructed between the exposure with baseline confounding.[34] Weights were truncated between 1% and 99% to deal with the potential influence of outliers.[16] Predicted probabilities obtained from the regression models were used to estimate the prevalence of obesity overall and by maternal occupational social class. This provided the total direct effect (TDE) of maternal social class on obesity.[16]
Relative and absolute inequalities were estimated by repeating regression models using maternal occupational social class as a continuous term. Relative inequalities (risk ratios) were given by the ratio of fitted probabilities of overweight and obesity between the highest and lowest maternal occupational social groups, while absolute inequalities (risk differences) were given by the difference between the fitted probabilities between the highest and lowest maternal occupational social groups.
Total daily calories was then included in the model as a continuous variable with IPWs for baseline and intermediate confounding. This provided the control direct effect (CDE) from the adjusted model i.e. the estimated effect of maternal social class on obesity when total daily calories was fixed at observed levels.[16,] CDE is the model against which simulated scenarios were compared.
To simulate intervention scenarios, the mediator was adjusted to reflect reductions in calorie intake. Predicted probabilities of obesity were re-estimated for each scenario and compared to the original CDE model.
Effectiveness for intervention scenarios was modelled by simulating various reductions to calorie intake. For each reduction in calories, a normal distribution was generated around the adjusted level in order that likely variability was reflected, meaning reduction of intake varied between individual children receiving the intervention.
Interventions were either universal (for all children; scenario 1), targeted (based on elevated family or individual risk of future obesity; scenarios 2 and 4) or indicated past on past weight status (scenario 3). Family based targeting was income-based, where children of low income families (less than 60% the UK median[,]) received an intervention or individual-based, where children with reportedly high consumption received and intervention.
An indicated intervention (scenario 3) was simulated for children living with obesity at a prior age. Obesity at age 7 was defined using z-scores BMI using the UK90 reference data[22] and cut-offs for epidemiological application.[23]
In each scenario, only eligible children received an intervention. In the universal scenario, all children were eligible for an intervention but in scenarios 2–4 eligibility was determined according to targeted and indicated criteria.
In terms of uptake of eligible children, for simulations 1 and 2, 75% of children were randomly assigned to the intervention group given that compliance with an intervention would be less than 100%; however, there was no evidence of relevant population interventions to guide a realistic uptake level. When targeting on individual risk (scenario 4) and with the indicated intervention (scenario 3), all children with high consumption or living with obesity at an earlier age were eligible for an intervention.
Simulations (Table 1)
Each scenario represents a potential population policy action or intervention and follows a structure with a level of effectiveness (extent of calorie reduction), targeting or indicating based on income or risk of future obesity, and a level of uptake of eligible children.
Scenario 1 modelled the impact of a universal intervention that reduced population intake of calories down for children aged 7 years; uptake of the intervention was set at 75%. The population distribution of intake was shifted down by reducing median calorie intake to the EAR. Median intake in calories (taken from food diaries) was 1732.4 for boys and 1654.1 for girls, while the EAR is estimated to be 1649 for boys and 1530 for girls. Therefore, to bring median intake in line with EAR, boys would need to consume 83.4 less calories (4.8%) and girls 124.1 (7.5%), which equated to a 6.1% overall decrease overall. Sex-specific reductions were applied with random variation.
Scenario 2 modelled a more intensive intervention and was targeted to children of low-income families; uptake among eligible children was set at 75%. The intensive intervention was designed to represent a healthy weight loss intervention and was informed by EAR values for adults (sex and age groups combined)[31] and the recommendation for adults to eat 500 calories fewer per day to achieve healthy, long-term weight loss.[32] Based on this, the effectiveness of the hypothetical intervention, equated to a 21.3% decrease in daily intake. Given the relationship between disadvantage and obesity,[5] children from low-income families were targeted as they are at heightened risk.
Scenario 3 modelled the same intensive reductions in intake based on recommendations for effective, healthy and long-term weight loss in adults (as in scenario 2, this equated to a -21.3% decrease in daily intake). This intervention was for children living with obesity at a prior time, given that obesity in children tracks through adolescence and into adulthood.[2] Eligible children were indicated using weight status; uptake (all children living with obesity at age 7 years) was 9.3%.
Scenario 4 modelled the impact of a targeted intervention that reduced intake among children reported to be consuming the most. This intervention effectively truncated the population distribution of intake by targeting children with intake exceeding EAR. For these children, intake was then fixed at the level of the EAR in order that every child’s intake was equal to or less than the EAR (1649 and 1530 daily calories for boys and girls respectively).
For all simulations, as guided by the lower nutrient intake bound,[] a lower bound was set at 2SD below the mean intake as reported from food diaries in the ALSPAC analytic sample. This lower bound prevented simulations from reducing calorie intakes for children with low reported consumption. Variables used in targeted and indicated scenarios were recorded at the same time as the mediator and were independent to the model.
Table 1
Simulated intervention scenarios
Scenarios | Calorie reduction | Target | Uptake |
1. Universal intervention to meet estimated average requirements (ear) | -6.1% (-4.8% for boys, -7.5% for girls) | All children | 75% |
2. Targeted intensive intervention for children of low income families | -21.3% | Children from low income families | 75% |
3. Indicated intensive intervention for children with prior obesity | -21.3% | Children living with overweight or obesity at age 7 years | 9.3% |
4. Targeted intervention for children consuming excess total daily calories | Variable | Boys consuming > 1649 and girls consuming > 1530 kcal per day | 62.9% (55.7% of boys 70.7% of girls) |