Ethics Approval Statement. This cluster randomized controlled trial (RCT) received research ethics approval from the [BLINDED] Research Ethics Committee. Early Years education settings opted into the study after receiving information about all of its elements. Parents and guardians could opt out of the study by communicating this to their setting, preserving their anonymity. This opt-out model of participation was selected because it is more likely to retain families and children from socio-economically disadvantaged backgrounds in longitudinal designs (Bray et al., 2015).
Participants.
Child-level demographics. The study sample consisted of 193 children (Mage at baseline = 47.2 months, range = 41-54; 111 female; reported ethnicity: 69% White, 16.1% Asian, 10.3% Multiple Ethnic Groups, 2.3% Black, 2.3% Other). Child demographics by intervention and control group are reported in Table 1. Economic disadvantage was indexed by eligibility for Early Years Pupil Premium (EYPP). Eligibility for this programme in England requires family annual income below GBP 16,190 and/or meeting other high-risk criteria (e.g., asylum-seeker status). EYPP eligibility is therefore an index of economic disadvantage, although it may underestimate disadvantage because not all eligible parents apply (for reasons associated with stigma, social desirability, and/or administrative barriers in the application process; Roberts, Griggs, & Robb, 2017). EYPP eligibility was assessed from reports by the child’s nursery school (N = 147) and from parent-reported income (N = 77). Of the 161 children (83.4% of the sample) for whom these data were available, 24.8% (N = 40) were deemed eligible for EYPP (higher than the 14% national UK average for 2022; UK Government, 2023). Of note, when the study was conducted, all 3- to 4-year-olds in England were eligible for at least 15 hours of free preschool, whether or not they attended a private setting, making preschool an appropriate environment in which to target disadvantage because it was accessible to all. The control group and the intervention group were well matched in terms of age in months, sex, EYPP eligibility and school readiness (see Table 1).
Table 1. Summary of demographic information for control and intervention children
| Measure | Control | Intervention | Difference |
| --- | --- | --- | --- |
| Number of participants (N) | 90 | 103 | |
| Age pre-intervention (months, SD) | 47.2 (.36) | 47.3 (.37) | n.s. (p = .841) |
| Sex (% female) | 56.7 | 58.3 | n.s. (p = .824) |
| EYPP eligibility (% eligible) | 21 | 28 | n.s. (p = .406) |
| Special educational needs (SEND) (%) | 6.7 | 4.9 | n.s. (p = .497) |
| English spoken at home (%) | 80 | 68 | n.s. (p = .155) |
| Average BESSI score | 1.16 | 1.19 | n.s. (p = .329) |
Note. For children for whom information was not returned, the data were treated as unknown. Ethnicity was reported via (voluntary) completion of a parent questionnaire; 87 parents (45%) returned this information. The Brief Early Skills and Support Index (BESSI; Hughes et al., 2015) is a teacher-reported measure of school readiness, with higher scores representing lower school readiness. Items cover: (1) behavioural adjustment, (2) language and cognition, (3) daily living skills, and (4) family support. Each item is given a score of 1 (strongly agree or agree) or 2 (strongly disagree or disagree), with a higher score representing more problem behaviours.
Setting-level characteristics. Fifty-eight settings were approached to take part in this research on the basis of geographical spread and feasibility of travel from [BLINDED], of which 20 (34.5%) consented to take part (see CONSORT diagram). Four of these settings took part in an initial co-development phase of the research, with the other 16 participating in the RCT evaluation of The ONE Programme reported here. Settings were randomized to either the control group or the intervention group by a research team member who had not interacted with any of the settings, stratifying on the basis of setting size (large/small), setting type (private or not) and UK neighbourhood disadvantage metrics (the Indices of Multiple Deprivation (IMD) decile and the Income Deprivation Affecting Children Index (IDACI) decile based on the postcode of the preschool; UK Government, 2022). The process allocated 8 settings to the intervention group and 8 to the control group; one control setting withdrew due to ongoing COVID-19 pressures, leaving 7 control settings. The groups were well matched on the stratification variables (see Table 2).
Table 2. Summary of characteristics for control and intervention settings
| Measure | Control settings | Intervention settings | Difference |
| --- | --- | --- | --- |
| Number of settings | 7 | 8 | |
| IMD (SD) | 5.5 (2.62) | 5.5 (2.14) | n.s. (p = 1.0) |
| IDACI (SD) | 5.0 (2.39) | 5.0 (2.82) | n.s. (p = 1.0) |
| Average number of children per setting (SD) | 30.7 (25) | 30.9 (21) | n.s. (p = .725) |
| Setting type (% private) | 62.5 | 42.9 | n.s. (p = .317) |
Note. Characteristics of settings volunteering into the study. IMD = Indices of Multiple Deprivation decile; IDACI = Income Deprivation Affecting Children Index decile. Both range from 1 to 10, with lower scores indexing greater deprivation.
In addition, prior to randomization and to pre-intervention child assessments, each setting received a half-day observation of the interactional quality of the Early Years environment, using the Sustained Shared Thinking and Emotional Wellbeing Scale (SSTEW; Siraj et al., 2015). The SSTEW scale was developed to assess the quality of interactions between adults and children in early childhood education settings, and its overall score predicted early numeracy indices in a large sample of Australian pre-schoolers (Howard et al., 2018). We supplemented the SSTEW with bespoke mathematics observation items, capturing interactional quality in the context of counting and cardinality, shape and spatial awareness, patterning and ordering, and numerical knowledge. This observation schedule yielded a score per setting that allowed us to evaluate setting comparability in the adult support already provided to children. In addition, it allowed us to model the effects of the integrated intervention on children while controlling for the nesting of children in settings that varied in baseline interactional quality (see Analysis Plan). Settings in the control and intervention groups were well matched in terms of the quality of Early Years interactions (Mintervention = 4.27, SD = 1.48; Mcontrol = 4.01, SD = 1.85).
Procedure.
Intervention Group. The intervention protocol was co-developed with Early Years practitioners and consisted of four weekly 30-minute face-to-face interactive workshop-style professional development (PD) sessions with Early Years practitioners, followed by eight further weeks of classroom delivery. The four sessions supported practitioners’ explicit understanding of how early mathematics and EF co-develop, introduced 25 Mathematics + EF activities, and explained how EF can be embedded into a range of routine early mathematics learning activities. All activity cards described their mathematical content and executive demands explicitly. The activities ranged from EF-enhanced modifications of common early childhood games (e.g., “What’s the Time Mr Wolf?”, with embedded executive demands – e.g., “We do not walk if Mr Wolf says... ‘it’s 2 o’clock’”), to more novel activities introducing challenge in EF and mathematics through play (e.g., “Number Robot”, a handmade cardboard function machine requiring cognitive flexibility to apply mathematical functions; Moss et al., 2016). All activities started with mathematical content and EF challenge at a base level; instructions and training were provided to scale complexity as the activities became familiar to children.
Activities were designed to use low-cost and readily available materials. In consultation with pilot settings and early years specialists, the activities were explicitly designed to be chosen flexibly each week by teachers, rather than in a fixed order, to suit each setting’s context, given the diversity of setting types (e.g., presence or absence of outdoor space, preference for small or large group activities), thereby maximising acceptability and feasibility. Preschool staff were asked to implement a minimum of three of these activities per week with 3- to 4-year-old children at their setting, for the 12-week duration of the programme. The intervention was carried out at the whole-class level and was not targeted towards specific groups of children.
Despite this flexibility and choice, core demands were made of all educators; these demands reflected the intervention’s theory of change and were explicitly explained to classroom educators. First, the three activities undertaken within a week should be chosen to target breadth in mathematical content, with one activity from each of the three key areas of mathematics represented in the activity pack (numbers and counting, patterns and ordering, space and shapes). Second, practitioners were asked to play the activities in their basic form in Weeks 1-8 of the programme; in Week 8, they were reminded to increment the executive challenge of chosen activities as children became increasingly familiar with them. In addition to activities being recorded on a poster provided to log adherence, one representative per setting was contacted in the 8th and 12th weeks to enable practitioners to reflect on how the programme was going, to enable a member of the delivery team to provide support, and, in Week 12, to conduct an interview (establishing the acceptability of, and barriers to, the programme) and an observation (to check fidelity of delivery).
Control Group. We compared children nested in settings receiving the intervention to a practice-as-usual control group of children who received standard Early Years education following the Early Years Foundation Stage curriculum (Department for Education, 2020). We were specifically interested in whether the programme improved children’s mathematical skills above and beyond the teaching in mathematics already embedded in the curriculum. As the intervention took place in Early Years settings, children and educators in the practice-as-usual settings were not passive: children in this group received instruction and teaching from their educators, following standard practice that aims to foster socio-emotional self-regulation and mathematical skills as set out in the curriculum. We aimed to capture these practices across all settings via structured observations, while contrasting explicit EF and mathematics integration against practice-as-usual levels of integration. Our trial design was in line with education trials guided by policy-makers and practitioners, who want to know whether a programme works above and beyond usual practice.
Pre and Post-Intervention Assessments. All children were tested individually across two 30-minute sessions, counterbalanced across children, on two separate days, both before and after the intervention period. Random assignment to either the intervention or practice-as-usual arm occurred after completion of baseline data collection. Post-intervention child-level assessments were carried out by researchers who were blind to trial arm allocation, on average 5 months after the pre-intervention assessments.
Mathematics. General numeracy - Early Years Toolbox Numeracy (Howard et al., 2021). The Early Years Toolbox Numeracy (EYTN) task is a tablet-based measure of general numeracy skills. Interspersed items pertain to various mathematical domains: number sense, cardinality and counting, numerical operations, spatial and measurement constructs, and patterning. The total accuracy score was used for analysis, with one point scored for each correct item.
Specific mathematical skills. Count High (Coolen et al., 2021). To assess children's counting skills, children were instructed to count as high as they could, and the highest number reached without making any mistakes was recorded, with 100 as the maximum. Give N (adapted from Cahoon et al., 2021). A version of the Give-N task was used as a measure of cardinality, following the adapted procedure outlined by Cahoon et al. (2021). Children were asked to place a given number of plastic fruit on a plate across 3 blocks of 5 trials, using the numbers 3, 4, 6, 11 and 15. The final score was the number of correct trials out of a possible 15. Number Comparison (adapted from Nosworthy et al., 2013). This task measures children’s digit-comparison abilities. Two digits (1-9) were presented side by side on the screen of a tablet and the child was asked to tap the larger of the two. The final score was the proportion correct out of all items answered within 1 minute. Number Naming (Nosworthy et al., 2013). As a measure of symbolic number knowledge, children were presented with each digit from 1-9 twice on a screen in random order, yielding 18 digits in total. The researcher pointed at each digit in turn, asking the child, “What number is this?”. The score was the number of correct items out of a possible 18. Order Processing (Cahoon et al., 2021). Children were presented with sets of three number cards, each containing one Arabic numeral (1-9), which they were asked to place in order from smallest to biggest. Following 4 practice trials, there were 12 main trials; the task ended after six cumulative mistakes. A total score out of 12 was used for analysis. British Ability Scale - Pattern Construction. The Pattern Construction scale from the third edition of the British Ability Scale (BAS3; Elliott & Smith, 2011) was used as a measure of spatial ability. This scale requires children to copy spatial patterns using wooden blocks, foam squares and plastic cubes with different patterned and coloured sides. A standardised t-score based on the child’s age in months was used for analysis.
Executive Function. Corsi Blocks Task (following Blakey et al., 2020). This is a measure of children’s visuospatial short-term memory. Nine wooden blocks were attached to a white piece of cardboard in a random array. The researcher tapped blocks in a pre-set random order and the child was instructed to tap the same blocks. For each span level (e.g., 2-block sequences), the child completed 3 trials; if 2 or more trials were correct, the child progressed to the next span level (up to 6-block sequences). The variable used for analysis was the overall number of correct trials, regardless of sequential order. Mr Ant (Howard & Melhuish, 2017) is a visuospatial memory task presented on a tablet, in which the child is asked to remember the location of colourful ‘stickers’ placed on different body parts of a cartoon ant. In each trial, the stickers are presented one after the other. A blank ant then reappears and the child is asked to indicate where the stickers had previously been by tapping those locations. There are three trials in each block, with the child progressing to the next block if they are correct on at least one trial, regardless of sequential order. Scores were calculated as one point for each consecutive level, beginning from the first, on which 2 or 3 items were correct; then, from the first level with only 1 item correct, 0.33 points were awarded for each correct item. Rabbits & Boats (Howard & Melhuish, 2017) is a tablet-based shifting task, based on a traditional card-sort task. Across three blocks, the child must sort cards first according to colour (red/blue), then according to shape (rabbit/boat), and finally by switching the rule depending on whether or not the card has a black border. Each block contains 6 trials, and the child must get at least 5 trials correct on blocks 1 and 2 in order to progress to block 3. A switch accuracy score, calculated as the sum of correct responses in blocks 2 and 3, was used for analysis.
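The Mr Ant scoring rule can be expressed compactly. The following is a minimal illustrative sketch in Python (not the study's scoring code; the function name and input format are assumptions), taking the number of correct items per level as input:

```python
def mr_ant_score(correct_per_level):
    """Score the Mr Ant task from the number of correct items (0-3) per level.

    One point is awarded for each consecutive level, beginning from the
    first, on which 2 or 3 items were correct; from the first level with
    fewer than 2 items correct onwards, 0.33 points per correct item.
    (Illustrative sketch only; not the study's own code.)
    """
    score = 0.0
    level = 0
    # Full points for the initial run of levels passed with 2 or 3 correct.
    while level < len(correct_per_level) and correct_per_level[level] >= 2:
        score += 1.0
        level += 1
    # Partial credit (0.33 per correct item) for all remaining levels.
    for n_correct in correct_per_level[level:]:
        score += 0.33 * n_correct
    return score
```

For example, a child who passes the first two levels, gets 1 of 3 correct on the third, and 0 on the fourth would score 1 + 1 + 0.33 = 2.33.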
Fish-Shark Go/No-Go (Howard & Melhuish, 2017) is a tablet-based task of inhibitory control. Fish and sharks move across the screen, one by one in pseudo-random order, and the child is instructed to tap the fish (go trials) and not tap the sharks (no-go trials). There were 3 blocks of 25 trials, each consisting of 20 go trials and 5 no-go trials. Proportional go and no-go accuracy scores were multiplied to create an overall impulse-control score, which was used for analysis. In addition, data reduction (via exploratory factor analysis) was employed to investigate our hypothesised one-factor latent structure of EF skills for this sample (in line with the existing literature in this age group, e.g., Wiebe et al., 2011; Coolen et al., 2021). A single factor with an eigenvalue greater than 1 was identified, accounting for 47% of the variance in EF scores, and EF latent factor scores were produced. These latent factor scores provided a single variable for EF comparable to the single composite variable for overall numeracy (EYTN). For brevity, information on the reliability of these measures is detailed in the Supplementary Online Materials.
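The data-reduction step above can be illustrated as follows. This is a hedged Python sketch on simulated data (the study's analyses were run in SPSS/R, not with this code); the sample size matches the study, but the loadings and noise levels are assumptions for illustration only:

```python
import numpy as np
from sklearn.decomposition import FactorAnalysis
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Simulate scores for the four EF tasks (Corsi, Mr Ant, Rabbits & Boats,
# Fish-Shark) as loading on one latent EF factor plus task-specific noise.
n_children = 193
latent_ef = rng.normal(size=(n_children, 1))
loadings = rng.uniform(0.5, 0.9, size=(1, 4))   # assumed loadings
ef_tasks = latent_ef @ loadings + rng.normal(scale=0.6, size=(n_children, 4))

# Kaiser criterion: retain factors whose eigenvalue of the correlation
# matrix exceeds 1 (here, a single factor is expected).
z = StandardScaler().fit_transform(ef_tasks)
eigenvalues = np.linalg.eigvalsh(np.corrcoef(z, rowvar=False))[::-1]
n_factors = int((eigenvalues > 1).sum())
variance_explained = eigenvalues[0] / ef_tasks.shape[1]

# Extract one latent EF factor score per child.
fa = FactorAnalysis(n_components=1, random_state=0)
ef_factor_scores = fa.fit_transform(z)[:, 0]
```

The single column of `ef_factor_scores` then plays the same role as the EYTN composite does for numeracy: one summary variable per child.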
Data Analysis Plan: Transparency and Openness Section
We pre-registered the trial design and measures prospectively on the Open Science Framework, before data were collected [ANON]. As recommended by the APA Journal Article Reporting Standards (JARS) for quantitative, qualitative, and mixed methods research, we report how we determined our sample size, all data exclusions (none were employed), all manipulations (no data transformations were employed), and all measures in the study. Anonymized data and analysis code are available at [ANON]. Our planned child-level efficacy outcome variables were the early mathematics and EF measures, as reported at [ANON]. An intention-to-treat analytical approach (with all children in settings randomized to the intervention included in the intervention arm) was employed, consistent with other educational trials (e.g., Brown et al., 2023). The efficacy analysis was carried out using IBM SPSS v29.0. The network analyses were exploratory and were conducted in R (version 4.2.2; R Core Team, 2022) using the packages qgraph (version 1.9.3; Epskamp et al., 2012), bootnet (version 1.5; Epskamp et al., 2018) and networktools (version 1.5.0; Jones, 2022).
Pre-registered:
Intervention Efficacy Analyses. Target sample size (N = 240 children) was determined a priori using G*Power 3.1 (Faul et al., 2009) to afford power greater than .80 to detect a small (f2 = 0.10, as expected for educational interventions) interaction effect of intervention arm (intervention, practice-as-usual) and timepoint (pre-intervention, post-intervention), with alpha = .05, a repeated-measures correlation of .8, and up to 20% attrition. Due to ongoing COVID-19 impacts (e.g., nursery staff turnover, lower time availability for settings), one setting withdrew from the study before pre-intervention assessments and the parents of one child withdrew their data. The final sample was N = 193. No data were excluded. Deviations from pre-registration. We had planned to use two-way mixed ANCOVAs, but missing data (average univariate missingness = 5.8%; maximum univariate missingness = 17.6%) and distributional violations required approaches that deviated from the pre-registered analyses. Multi-level linear modelling (MLM) with restricted maximum likelihood estimation (REML) was employed to model main effects over and above Time 1 individual differences, because this approach is robust to small-to-moderate proportions of missing data and to distributional violations (Snijders & Bosker, 2012). MLMs modelled the effects of Time (Time 1, Time 2), Intervention group (Control, Intervention), and Early Years Pupil Premium eligibility (EYPP; Yes, No, Unknown). Time and participant data were modelled as repeated effects. Setting-level differences in baseline interactional quality (SSTEW; Siraj et al., 2015; BLINDED) were modelled as random effects. Child-level data were nested within settings, allowing setting-level variables (e.g., baseline differences in interactional quality, SSTEW) and child-level variables (e.g., EYPP eligibility) to be modelled together. We computed effect sizes using Hedges' g.
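The nesting structure of such a model (two timepoints per child, children nested in settings) can be sketched as follows. This is an illustrative Python sketch on simulated data, not the study's SPSS analysis; the effect sizes, variance components and column names are all assumptions:

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(1)

# Simulate two timepoints for children nested in 15 settings (8 intervention,
# 7 control), with random setting and child intercepts and an assumed true
# time-by-group interaction (intervention benefit) of 1.0 points.
rows, child_id = [], 0
for setting in range(15):
    intervention = int(setting < 8)
    setting_effect = rng.normal(scale=0.5)
    for _ in range(13):
        child_effect = rng.normal(scale=0.8)
        for time in (0, 1):
            y = (10 + 2 * time + 1.0 * time * intervention
                 + setting_effect + child_effect + rng.normal())
            rows.append(dict(setting=setting, child=child_id, time=time,
                             intervention=intervention, y=y))
        child_id += 1
df = pd.DataFrame(rows)

# Mixed model fitted with REML: random intercepts for settings (groups)
# and for children nested within settings (variance component).
model = smf.mixedlm("y ~ time * intervention", data=df, groups="setting",
                    re_formula="1", vc_formula={"child": "0 + C(child)"})
fit = model.fit(reml=True)
interaction = fit.params["time:intervention"]  # the efficacy term of interest
```

The `time:intervention` coefficient plays the role of the pre-registered arm-by-timepoint interaction: a positive estimate indicates greater pre-to-post gains in the intervention arm than under practice as usual.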
Exploratory:
Network Analyses. To explore the structure of the relationships between all EF and mathematics variables at once, rather than focusing on bivariate correlations or univariate changes from pre- to post-intervention, we implemented Gaussian graphical network models based on a regularised partial correlation network using Spearman correlations (Epskamp & Fried, 2018). The EF and mathematics tasks were represented as nodes in each network, while the partial correlations between tasks represented the network edges (i.e., connections between nodes). To test whether the integrated intervention led to greater changes in network structure than practice as usual, we assessed overall network change by calculating the correlation between all edge weights pre- and post-intervention, separately in the intervention and control groups. To further characterise the estimated networks, we tested the relative importance of each node by calculating centrality indices: Strength, Expected Influence, Closeness, and Betweenness, which all characterise the connectedness of nodes in a network. Strength is the sum of the absolute values of all edges (i.e., correlations) connected to a particular node (e.g., all paths to a mathematics node). In contrast, Expected Influence takes into account whether an edge has a particular sign (positive or negative). Betweenness refers to how often a node lies on the shortest path between other nodes, and Closeness refers to the mean distance from a node to all other nodes in the network. Node and edge stability analyses are reported in Figures S4 and S5. In addition to interrogating the importance of individual nodes, we tested whether there were any prominent bridge nodes between the EF and mathematics nodes, i.e., nodes in one group that are most strongly connected to the nodes of the other group.
The detection of bridge nodes enabled us to determine the strongest links between domains, i.e., which EF node was most strongly connected to the mathematics nodes, and vice versa. Finally, to determine whether there were clusters of nodes in the network, and whether the cluster structure changed with the intervention, we ran a cluster analysis. In graph-based approaches, the presence of clusters indicates that some sets of nodes are more strongly interrelated than others; cluster membership is determined via a data-driven approach.
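The core estimation pipeline can be sketched as follows. This Python sketch swaps the R qgraph/bootnet stack used in the study for scikit-learn's graphical lasso, with a fixed regularisation penalty rather than EBIC-based selection (both substitutions are assumptions for illustration); it estimates a regularised partial-correlation network from Spearman correlations on simulated data and computes the Strength and Expected Influence indices:

```python
import numpy as np
from scipy.stats import spearmanr
from sklearn.covariance import graphical_lasso

rng = np.random.default_rng(2)

# Simulate 193 children on 10 tasks (e.g., 4 EF + 6 mathematics nodes)
# sharing a common factor, so the tasks are positively intercorrelated.
n_children, n_tasks = 193, 10
common = rng.normal(size=(n_children, 1))
scores = (common @ rng.uniform(0.3, 0.7, size=(1, n_tasks))
          + rng.normal(size=(n_children, n_tasks)))

# Spearman correlation matrix between all tasks.
corr, _ = spearmanr(scores)

# L1-regularised precision matrix; the penalty alpha is an assumption here
# (qgraph's EBICglasso selects it via the extended BIC instead).
_, precision = graphical_lasso(corr, alpha=0.1)

# Partial correlations (network edges) from the precision matrix.
d = np.sqrt(np.diag(precision))
edges = -precision / np.outer(d, d)
np.fill_diagonal(edges, 0.0)

# Centrality: Strength sums absolute edge weights at each node;
# Expected Influence sums signed edge weights.
strength = np.abs(edges).sum(axis=0)
expected_influence = edges.sum(axis=0)
```

Bridge indices extend the same idea across the two node groups: for each EF node, sum its edges to the mathematics nodes only (and vice versa), and the node with the largest such sum is the strongest bridge between domains.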