Participants
The study recruited 1,104 sixth-grade students from nine elementary schools in middle- to low-income areas across five provinces in Korea. These students had been studying English as a regular subject for more than three years. However, 196 students were excluded because of absences, missing test values, or failure to submit questionnaires, resulting in a final sample of 908 participants. The data were collected over three years, with measurements taken at similar times each year to ensure consistency. All the participants attended public schools following the Korean National English Curriculum, with a shared achievement goal. The data were collected at the start of the second semester in 2021 (N = 598), 2022 (N = 64), and 2023 (N = 246), reflecting the typical English proficiency and learning experiences of Korean sixth-grade students. The schools were selected to represent the characteristics of sixth graders across different regions, with each school having approximately 20 students per class and more than 80 sixth-grade students, indicating that they were large-scale schools.
Procedure
Tests were prepared and administered by Korean and native-speaker teachers (in 3 schools by both Korean and native-speaker teachers and in 6 schools by Korean teachers only). Extensive pilot work was conducted with researchers in each region to ensure the appropriateness of the tests, particularly for the phonological awareness tests formulated for this study.
To administer the TOWRE Nonword and word Reading Test, the examiner ensured a quiet environment free from distractions and prepared a stopwatch and scoring sheet. The examiner handed the list of nonwords and words to the students, instructing them to read as many nonwords and words as possible, as quickly and accurately as possible, within 45 seconds. The examiner started the stopwatch when the student began reading, marking each nonword and word read correctly on the scoring sheet. If the student skipped a word or remained silent for more than 3 seconds, the examiner marked the word incorrectly and prompted them to move to the next one, continuing until 45 seconds elapsed.
Standardized instructions were provided to ensure consistency across testers. Before beginning each task, the students received verbal and visual instructions and three examples of the task. The first reading test was administered two weeks after the students began the first grade, and the second test was administered two weeks later. The average of the two test scores was used. Surveys were conducted at each school to investigate the students' learning backgrounds. Written consent was obtained from the children's parents and teachers at the start of the study.
Measurement Variables
The study evaluated students' second language (L2) reading-related abilities, specifically decoding skills, via nonword and word reading tests. These tests aimed to determine reliance on phoneme‒grapheme correspondence (phonological route) and word recognition (lexical route).
Nonword Reading Test: Administered using two forms (A and D) of the Test of Word Reading Efficiency (TOWRE-2). The participants had 45 seconds to read as many pronounceable nonwords as possible. The number of correctly read nonwords was used to measure phonological decoding efficiency. The average score from both forms provided insights into participants' phonological decoding abilities.
Sight Word Efficiency (SWE) Test: In addition, from TOWRE-2, this test involved reading a list of high-frequency words within 45 seconds. The participants' scores were based on the number of correctly read words, indicating their sight word vocabulary size and ability to recognize words quickly. Two forms (A and D) were used, and the test was reliable, with Cronbach’s alpha values of 0.925 for nonword reading and 0.938 for word reading. The students were categorized into three groups on the basis of the Z scores derived from the tests. Table 2 provides details on the classification of reading ability groups by standard scores.
Table 2
Classification of Reading Ability Groups by Standard Scores
Group Classification | General Reading (GR) | Reading Difficulty (RD) | Severe Reading Difficulty (SRD) |
Z score range | Z score > -1.0 | -1.0 ≥ Z score ≥ -1.5 | Z score <-1.5 |
Analysis methods
The analysis was conducted in two main stages to address the research questions. First, profile analysis techniques, ANOVA, and chi-square tests were used to examine the common characteristics of the learners' educational backgrounds. This initial step involved developing several latent profile models. To ensure accurate results, skewed variables were normalized via transformation methods in SPSS (version 24.0), and all continuous variables were standardized. The students were then grouped into different latent profiles on the basis of their reading learning backgrounds. These classifications were saved and analyzed concerning various covariates. Multinomial logistic regression analysis was employed for the second research question, which explored the relationship between the identified profiles and English reading proficiency. This analysis assessed how the different profiles related to students' English reading ability.
Latent profile analysis (LPA) was used to classify students into latent subgroups on the basis of their learning backgrounds. This technique, which is suitable for heterogeneous populations, estimates the probability of individuals belonging to latent subgroups and provides precise classification indices. Model fit was evaluated via several criteria:
The information criteria used were the Akaike information criterion (AIC), Bayesian information criterion (BIC), and sample size-adjusted BIC (aBIC), with lower values indicating a better fit.
Entropy: Values close to one indicate good classification accuracy.
Statistical Testing: Parametric bootstrap likelihood ratio difference (BLR) and Lo‒Mendell‒Rubin adjusted (LMR) tests, with significance (p < .05) favoring models with more profiles.
LPA was conducted via Mplus 8.7, which handles missing data with full information maximum likelihood (FIML). Models with different numbers of profiles were compared on the basis of their absolute and relative fits. Chi-square tests and ANOVA were used to examine differences in individual, home, regional, and school characteristics among the derived profiles.
To address the second research question, multinomial logistic regression was used to analyze the impact of six independent variables on students' reading profiles:
Home English environment: Measured through items on English and Korean books owned or borrowed.
Parental Support: Evaluated on the basis of parental involvement in English education.
Preschool English Experience: Assessed through items about English learning in kindergarten.
Postschool English Experience**: Includes items on English learning outside of school.
Interest in English: Rated through one item on interest in English.
Necessity of English: Rated through one item on the perceived necessity of English.
All the variables were rated on a 4-point Likert scale, where higher scores indicated better conditions or higher levels. The regression model aimed to predict students' reading profiles on the basis of these learning background factors.