Study design and participants
Details on the study design, sampling methods, response rates and some results of the CHNS have been published elsewhere 9-13. Briefly, CHNS is an ongoing multipurpose longitudinal cohort study initiated in 1989, and has been followed up every 2-4 years. A multistage, random cluster approach was used to draw the sample from 9 provinces (from north to south) and 3 largest autonomous cities in mainland China. The CHNS rounds have been completed in 1989, 1991, 1993, 1997, 2000, 2004, 2006, 2009, 2011 and 2015. In each survey round, demographic, socioeconomic, lifestyle, nutritional and health information were collected.
In this analysis, we conducted a prospective cohort study using seven waves (1997 to 2015) of the CHNS data. In this cohort, members were surveyed in at least two study rounds, and the first survey round is considered as baseline. We excluded participants who were pregnant, <18 years old, or having diabetes at baseline. Participants with missing dietary carbohydrate data or with extreme dietary energy data (male: >4200 or <600 kcal/day; female: >3600 or <500 kcal/day) were further excluded 14. Finally, 16,260 participants were included in this analysis (eFigure 1).
The institutional review boards of the University of North Carolina at Chapel Hill and the National Institute of Nutrition and Food Safety, and Chinese Center for Disease Control and Prevention, approved the study. Each participant provided their written informed consent.
Dietary nutrient intakes
Dietary measurements in CHNS are described in detail elsewhere 15. Briefly, both individual and household level data were collected in each survey round. Dietary information was collected by 3-day dietary recalls through questionnaires, in combination with using a 3-day food-weighed method to assess cooking oil and condiment consumption. The three consecutive days were randomly allocated from Monday to Sunday and are almost equally balanced across the seven days of the week for each sampling unit. Nutrient intakes were calculated using the China food composition tables (FCTs). The accuracy of 24-hour dietary recall designed to assess energy and nutrient intake has been validated 15.
In this study, three-day average intakes of dietary macronutrients and micronutrients in each round were calculated. We evaluated energy-adjusted nutrient intake for each nutrient using sex specific linear regression models 16. Cumulative average intake values of each nutrient were calculated for each participant, using all results up to the last visit prior to the date of new-onset diabetes, or using all results among those without new-onset diabetes, to reduce within-subject variation and represent long-term dietary intake and minimize within-person variation.
Definition of the low-carbohydrate diet (LCD) scores
In our present study, macronutrients were divided into high- and low-quality carbohydrate, plant-based protein and fat, as well as animal-based protein and fat. Food sources constituting these subtypes are shown in the eTable 1 17,18.
Low-carbohydrate diet (LCD) scores was defined following an established method 19. First, the study participants were divided into 11 strata each of fat, protein, and carbohydrate intake, expressed as a percentage of energy (eTable 2). For fat and protein, participants in the highest strata received 10 points for that macronutrient and participants in the lowest strata received 0 points. For carbohydrate, the lowest strata received 10 points and the highest 0 points. The scores of three macronutrients were summed to create the overall LCD scores, which ranged from 0-30 (a higher score reflects less intake of carbohydrate and more intake of fat and protein, while a lower score reflects more intake of carbohydrate and less intake of fat and protein).
A total LCD score for low-quality carbohydrate was built based on percentage of energy intake from low-quality carbohydrate, total protein and total fat. Furthermore, we also built a plant-based LCD score for low-quality carbohydrate, based on percentage of energy intake from low-quality carbohydrate, plant protein and plant fat; and an animal-based LCD score for low-quality carbohydrate based on percent of energy from low-quality carbohydrate, animal protein and animal fat. As such, each participant was given the overall, plant-based and animal-based scores. (eTable 1, eTable 2). In detail, a higher total LCD score reflects less intake of low-quality carbohydrate and more intake of total fat and total protein, a higher plant-based LCD score reflects less intake of low-quality carbohydrate and more intake of plant fat and plant protein, and a higher animal-based LCD score reflects less intake of low-quality carbohydrate and more intake of animal fat and animal protein.
Assessments of blood pressure and covariates
After the participants had rested for 5 minutes, seated blood pressure was measured by trained research staff using a mercury manometer, following the standard method. Triplicate measurements on the same arm were taken in a quiet and bright room. The mean systolic blood pressure (SBP) and diastolic blood pressure (DBP) of the three independent measures were used in analysis.
Demographic and lifestyle information was obtained through questionnaires, including age, sex, smoking status, alcohol consumption, occupation, education level, residence, regions, and concomitant diseases. Body height and weight were measured following a standard procedure with calibrated equipment. Body mass index (BMI) was calculated as weight (kg) by height squared (m2). Physical activity was collected by staff-administered questionnaires exploring all occupational, transportation, domestic and leisure activities in adults.
Study outcome
The participants had been asked to report their diabetes status with a questionnaire-based interview at each follow-up. New-onset diabetes were confirmed if the answer was “yes” to the question “has a doctor ever told you that you suffer from diabetes?”. In addition, blood samples were collected and assayed only in 2009. Therefore, in 2009, outcome was also ascertained by an additional criterion: fasting blood glucose ≥ 7.0 mmol/L or glycated hemoglobin [HbA1c] ≥ 6.5%) 20, 21.
When a participant was first identified with new-onset diabetes in a following survey, the middle date between this and the nearest survey before was used to calculate the follow-up time. For those free of diabetes in all following surveys, the last survey date was used to calculate the follow-up time.
Statistical Analysis
Intake of dietary carbohydrate were expressed as the percentage of total energy using nutrient density method 22 and then categories into quartile (<49, 49 -<56, 56 -<63, ≥63 of energy intake from carbohydrate intake). Population characteristics are presented as mean ± standard deviations (SDs) for continuous variables, and proportions for categorical variables. Differences in population characteristics by quintile of carbohydrate intake were compared using ANOVA tests, or chi-square tests, accordingly.
We used Cox proportional hazards models to calculate hazard ratios (HRs) and 95% confidence intervals (CIs) for the risk of new-onset diabetes. Model 1 included adjustments for age, sex, and body mass index (BMI). Model 2 included adjustments for age, sex, regions, BMI, smoking status, systolic blood pressure (SBP), diastolic blood pressure (DBP), education level, urban or rural residence, occupation, physical activity, as well as total energy intake, sodium intake, potassium intake and fiber intake. Furthermore, we used a cubic B-spline with 4 knots (20%, 40%, 60%, 80% of carbohydrate intake) to display the potentially non-linear relationship of total, high-quality and low-quality carbohydrate, and total, animal-based and plant-based LCD scores for low-quality carbohydrate with new-onset diabetes in a more intuitive way with adjustments for covariates in Model 2.
Moreover, possible modifications of the association between total, high-quality and low-quality carbohydrate intake and new-onset diabetes were evaluated for the following variables: age (<60 vs. ≥60 years), sex, BMI (<24 vs. ≥24 kg/m2), waist circumference (<80 vs. ≥80 cm), total protein intake ([median] <12 vs. ≥12 % of energy), and total fat intake ([median] <31 vs. ≥31 % of energy), and interactions between subgroups and carbohydrate intake were examined by likelihood ratio testing.
We consider a two-side P value<0.05 as statistically significant in all analysis. All statistical analyses were conducted using R version 3.6.1.