1. Study design and population
Data for this study were obtained from the Kailuan cohort, a prospective cohort located in northern China. In 2006, employees of the Kailuan Group, including retired employees, were invited to participate, and a total of 101,510 employees agreed to take part and provided signed consent forms. The participants underwent questionnaire interviews, health examinations, and laboratory assessments at Kailuan Hospital. Subsequent medical examinations and questionnaires were administered every 2 years. Participants who underwent fewer than three medical examinations between 2006 and 2010 were excluded. All personal information of the participants included in the final analysis was kept confidential. The study protocol adhered to the guidelines of the Declaration of Helsinki and was approved by the Ethics Committee of Kailuan Medical Group, Kailuan Company.
2. Variable evaluation
All participants underwent fasting blood tests, including measurements of CRP, fasting blood glucose, and lipids. Participants with missing blood test results for any of the three medical examinations were excluded. To ensure stability and scientific rigor, participants with CRP levels >15 mg/L were also excluded. Blood measurements were performed using automated analyzers following standard operating procedures. Height and weight were measured by experienced nurses, and the body mass index (BMI) was calculated as weight (in kilograms) divided by height (in meters) squared.
The demographic information included sex, age, educational attainment, and income. Educational attainment above high school level was classified as “high,” and anything below that was considered “low.” Family monthly income exceeding 1000 yuan per person was classified as “high income,” whereas incomes below this threshold were considered “low income.” Lifestyle data, including smoking, alcohol consumption, and physical exercise, were obtained from the patients’ responses to the lifestyle questionnaire.
The primary outcome of this study was cancer incidence, and the secondary outcome was mortality. A prospective follow-up design was employed, tracking all participants from the baseline assessment conducted in 2010-2011 until the occurrence of cancer or death. Death data were obtained from the death certificates provided by the National Vital Statistics Bureau. The follow-up period spanned from the baseline assessment until 2021, during which all outcome events were carefully recorded.
3. Calculation of CRP variability
CRP was measured during three distinct periods: 2006-2007, 2008-2009, and 2010-2011. To assess the CRP variability, we used the CV, a widely accepted measure of variability. The CV was calculated by dividing the mean by the standard deviation (SD).
To strengthen the robustness and reliability of our findings, we also included the variability independent of the mean (VIM) as a supplementary measure. The VIM provides additional insights into the variability of CRP independent of its mean value. The VIM is a statistical transformation of the SD that is specifically designed to be uncorrelated with the mean levels. It was calculated by fitting a curve of the form y = kxp to a scatter plot of the SD CRP (y-axis) against the mean CRP (x-axis) for all individuals in the cohort. The parameter p is estimated from the data, and k is a constant that can be chosen to ensure that the VIM is on the same scale as the SD. Specifically, if M represents the average value of the mean CRP in the cohort, then k is determined as Mp, and the VIM CRP value for any individual is calculated as VIM CRP = (k SD/xp). This statistical approach allows the assessment of CRP variability independent of its mean, providing additional insights into the relationship between CRP fluctuations and cancer occurrence. The CV and VIM were categorized into three groups based on tertiles: low, moderate, and high.
VIM = k × SD(CRP)/Mean(CRP)x
SD(CRP) = constant × Mean(CRP)
k = Mean (Mean(CRP))x
4. Statistical analysis
Normally distributed continuous variables were summarized using the mean and SD and analyzed using t-tests. Non-normally distributed continuous variables were analyzed using the Wilcoxon rank-sum test. Categorical variables were analyzed using the chi-squared test. This approach ensured the appropriate analysis of different variable types.
We employed Cox regression analysis to investigate the relationship between CRP variability and the risk of cancer. Model A was unadjusted, whereas Model B was adjusted for sex, age, education, and income. In addition to the adjustments made in Model B, Model C was further adjusted for BMI, smoking, alcohol consumption, and physical exercise. We created Restricted Cubic Spline (RCS) plots to visually depict the intricate relationship between continuous CV of CRP and the risk of cancer. Furthermore, we fitted the model using the "TRAJ" program in SAS version 9.4 (SAS Institute) to group individuals with similar patterns of CRP change between 2006 and 2010. We then delved into examining the association between CRP variability and cancer occurrence, specifically within subgroups characterized by distinct patterns of change in CRP trajectories. We counted the incidence of different site-specific cancers during the follow-up period and explored the association between CV of CRP and site-specific cancers. We conducted stratified analyses to examine whether this association persists across populations with different characteristics. Several sensitivity analyses were conducted to ensure the robustness of the results and eliminate confounding factors. Owing to the potential lag in cancer incidence, participants who developed cancer within the initial 2-year follow-up period were excluded from the study. To address the impact of extreme values in CRP measurements on the results, participants with CRP levels exceeding 10 mg/L were also excluded. Acknowledging the potential impact of metabolic factors, including blood glucose and lipid levels, on cancer incidence, we included additional adjustments in our analysis. Furthermore, to emphasize that the identified association was specifically attributed to CRP variability during the follow-up period, we controlled for baseline CRP levels. Additionally, our analyses considered aspirin use as a relevant factor.
VIM as a complementary indicator of variability, we additionally used COX regression analyses and RCS plots to explore the association between VIM of CRP and cancer incidence. In addition, we used Cox regression analysis to explore the association between the CV and death, as well as cancer-specific mortality. The competing risk plots illustrated the relationships between different levels of CRP variability and the occurrence of cancer and death.
All statistical tests were two-sided and were considered significant at P < 0.05. Statistical analysis was performed using SAS (version 9.4) and R (version 4.2.3).