Development and validation the uterine fibroid symptom and quality of life short-form: based on classical test theory and modern test theory

doi:10.21203/rs.3.rs-3027197/v1

Download PDF

Research Article

Development and validation the uterine fibroid symptom and quality of life short-form: based on classical test theory and modern test theory

https://doi.org/10.21203/rs.3.rs-3027197/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Objectives

Previous research has shown that the self-consciousness and sexual function domain of the uterine fibroid symptom and quality of life (UFS-QoL) questionnaire exhibited poor adaptability and clinical applicability. In light of this, our objective was to develop a shorter version of the UFS-QoL questionnaire, while ensuring its reliability, validity, and clinical applicability, in order to enhance routine clinical assessments.

Methods

A secondary analysis was conducted on a prospective cohort study involving 20 Chinese hospitals and 2,411 Chinese women with fibroids. We employed both classical test theory (CTT) and item response theory (IRT) to select two items for each UFS-QoL subscale. In cases where inconsistencies arose from these two methods, we utilized responsiveness to identify items with a higher ability to detect change in clinical therapy. The short-form scale was evaluated for dimensionality, internal consistency reliability, and criterion and known-group validity through factor analysis, Cronbach's alpha, Spearman correlations, and independent samples t-tests. To assess the ability to detect change, we compared pre-treatment and post-treatment (6 and 12 months) scores using effect size and standardized response mean.

Results

Considering the poor adaptability and clinical applicability, we made the decision to remove the self-consciousness subscale and retain only one item from the sexual functioning subscale. As a result, we administered and validated the 11-item short-form version of UFS-QoL (UFS-QoL-11). UFS-QoL-11 exhibited a strong correlation with UFS-QoL in each subscale, with correlation coefficients exceeding 0.7 (all P < 0.001). Exploratory factor analysis identified three subscales (symptom severity, concern, and daily life interference) with eigenvalues > 1 for UFS-QoL-11. The test items accounted for 64.46% of the total variance, slightly higher than the parent scale score of 63.60%. UFS-QoL-11 demonstrated excellent internal consistency and reliability across all subscales (> 0.7), representing a slight improvement compared to the full UFS-QoL. Furthermore, UFS-QoL-11 exhibited sensitivity in detecting varying levels of current health status, particularly at 6 and 12 months post-surgery. The responsiveness of UFS-QoL-11 was comparable to that of UFS-QoL, with all effect sizes > 0.5, demonstrating a strong ability to detect changes.

Conclusion

UFS-QoL-11 performs as well as the original version, while reducing the administrative burden by 70%. We firmly believe that the implementation of the newly developed UFS-QoL-11 has the potential to facilitate widespread, accurate, timely, and sensitive monitoring in both clinical and research settings.

uterine fibroid symptom and quality of life

UFS

QoL

uterine fibroids

health-related quality of life

Chinese women

fibroids

UFS-QoL-11 significantly reduces the administrative burden by 70%.
The short form yielded nearly identical results compared to the 37-item full scale.
Implementing the newly developed UFS-QoL-11 has the potential to enhance widespread, accurate, timely, and sensitive monitoring in clinical and research settings.

The lifetime risk of developing uterine fibroids (UFs) in women aged > 45 years is up to 60%.[1] Patients with UFs experience a high symptom burden, including moderate to severe abdominal pain,[2] low back pain, urinary frequency and urgency, pain during intercourse,[2] and vaginal bleeding.[3] Given the high prevalence and symptom burden among women, UFs impose a significant healthcare burden on women's individual health[4] as well as on the healthcare and social security systems as a result of work productivity loss during treatment and disease recurrence.[5] Consequently, measuring UF-related symptoms and QoL status in a valid and reliable manner could support high-quality practice and comprehensive patient management.

As the only questionnaire designed to assess the whole spectrum of fibroid-related symptoms and their impact on QoL, the English version of the uterine fibroid symptom and quality of life (UFS-QoL) questionnaire was published in 2002.[6] The UFS-QoL questionnaire consists of 37 items, eight of which assess the severity of symptoms (single domain) and 29 assess health-related quality of life (HRQL) in six subscales (concern, activities, energy/mood, control, self-consciousness, and sexual function). All the responses were classified into five Likert-scale options. The UFS-QoL questionnaire has been used and validated in Brazilian Portuguese,[7, 8] Spanish,[9] traditional Chinese,[10] and simplified Chinese[11, 12] as a disease-specific measure of health-related QoL.

For an instrument to be clinically useful, an obstacle to its routine use is its length. Although, the 37 items of the scale assess overall patient information to evaluate their symptoms and quality of life, the length might deter its clinical efficacy, as it is time-consuming and difficult to use efficiently in daily high-frequency clinical care, especially in primary hospitals with low resource settings. Shorter questionnaires that improve response rates and reduce participant burden are recommended as follow-up measures.[13, 14] Our group evaluated the adaptability and clinical applicability of the Chinese UFS-QoL.[12] We found that the symptom severity, activity, and mood subscales of the Chinese UFS-QoL scale were valid and reliable. However, the self-consciousness domain exhibited poor internal consistency and detectability, we pointed out before applying the UFS-QoL to Chinese patients, the cross-cultural test criteria of adjusting the subscales to the Chinese culture, shortening or deleting the poor discrimination items to ensure the scale validity, and further optimizing and reevaluating the practicability of these items, should be considered. Therefore, there are both practical application challenges and psychometric evidence to pursue the construction of a shorter version of the UFS-QoL while maintaining its reliability, validity, and clinical applicability and enhancing routine clinical assessments.

In this study, we employed different psychometric techniques from both classical test theory (CTT) and modern test theory (item response theory [IRT])[15] to develop a short-form version of the UFS-QoL (UFS-QoL-SF). While CTT identifies the basic properties of the data, such as eigenvalues and factor loadings, IRT provides more detailed information at the item level. These techniques complement each other when developing short scales.[15, 16] Using the UFS-QoL-SF, we designed it to reduce the respondent burden and be short enough for use in clinical practice and research. We predicted that the UFS-QoL-SF would have similar psychometric properties and reproduce scores that would be useful in assessing the efficacy of therapies without the loss of information compared with the long version of the UFS-QoL.

To develop the short-form scale, we extracted data from a 20-centered prospective cohort study (Uterine fibroids multicenter network information system: www.hifuctr.com) for a second analysis, which included patients who underwent self-selected hysterectomy, myomectomy, or high-intensity focused ultrasound (HIFU) therapy after being fully informed of the treatment options (the multicenter study was approved by a China-registered clinical trial ethics committee (ChiECRCT-2011034).[17] Details regarding the study design, data collection, and primary outcomes regarding the efficacy of the treatment have been published.[17] Prior to undergoing any study-related procedures at the clinical site, the patients completed the UFS-QoL questionnaire, the study short-form-36 (SF-36), and a brief sociodemographic questionnaire. Follow-up visits were scheduled at 6 and 12 months post-surgery, including complications, magnetic resonance imaging evaluation, overall treatment effect evaluation, the UFS-QoL questionnaire (for those who had undergone HIFU or myomectomy, because the instructions of the UFS-QoL questionnaire were based on the presence of uterine fibroids and menstrual periods), SF-36, and several health care utilization items were recorded.

Patient-reported Outcome Measures

Uterine ﬁbroid symptom and quality of life questionnaire (UFS-QoL)

The UFS-QoL was developed from focus groups of women with uterine ﬁbroids.[6, 11] The UFS-QoL questionnaire consists of 37 items, 8 of which assess the severity of symptoms (single domain) and 29 of which assess HRQL with six subscales (concern, activities, energy/mood, control, self-consciousness, and sexual function). All the responses were classified into five Likert-scale options. A higher score on the questionnaire's severity subscale indicates more severe symptoms, whereas a lower score on the HRQL subscales indicates poorer QoL.

Medical outcomes of the study short-form 36 (SF-36)

The Chinese SF-36 is a 36-item self-administered generic measure used to assess general health status. and validated its cross-cultural application, reality, and validity.[18-21] SF-36 consists of eight subscales: physical, functioning, role-physical, bodily pain, general health, vitality, social functioning, role-emotional, and mental health, as well as two composite scores: physical and mental. Individual subscale items were combined to form a subscale rating, which was then converted to a 0–100 scale.[22, 23] Higher QoL scores correspond to a four-week recall period.[23]

Item selection

We selected two items per UFS-QoL scale using a method that maximizes content validity.[13, 24] Due to the poor internal consistency and ability to detect self-consciousness in our previous study,[12] we did not retain the three items of self-consciousness. At the same time, because sexual functioning showed poor adaptability and high correlation between the two items, we only kept one item of the two items. We used the CTT and IRT to select two items for each of the five subscales. For the inconsistent items extracted from the two methods, we used responsiveness to choose items with higher ability to detect changes in clinical therapy. Therefore, we administered the 11-item version (UFS-QoL-11) to test its factor structure, criterion-related validity, and responsiveness, following the recommendations for short-form scale development. A flow chart of item selection is shown in Figure 1.

Based on the CTT to choose shortened items, we followed a method using regression analysis that maximizes the content validity of a 2-item scale. The first item had the highest correlation with the parent scale. The second item selected was the one with the highest beta weight of the remaining items in multiple regression, with the parent scale score as the dependent variable and individual items as predictor variables.

Item response theory offers detailed information at the item level, making it a powerful technique for developing short-form scales.[15] The responses of participants to each item were used to estimate their location on a latent trait (i.e., the level of symptoms). This latent trait is estimated using the responses on all individual items, and not, as in the CTT, via the sum score of all items. Within the IRT, every single item is defined by a discrimination parameter (alpha) and one or more location parameters or threshold parameters. The threshold parameters indicate the location on the scale of the latent continuum where the item best discriminates among individuals. The discrimination parameter reflects the true difference in theta per item and is comparable to factor loading.

To determine the candidate items for the short-form of the scale, we selected the items with two of the highest discriminatory parameters, as given by the IRT analyses. As the UFS-QoL has five ordered response options (1 = “not at all,” 2 = “A little bit,” 3 = “Somewhat,” 4 = “A great deal,” 5 = “A very great deal”), we used the graded response model to estimate the item response parameters. In addition, we inspected the item information curves to select items that covered a similar range of latent traits as the full scale. Finally, we compared the effect sizes of inconsistent items to choose the final items.

Statistical analysis

Spearman's correlation coefficient was used to determine the strength of the UFS-QoL-11 correlation test with the parents’ UFS-QoL. To examine construct validity in the UFS-QoL-11, we employed principal axis factor analysis with orthogonal rotation, which was used in conjunction with orthogonal rotation to determine the final number of factors based on their eigenvalues, congruence, and clinical significance. Each dataset was first analyzed for normality using the Kaiser–Meyer–Olkin (KMO) measure and Bartlett test of sphericity. A KMO value > 0.5 indicated acceptable structural validity. Cronbach's α was used to determine the internal consistency of the UFS-QoL-11. Cronbach's α coefficient, which ranges from 0 to 1, was used to determine the degree to which items on the subscales measure were related to the same concepts. A larger value indicated a smaller measurement error, which indicated a higher level of reliability. Criterion validity was assessed using correlations between baseline scores on the UFS-QoL-11 and SF-36, which were designed to measure general health.

We examined the evidence of known-group validity so that the UFS-QoL-11 could distinguish between clinically distinct groups by testing its ability to differentiate between patients based on health status (defined as SF-36-1). The ability to detect change was evaluated by comparing the 6-month pre-treatment and post-treatment scores to the 12-month scores at 6-month intervals. The effect size (change in mean score divided by baseline standard deviation)[25] and standardized response mean (change in mean score divided by change standard deviation) were computed. A value of 0.2 was considered to have a “small” effect, 0.5 a “moderate” one, and ≥ 0.8 a “large” effect.

Descriptive analyses (means and standard deviations [SD]) were performed using sociodemographic and clinical characteristics. Means of differences, 95% confidence intervals (CI), and statistical significance (P < 0.05) were tested using independent sample t-tests. The questionnaires were scored according to developers' instructions. Version 9.1.3 of SAS was used to conduct the analyses. All statistical tests were predetermined, and no missing data imputations were performed. All statistical tests were conducted with a fixed type I error probability of 0.05 and a two-tailed design.[22]

According to the screening criteria, we chose items based on the CTT and IRT results. All assumptions of IRT were met (i.e., the scale is unidimensional, and there was no noteworthy local dependence and answer authenticity) (see Table S1-S2). The CTT and IRT parameters per item are shown in Table 1. The control and sexual function subscales showed consistent results between the two methods. Table 2 demonstrates a shortened scale with nearly identical responsiveness to UFS-QoL. By comparing the effect size and standardized response means (SRM) in other subscales, the symptom severity and activities subscale showed higher responsiveness to CTT items. However, Concern and Energy/Mood showed a higher ability to detect changes in IRT items. As shown in Table S3, all items had a distribution of <1% at every follow-up. In Figure 2, selected items showed high correlation with UFS-QoL in each subscale, with all correlation coefficient over 0.7 (all P<0.001).

In the factor analysis, values of the KMO test > 0.7, and the statistical significance of Bartlett's test indicated adequate sampling.[26, 27] We used orthogonal rotation to isolate the potential UFS-QoL-11 factors (Table 3). In this study, the value of KMO was 0.86, and the value of Bartlett's sphericity test was 2,536.26 (P < 0.001), indicating that factor analysis was suitable for the data. The UFS-QoL-11 retains three factors: symptom severity, concern, and daily life interference. Overall, the test variables explained 64.46% of the total variance, which was slightly higher than the parent scale score of 63.60%.[12]

Cronbach's alpha was calculated for each UFS-QoL-11 subscale to assess internal consistency. The UFS-QoL-11 questionnaire demonstrated good internal consistency and reliability for all the subscales (> 0.7). There was a small increase in internal consistency for the UFS-QoL compared with the full UFS-QoL (Table 4). The degree of correlation between similar subscales on the UFS-QoL-11 and SF-36 was used to assess the reliability of the correlation testing for the UFS-QoL-11 (Table S4). All domains of the SF-36 correlated positively and moderately with the “daily life interference” domain of the UFS-QoL-11 (r:0.35–0.51, P < 0.001).

The UFS-QoL-11 was sensitive enough to detect varying levels of current health status, particularly at 6 and 12 months post-surgery (Table S5). Patients with poor general health status (those who rated "In general, would you say your health" as fair or poor on the SF-36-1) had statistically significantly higher severity for all symptoms and poorer QoL than those with good general health status (those who rated "In general, would you say your health" as excellent, very good, or good) (all P <.0001), all effect sizes > 0.3 indicated a moderate effect 6 and 12 months post-surgery.

The responsiveness of UFS-QoL-11 was comparable to that of UFS-QoL; all effect size>0.5, displayed a good ability to detect changes (Table 5). After treatment, there was a significant decrease in symptom severity scores and improvement in the concern and daily life interference subscale scores. The mean score change from baseline to the 12-month follow-up for symptom severity was -18.8 (P < 0.001), with an effect size of -0.79 and a SRM of -0.95 (Table 5). The mean change score for another two subscales ranged from 12.2 to 19.0, with effect sizes of 0.70 in both two subscales and SRM ranging from 0.78 to 0.79, 12 months post-treatment.

In this study, we used the largest cohort of patients with UFs through different psychometric techniques, CTT and IRT, to arrive at a reliable and valid short-form of UFS-QoL, which is the only questionnaire designed to assess the symptoms of UFs and their impact on QoL. Importantly, for clinical application, we used responsiveness to confirm the inconsistency between the CTT and IRT. There was a strong correlation between UFS-QoL and its 11-item short-form, UFS-QoL-11, which supports the use of the scale in other populations as well. Factor analysis supported the hypothesized three-factor structure in 11-item, symptom severity, concern, and daily life interference. The scale and its subscales demonstrated good internal consistency, structural validity, known-group validity, and responsiveness. Throughout the treatment process, UFS-QoL-11 would be feasible for monitoring symptoms and QoL in clinical routine and long-term practice, and would enable clinicians and researchers to distinguish between symptoms and QoL changes sensitively.

The CTT and IRT are important approaches for optimizing psychological scales. According to statistics, several items were good candidates for a short-form scale to provide maximum information using two methods. An important barrier was that the statistics did not provide a clear suggestion for the selection of the best items using different approaches.[16] The Food and Drug Administration (FDA) claims that a patient-reported outcome (PRO) questionnaire is an instrument not only with satisfactory psychological criteria, but should also measure what it is intended to measure, which means that the item could be sensitive enough to assess the concept of interest with each subscale in Chinese patients with UFs.[28] Therefore, we combined these statistical parameters with clinical applicability that ability to detect change score to confirm the item. We strongly urge the use of these 11 items when assessing patients or research participants to maximize the comparability of clinical assessments.

Brief measures are favorable for clinical practice to reduce patient burden and infringement upon encountering time.[14] Overall, in the present sample and the original validation study, the UFS-QoL-11 performed as well as the full version in the measures of reliability, validity, and responsiveness as the full version.[12] When comparing the correlations between the UFS-QoL and the full scale, the correlations were nearly identical. In sum, the UFS-QoL-11 scale will provide comparable information about the latent trait of symptoms and QoL for each respondent as the full scale currently does. Simultaneously, its brevity will increase the likelihood that the assessment of symptoms and QoL will be implemented in everyday clinical practice. Therefore, the UFS-QoL-11 can be easily administered during a typical outpatient visit and may serve as a discussion point to help UFs understand patients’ symptom experiences, especially when symptoms appear refractory to treatment or physiological improvement.

Although the results of the present study are encouraging, there are several limitations that should be highlighted so that the findings are considered with caution and in light of these deficiencies. First, although the UFS-QoL-11 demonstrated good internal consistency and concurrent validity, we could not examine other important psychometric properties such as test-retest reliability. Future studies should examine these psychometric properties for the purpose of a comprehensive psychometric evaluation of the UFS-QoL-11 among the Chinese. Second, we need further external validation of the adaptability and clinical applicability of the UFS-QoL in real words.

Our findings demonstrated that the UFS-QoL-11 performs as well as the original 37-item version while reducing the administrative burden by 70%. Its brevity, if implemented, will increase the likelihood that the assessment of symptoms and QoL will be implemented in everyday clinical practice. As patients with UFs experience a high symptom burden and QoL interference, we hope that the development of UFS-QoL-11 will lead to more widespread, accurate, timely, and sensitive monitoring, in clinical and research settings.

UFS-QoL	Uterine fibroid symptom and quality of life
CTT	Classical test theory
IRT	Item response theory
UFS-QoL-11	11-item short-form version of ufs-qol
UFs	Uterine fibroids
HRQL	Health-related quality of life
UFS-QoL-SF	Short-form version of the ufs-qol
HIFU	High-intensity focused ultrasound
SF-36	Short-form-36
QoL	Quality of life
KMO	Kaiser–meyer–olkin
SD	Standard deviations
CI	Confidence intervals
SRM	Standardized response means
FDA	Food and drug administration
PRO	Patient-reported outcome

Ethics approval and consent to participate: Ethics approval was obtained from China Ethics Committee of Registering Clinical Trials (IRB approval number: ChiECRCT-2011034). The study was conducted in accordance with the Declaration of Helsinki. Each participant provided informed consent prior to their enrollment. There were not illiterate/uneducated participants involved in our study.

Consent for publication: Not applicable

Availability of data and materials: All data in the main text and the Supplementary Information have been uploaded into a website (www.hifuctr.com). Readers have read access to the data but are not allowed to export the data. Full access to the data will be available to researchers for the purposes of research or regulatory decision-making with a signed data access agreement after approval of a proposal. All data requests will be reviewed by the research committee at Chongqing Medical University and the corresponding authors to verify whether the request is subject to any intellectual property or confidentiality obligations.

Competing interests: The authors declare that they have no conflicts of interest.

Funding: This project was funded by the Chongqing social Science Planning Project (No. 2022BS081), China Postdoctoral Science Foundation Funded Project (No.2022MD723732).

Authors' contributions: QS conceived the study, and participated in its design and coordination and helped to draft the manuscript. WX and JZ participated in the design of the study, performed the statistical analysis and write the manuscript. All authors read and approved the final manuscript.

Acknowledgements: The authors are grateful to Dr. Nie for his valuable advice for the manuscript revision.

Stewart EA, Cookson CL, Gandolfo RA, Schulze-Rath R. Epidemiology of uterine fibroids: a systematic review. BJOG: an international journal of obstetrics and gynaecology. 2017;124(10):1501–12.
Deal LS, Williams VS, Fehnel SE. Development of an electronic daily uterine fibroid symptom diary. The patient. 2011;4(1):31–44.
Egbe TO, Badjang TG, Tchounzou R, Egbe EN, Ngowe MN. Uterine fibroids in pregnancy: prevalence, clinical presentation, associated factors and outcomes at the Limbe and Buea Regional Hospitals, Cameroon: a cross-sectional study. BMC Res Notes. 2018;11(1):889.
Whiteman MK, Kuklina E, Jamieson DJ, Hillis SD, Marchbanks PA. Inpatient hospitalization for gynecologic disorders in the United States. Am J Obstet Gynecol. 2010;202(6):541e541–546.
Baird DD, Patchel SA, Saldana TM, Umbach DM, Cooper T, Wegienka G, Harmon QE. Uterine fibroid incidence and growth in an ultrasound-based, prospective study of young African Americans. American journal of obstetrics and gynecology 2020, 223(3):402.e401-402.e418.
Spies JB, Coyne K, Guaou Guaou N, Boyle D, Skyrnarz-Murphy K, Gonzalves SM. The UFS-QOL, a new disease-specific symptom and health-related quality of life questionnaire for leiomyomata. Obstet Gynecol. 2002;99(2):290–300.
Silva RO, Gomes MT, Castro RA, Bonduki CE, Girão MJ. Uterine Fibroid Symptom - Quality of Life questionnaire translation and validation into Brazilian Portuguese. Revista brasileira de ginecologia e obstetricia: revista da Federacao Brasileira das Sociedades de Ginecologia e Obstetricia. 2016;38(10):518–23.
Oliveira Brito LG, Malzone-Lott DA, Sandoval Fagundes MF, Magnani PS, Fernandes Arouca MA, Poli-Neto OB, Nogueira AA. Translation and validation of the Uterine Fibroid Symptom and Quality of Life (UFS-QOL) questionnaire for the Brazilian Portuguese language. Sao Paulo medical journal = Revista paulista de medicina. 2017;135(2):107–15.
Calaf J, Palacios S, Cristóbal I, Cañete ML, Monleón J, Fernández J, Hernández A, Vázquez F. Validation of the Spanish version of the Uterine Fibroid Symptom and Quality of Life (UFS-QoL) questionnaire in women with uterine myomatosis. Med Clin. 2020;154(6):207–13.
Yeung SY, Kwok JWK, Law SM, Chung JPW, Chan SSC. Uterine Fibroid Symptom and Health-related Quality of Life Questionnaire: a Chinese translation and validation study. Hong Kong medical journal = Xianggang yi xue za zhi. 2019;25(6):453–9.
Wang XQ, Zhu L, Xu T, Zhang L, Lyu T, Chen R. [Validation of the Chinese version of the uterine fibroid symptom and health-related quality of life]. Zhonghua fu chan ke za zhi. 2017;52(7):455–60.
Xu W, Chen W, Chen J, Hu L, Su X, Nie Y, Shi Q. Adaptability and clinical applicability of UFS-QoL in Chinese women with uterine fibroid. BMC Womens Health. 2022;22(1):372.
Elander J, Said O, Maratos FA, Dys A, Collins H, Schofield MB. Development and validation of a short-form Pain Medication Attitudes Questionnaire (PMAQ-14). Pain. 2017;158(3):400–7.
Taft TH, Guadagnoli L, Carlson DA, Kou W, Keefer L, Pandolfino J. Validation of the Short-Form Esophageal Hypervigilance and Anxiety Scale. Clin Gastroenterol hepatology: official Clin Pract J Am Gastroenterological Association. 2022;20(2):e64–e73.
Edelen MO, Reeve BB. Applying item response theory (IRT) modeling to questionnaire development, evaluation, and refinement. Qual life research: Int J Qual life aspects Treat care rehabilitation. 2007;16(Suppl 1):5–18.
De Beurs D, Cleare S, Wetherall K, Eschle-Byrne S, Ferguson E, D BOC RCOC. Entrapment and suicide risk: The development of the 4-item Entrapment Scale Short-Form (E-SF). Psychiatry Res. 2020;284:112765.
Chen J, Li Y, Wang Z, McCulloch P, Hu L, Chen W, Liu G, Li J, Lang J. Evaluation of high-intensity focused ultrasound ablation for uterine fibroids: an IDEAL prospective exploration study. BJOG: an international journal of obstetrics and gynaecology. 2018;125(3):354–64.
Li J, Wang H. Evaluationonreliabilityandvalidity ofSF-36 Scale(Version2) inurban residentsîžqualityoflifeinChongqing. J Fourth Military Med Univ. 2009;30(14):1342–4.
Li L, Wang HM, Shen Y. Chinese SF-36 Health Survey: translation, cultural adaptation, validation, and normalisation. J Epidemiol Commun Health. 2003;57(4):259–63.
Chang CY, Huang CK, Chang YY, Tai CM, Lin JT, Wang JD. Cross-validation of the Taiwan version of the Moorehead-Ardelt Quality of Life Questionnaire II with WHOQOL and SF-36. Obes Surg. 2010;20(11):1568–74.
Ware J, Snoww K, Ma K, Bg G. SF36 Health Survey: Manual and Interpretation Guide. Lincoln, RI: Quality Metric, Inc, 1993 1993, 30.
Coyne KS, Margolis MK, Murphy J, Spies J. Validation of the UFS-QOL-Hysterectomy Questionnaire: Modifying an Existing Measure for Comparative Effectiveness Research. Value in Health. 2012;15(5):674–9.
Coyne KS, Margolis MK, Bradley LD, Guido R, Maxwell GL, Spies JB. Further validation of the uterine fibroid symptom and quality-of-life questionnaire. Value in health: the journal of the International Society for Pharmacoeconomics and Outcomes Research. 2012;15(1):135–42.
Jensen MP, Keefe FJ, Lefebvre JC, Romano JM, Turner JA. One- and two-item measures of pain beliefs and coping strategies. Pain. 2003;104(3):453–69.
Cohen J, Cohen J, Cohen JW, Cohen J, Cohen J, Cohen J, Cohen J, Cohen JW, Cohen N, Cohen C. Statistical power analysis for the behavioral science. Technometrics. 1988;31(4):499–500.
Ghanbari A, Hasandoost F, Lyili EK, Khomeiran RT, Momeni M. Assessing Emergency Nurses' Clinical Competency: An Exploratory Factor Analysis Study. Iran J Nurs midwifery Res. 2017;22(4):280–6.
Pourmovahed Z, Mahmoodabadi HZ, Ardekani SMY, Fallahzadeh H, Tavangar H, Mahmoodabad SSM. Validation of the Family Stability Questionnaire in married couples: a confirmatory factor analysis. Electron physician. 2018;10(8):7185–95.
Administration FaD. Guidance for Industry on Patient-Reported Outcome Measures: Use in Medical Product Development to Support Labeling Claims. In., vol. 65132–65133. Federal Register: 74; 2009.

Table 1 Items Selection with CTT and IRT in development data set (n =969)
		CTT		IRT^¶
Subscales		r_P^†	Linear regression model	Beta1	Beta2	Beta3	Beta4	Dscrmn
Symptom severity	U_1	0.724^*		-0.385	0.476	1.527	2.826	1.975^*
	U_2	0.658	4.794^*	-0.690	0.862	2.212	3.450	1.708
	U_3	0.656	3.966	0.096	1.105	2.296	3.255	1.780^*
	U_4	0.614	3.35	0.229	1.488	2.596	3.765	1.454
	U_5	0.483	3.222	-0.176	2.359	4.589	7.782	0.841
	U_6	0.495	3.155	0.814	2.657	4.052	6.696	1
	U_7	0.521	3.189	0.887	2.573	4.002	5.646	1.075
	U_8	0.646	3.447	0.006	1.432	2.561	3.916	1.518
Concern	U_9	-0.592	-5.635	-0.873	0.536	2.988	4.774	0.955
	U_15	-0.841^*	/	-0.704	0.062	1.089	1.691	3.409^*
	U_22	-0.836	-7.609^*	-0.692	-0.005	1.068	1.513	3.563^*
	U_28	-0.702	-5.241	-0.585	0.514	1.939	2.724	1.485
	U_32	-0.795	-6.171	-0.595	0.354	1.490	1.981	2.404
Activities	U_10	-0.686	-3.56	0.108	0.898	2.323	3.023	1.976
	U_11	-0.765	-5.33^*	-0.205	0.679	1.772	2.494	2.288
	U_13	-0.786^*	/	-0.398	0.528	1.560	2.336	2.344^*
	U_19	-0.708	-4.173	-0.559	0.731	2.269	3.094	1.878
	U_20	-0.751	-4.428	-0.028	0.936	2.197	2.962	2.580^*
	U_27	-0.709	-3.904	-0.593	0.430	1.839	2.684	1.513
	U_29	-0.63	-3.547	-0.518	0.961	2.866	3.637	1.315
Energy/Mood	U_12	-0.753	-4.377	-0.821	0.275	1.789	2.845	1.922
	U_17	-0.706	-4.009	-1.006	0.188	2.099	3.188	1.639
	U_23	-0.754	-3.608	-0.271	0.708	1.987	2.876	2.926
	U_24	-0.814	-5.107^*	-0.444	0.555	1.840	2.734	3.704^*
	U_25	-0.827^*	/	-0.443	0.463	1.728	2.618	3.005^*
	U_31	-0.726	-3.663	-0.940	0.277	1.992	2.879	1.837
	U_35	-0.731	-4.267	-0.212	0.881	2.162	3.076	2.194
Control	U_14	-0.646	-6.098	0.244	1.357	2.705	3.401	1.920
	U_16	-0.724	-5.924	-0.416	0.662	2.356	3.704	1.664
	U_26	-0.658	-5.454	-2.347	-0.962	1.059	2.631	0.947
	U_30	-0.734	-6.789^*	-0.269	0.807	2.209	2.601	2.124^*
	U_34	-0.76^*	/	-0.152	0.895	2.046	2.743	2.885^*
Sexual function	U_36	-0.804^*	/	-0.569	0.157	1.321	2.050	3.709^*
Sexual function	U_37	-0.658	/	-0.454	0.265	1.306	1.931	/

Abbreviations: CTT, classical test theory; IRT, item response theory;

*Items are retained for the SF-UFS-QoL based on the CTT and IRT
† r_P correlation coefficient for Spearman correlation test between UFS-QoL subscale score and each item.
¶ item response theory analysis with graded response model (item discrimination parameter).

TABLE 2. Responsiveness of women with uterine fibroids after treatment among parent scale, CTT and IRT in development data set (n=969)
Subscales	Mean score ±SD			Change in score ±SD		Effect size^‡		SRM^†
	Preoperative (n=2411)	6-month	12-month	6-month	12-month	6- month	12- month	6- months	12- months
	Preoperative (n=2411)	(n=1855)	(n=1774)	vs Preoperative	vs Preoperative	6- month	12- month	6- months	12- months
Symptom severity-Parent	18.28±13.97	9.29±9.62	7.15±9.00	8.99±12.04	11.14±11.86	0.64	0.8	0.75	0.94
Symptom severity-CTT*	26.68±22.37	12.02±13.73	9.53±12.26	14.65±18.61	17.14±18.19	0.65	0.77	0.79	0.94
Symptom severity-IRT	23.25±22.01	10.01±13.89	7.64±12.36	13.24±18.45	15.61±17.99	0.60	0.71	0.72	0.87
Concern-Parent	67.59±21.27	78.86±16.59	82.95±15.6	-11.27±19.12	-15.36±18.77	-0.53	-0.72	-0.59	-0.82
Concern-CTT/IRT	63.03±27.42	77.35±20.71	81.63±19.56	-14.32±24.34	-18.6±23.94	-0.52	-0.68	-0.59	-0.78
Activities-Parent	75.51±17.92	83.96±13.48	86.79±12.75	-8.46±15.9	-11.28±15.66	-0.47	-0.63	-0.53	-0.72
Activities-CTT	73.83±24.27	83.36±17.4	86.22±16.27	-9.53±21.15	-12.4±20.79	-0.39	-0.51	-0.45	-0.60
Activities-IRT*	76.25±21.7	84.99±15.64	87.18±14.84	-8.74±18.95	-10.93±18.69	-0.40	-0.50	-0.46	-0.58
Energy/mood-Parent	72.55±18.5	81.69±14.32	84.5±13.54	-9.14±16.58	-11.95±16.32	-0.49	-0.65	-0.55	-0.73
Energy/mood-CTT/IRT	73.72±21.86	82.15±17.6	84.47±16.2	-8.43±19.87	-10.75±19.33	-0.39	-0.49	-0.42	-0.56
Control-Parent	73.38±17.42	81.66±14.04	84.63±12.96	-8.28±15.85	-11.25±15.45	-0.48	-0.65	-0.52	-0.73
Control -CTT/IRT	78.13±20.91	85.88±16.23	87.79±15.07	-7.75±18.75	-9.65±18.32	-0.37	-0.46	-0.41	-0.53
Sexual functioning-Parent	69.57±22.14	80.35±20.92	84.1±19.09	-10.78±21.55	-14.53±20.73	-0.49	-0.66	-0.5	-0.7
Sexual functioning-CTT/IRT	69.45±26.08	79.17±22.8	82.79±20.96	-9.71±24.51	-13.34±23.74	-0.37	-0.51	-0.4	-0.56

Abbreviations: CTT, classical test theory; IRT, item response theory

^*Items are retained for the UFS-QoL-11

^‡ Calculated as change in mean score divided by the standard deviation of the baseline

^† Calculated as change in mean score divided by the standard deviation of the change

Table 3. Construct Validity of the UFS-QoL-11: Baseline Factor Loadings in validation data set (n=970)

Subscales	Item	Factor1	Factor2	Factor3
Daily life interference	U_34	0.792	0.026	0.072
	U_30	0.744	0.094	-0.012
	U_24	0.743	0.245	0.022
	U_25	0.727	0.257	0.057
	U_20	0.656	0.172	0.191
	U_36	0.617	0.117	0.125
	U_13	0.543	0.263	0.214
Concern	U_22	0.233	0.872	0.142
Concern	U_15	0.231	0.872	0.164
Symptom severity	U_1	0.116	0.137	0.897
Symptom severity	U_2	0.111	0.14	0.893

The bold type means items are assigned to the corresponding factor through factor analysis.

Kaiser–Meyer–Olkin (KMO): 0.85

Total variance explained: 64.22%

Table 4. Internal consistency reliability of UFS-QOL-11 subscales in validation data set (n=970)
UFS-QOL	k	Internal consistency (Cronbach’s alpha) ^*
		Baseline
		All	HIFU	Myomectomy
Symptom severity	2	0.81	0.81	0.82
Concern	2	0.83	0.83	0.81
Daily life interference	7	0.84	0.85	0.80

TABLE 5. Responsiveness of women with uterine fibroids after treatment using UFS-QoL-11 in validation data set (n=970)
Subscales	Means ± SD			Change in score ± SD		Effect size^*		SRM^†
Subscales	Baseline	6-month	12-month	6-month	12-month	6-month	12-month	6-months	12-months
Symptom severity	28.65±24.79	12.47±14.42	9.2±12.49	-16.2 (-18 to -14.3)	-19.4 (-21.2 to -17.6)	-0.65	-0.78	-0.8	-0.98
Concern	63.08±27.04	77.48±20.89	82.55±19.99	14.4 (12.2 to 16.6)	19.5 (17.3 to 21.6)	0.53	0.72	0.59	0.82
Function interference	74.79±17.49	83.1±14.16	86.42±12.76	8.3 (6.9 to 9.7)	11.6 (10.2 to 13)	0.47	0.66	0.52	0.75

Abbreviations: UFS-QoL-11, the 11 items short form uterine fibroid symptom and quality of life; SD, standard deviation; SRM, standardized response mean;

* Calculated as change in mean score divided by the standard deviation of the baseline

† Calculated as change in mean score divided by the standard deviation of the change

No competing interests reported.

SUPPLEMENTS.docx

Download PDF

Version 1

posted

You are reading this latest preprint version

Development and validation the uterine fibroid symptom and quality of life short-form: based on classical test theory and modern test theory

Status:

Version 1

Abstract

Objectives

Methods

Results

Conclusion

Figures

Highlights

Introduction

Methods

Results

Discussion

Conclusions

Abbreviations

Declarations

References

Tables

Additional Declarations

Supplementary Files

Status:

Version 1