Abstract
Objective. The Medical Outcomes Study Short-form 36 (SF-36) provides numerical measurement of patient health, but does not include preferences for health states and cannot be used directly in cost-effectiveness analyses. By contrast the Euroqol EQ-5D can be used for cost-effectiveness analyses. The EQ-5D has rarely been used in systemic lupus erythematosus (SLE). We compared SF-36 and EQ-5D values across rheumatic diseases.
Methods. We studied 1316 patients with SLE, 13,722 with rheumatoid arthritis (RA), 3623 with non-inflammatory rheumatic disorders (NIRD), and 2733 with fibromyalgia (FM).
Results. The mean EQ-5D, physical (PCS) and mental (MCS) component summary scores were 0.72, 36.3, and 44.3, respectively, in SLE. There was essentially no difference among EQ-5D and PCS scores for patients with SLE, RA, or NIRD. MCS was lower in SLE compared with RA and NIRD (44.3, 49.1, 50.8, respectively). All scores were more abnormal in FM (0.61, 31.9, 41.9). Within SF-36 domains, physical function was better, but general health, vitality, social function, role-emotional, and mental health were more impaired in SLE compared with RA and NIRD. In SLE, quality of life (QOL) was predicted by damage, comorbidity, income, education, and age. Fifteen percent of patients with SLE were very satisfied with their health, and their QOL scores (0.84, 45.4, 50.1) were similar to those found in the US population for EQ-5D and MCS, but were slightly reduced for PCS.
Conclusion. EQ-5D and PCS are at the same levels in SLE as in RA and NIRD, but are more abnormal in SLE in the MCS and mental health domains. EQ-5D values allow preference-based comparisons with other chronic conditions.
- SYSTEMIC LUPUS ERYTHEMATOSUS
- RHEUMATOID ARTHRITIS
- FIBROMYALGIA
- EQ-5D
- MEDICAL OUTCOMES STUDY SHORT-FORM 36
- QUALITY OF LIFE
Comprehensive health-related quality of life measures (here called QOL) are used in randomized controlled trials to demonstrate treatment efficacy, in observational studies to show levels of impairment, and occasionally in clinical care to understand the status of an individual patient. The most commonly used comprehensive QOL measure is the Medical Outcomes Study Short-Form 36 (SF-36)1. Although the SF-36 provides numerical measures of health status, it does not include preferences for health states and therefore cannot be used directly in cost-effectiveness analyses or easily in cross-disease analyses.
The Euroqol EQ-5D, by contrast, is a QOL measure that can be used for cost-effectiveness studies across diseases2,3. The EQ-5D is a 5-item questionnaire that after appropriate weighting results in a scale that ranges from –0.11 to 1.00. An EQ-5D visual analog scale (VAS) is also available, but is not based on preference weight and cannot be used in cost-effectiveness analyses. Zero on the VAS scale represents the “worst imaginable health state” and 1 (or 100, depending on scaling) represents the “best imaginable health state.” Scores less than 0 on the EQ-5D may be thought of as “worse than death.” The EQ-5D and EQ-5D VAS are correlated at about 0.6, but have different means (vide infra).
All chronic illnesses result in reduced QOL, regardless of measure3. Systemic lupus erythematosus (SLE) is an important rheumatic disease that can result in life-threatening multisystem illness, and it is of interest to understand the QOL of individuals with it and how SLE compares with other rheumatic illnesses. There are only a few reports of cross-illness studies, and those usually involve small samples4,5.
We have developed a patient-based SLE databank in which patient’s self-report is combined with medical record validation6. This model allows aggregation of data for large numbers of patients with SLE without incurring high physician costs. We now use this databank to characterize SLE patients with respect to QOL and to compare them with patients with rheumatoid arthritis (RA), noninflammatory rheumatic disorders (NIRD), and fibromyalgia (FM) who have similar demographic and participation characteristics. One advantage of comparative studies is that they put the results into the context of the demographic data and severity of other rheumatic diseases at the same center, and provide a better framework for judging the validity of the data.
The primary goals of this study are to describe the comparative QOL of the 4 groups of patients with rheumatic disease according to SF-36 and EQ-5D/EQ-5D VAS results, to examine predictors of QOL in SLE, and to characterize results in terms of patient’s satisfaction with health. The EQ-5D can be used directly for cost-effectiveness analyses.
MATERIALS AND METHODS
Participants
This report concerns 1316 patients with SLE, 13,722 with RA, 3623 with NIRD, and 2,733 with FM who were participants in the National Data Bank for Rheumatic Diseases (NDB) longitudinal study of rheumatic disease outcomes. Participants are followed longitudinally with semiannual, detailed, 28-page questionnaires, as described7,8. Questionnaires were administered by the Internet (16.0%) or by paper questionnaires (84.0%) depending on the participants’ preferences. Internal NDB analyses indicated no differences in the validity of responses depending on questionnaire method.
Patients were enrolled continuously beginning in 1999 and ending in July 2008. Rheumatic disease diagnoses were made or confirmed by the patient’s rheumatologist. NIRD included diagnoses such as osteoarthritis, back pain syndromes, tendonitis, etc., excluding FM. Patients with SLE were enrolled largely by rheumatologist referral, but also by self-referral after confirmation of the diagnosis of SLE by the patient’s rheumatologist6. Rheumatologists rated 97.3% of cases as definite and 2.7% as probable SLE. We did not study cutaneous lupus. Patients with a physician-confirmed overlap diagnosis of SLE and FM (7.2%) and SLE and RA (13.1%) were assigned to the SLE category. The seemingly large percentage of SLE/RA overlaps was derived from the large sample of RA patients, where they represented 1% of RA diagnoses. Sensitivity analyses showed that this assignment did not change study results. RA patients in the NDB who were enrolled as part of pharmaceutical study registries were excluded so as not to bias the study with more severe patients. In this study, we selected a single random observation from each patient for analysis.
Study variables
Demographic variables including age, sex, education level, ethnicity, and household income were obtained by self-report. We calculated the physical (PCS) and mental (MCS) component summary scores and 8 individual domain scores from the SF-36 version 1 according to the authors’ recommendations1,9. The primary time period of the SF-36 questionnaire was 4 weeks. The EQ-5D is a 5-item questionnaire that assesses function (3 questions), mood (1 question), and pain (1 question)2. Scoring was accomplished using US tariffs (weights)10,11. US and European scores are not interchangeable, US scores being approximately 0.11 units greater12.
We also collected information regarding satisfaction with health. Satisfaction with health was evaluated with a 5-point scale13,14. The question asked was, “How satisfied are you with your health now?”. Possible replies were very dissatisfied, somewhat dissatisfied, neither satisfied nor dissatisfied, somewhat satisfied, and very satisfied.
SLE damage was assessed by the total damage score of the Lupus Damage Index Questionnaire (LDIQ)15,16, a self-administered 56-item questionnaire based on the Systemic Lupus International Collaborating Clinics/American College of Rheumatology (SLICC/ACR) damage index (SDI)17,18. The development of the LDIQ from its pilot testing to its administration is described elsewhere15,16. The LDIQ was administered beginning in 2008 and, therefore, was only assessed in 676 patients with SLE.
Comorbidity was measured by a patient-reported composite comorbidity index (range 0–9) consisting of 11 present or past comorbid conditions including pulmonary disorders, myocardial infarction, other cardiovascular disorders, stroke, hypertension, diabetes, spine/hip/leg fracture, depression, gastrointestinal (GI) ulcer, other GI disorders, and cancer19,20. This index has been studied in SLE21.
Statistical methods
Associations between the PCS, MCS, and EQ-5D and other variables were assessed by Spearman correlation coefficients because of the markedly non-normal distribution of the EQ-5D. We used multivariable linear regression to examine the differences in QOL scores between groups, adjusting for age and sex; and we used linear regression to describe and test the association of LDIQ components and PCS and MCS scores. Except as described in the text, analyses are unadjusted. Figure 1 was created using fractional polynomial prediction (Stata’s fpfitci). Figure 2 utilized adaptive (varying bandwidth) kernel density estimation. We began collecting EQ-5D data in 2002. Patients who were NDB participants prior to that date and not thereafter did not have EQ-5D data. Overall, 29.1% of EQ-5D data were missing. Missing data for SF-36 domains ranged between 0.6% and 2.4%. We chose not to impute missing data because of the large percentage of missing EQ-5D data and because complete EQ-5D and PCS/MCS data were not essential to the analyses.
Data were analyzed using Stata (Stata Corp., College Station, TX, USA) version 10.1. Statistical significance was set at the 0.05 level, and all tests were 2-tailed. Because the sample sizes were so large, most differences in comparisons between SLE and other groups in Table 1 are statistically significant at p < 0.001. Except as noted in the text, we do not, therefore, report statistical significance. Instead, the reader should consider whether the group differences are clinically significant.
RESULTS
The median ages of participants were SLE 49.9 years [interquartile range (IQR) 40.6, 58.8], RA 61.2 years (51.2, 71.1), NIRD 61.5 years (58.5, 75.8), and FM 55.6 years (47.6, 63.4), and the percentages who were men were SLE 6.0%, RA 23.0%, NIRD 20.9%, and FM 4.7%. SLE and RA groups did not differ by sex (p = 0.077), but all other groups differed by sex (p < 0.01); and all groups differed by age (p < 0.001). The percentages of non-Hispanic white were SLE 89.9%, RA 95.5%, NIRD 96.8%, and FM 97.4%. Differences between SLE and other groups for ethnicity were significant at p < 0.001.
Comparative SF-36 summary scores and domains
Table 1 presents the QOL measures for the 4 groups. Across all domains, summary scores, and utilities, the worst scores were found in patients diagnosed with FM. Comparing the other 3 groups is more complicated: with respect to the SF-36 summary, the PCS scores were essentially the same for SLE, RA, and NIRD (36.3, 36.7, 36.4, respectively). But important differences were found for the MCS scores. SLE patients had the worst MCS score, followed by RA and NIRD (44.3, 49.1, 50.8, respectively). The same type of difference was seen in mean EQ-5D scores (0.72, 0.73, 0.73, respectively). Compared with RA (the most relevant comparison) PCS score was not significantly lower in SLE. However, after adjustment for age and sex, SLE scores for PCS were –0.93 (95% CI –1.59 to –0.27) units lower (p = 0.006). For MCS the nonadjusted difference was significant and the age and sex-adjusted difference was –3.24 (95% CI –3.92 to –2.59, p < 0.001). For EQ-5D the unadjusted difference, but not the adjusted difference, was statistically significant.
However, the domain scores differed among the groups (Table 1). Physical function was better in SLE than in RA and NIRD (52.7, 49.5, 47.6, respectively). But scores for general health, vitality, role-emotional, and mental health were substantially worse in SLE patients. Comparing RA and NIRD, we found that physical function and bodily pain were better in RA, but that NIRD had better scores for general health, vitality, role-emotional, and mental health.
Predictors and correlates of QOL measures
We examined demographic and comorbidity correlates of SF-36 and EQ-5D scales. Higher scores for the LDIQ and comorbidity reflect worse clinical status compared with the QOL measures, in which lower scores reflect worse health status. Correlations with r > 0.8 are usually considered very strong, 0.6 to 0.8 strong, 0.4 to 0.6 moderate, 0.2 to 0.4 weak, and < 0.2 as absent. The LDIQ and comorbidity index were the strongest correlates of the QOL measures, 0.434 and 0.372 for the PCS, and 0.394 and 0.389 for the EQ-5D (Table 2). Household income was correlated with PCS and EQ-5D at 0.292 and 0.303. No other demographic measure had a correlation above 0.127. Of interest, the PCS correlation with EQ-5D (0.717) was stronger than the PCS correlation with the EQ-5D VAS (0.607), and stronger than with the EQ-5D and EQ-5D VAS (0.607).
Figure 1 shows the nonlinear association of PCS and LDIQ damage and the very limited association of MCS and damage. The relation between EQ-5D and damage (not shown) is similar to the PCS-damage relationship.
We also used the self-reported LDIQ scales to illuminate the association between the SF-36 QOL measures and SLE damage. Table 3 shows the univariate decrease (–) in PCS and MCS scores associated with having self-reported damage in each domain. In general, damage and PCS were related (8 significant results) compared with damage and MCS (3 significant results).
Interpretation of QOL scores
Although SF-36 scores are widely reported, the numbers have no clear relevance to clinicians. To place SF-36 and other QOL scores into a more meaningful context, and as an aid to clinical interpretation, we categorized QOL measures at 5 levels of patient satisfaction with health (Tables 4a, 4b). About 47% of SLE patients are somewhat (22.1%) or very satisfied (15.1%) with their health. At the very satisfied level, means scores for PCS, MCS, and EQ-5D were 45.3, 50.1, and 0.84, respectively. The distribution of PCS and EQ-5D values and the satisfaction categories are displayed in Figure 2.
DISCUSSION
Health status questionnaires provide measures of patient health across various domains. The SF-36, which addresses 8 domains, has found wide use in rheumatology clinical trials and in observational studies. While the SF-36 provides numerical measures of health status, it does not include preferences for health states and cannot be used directly in cost-effectiveness analyses. However, through use of the SF-6D22, which derives from the SF-36, cost-effectiveness data can be generated. By contrast, the EQ-5D can be used for cost-effectiveness studies across diseases. The mean EQ-5D in US adults aged 45–64 years is 0.8210. The minimally important difference (MID) for EQ-5D has been estimated to be 0.074, range –0.011 to 0.1423. As shown in Figure 2, the distribution of EQ-5D scores is distinctly irregular, with a long tail.
In a national sample of 38,678 individuals from the Medical Expenditure Panel Survey (MEPS) in 2006, EQ-5D results were obtained from those with a series of chronic conditions that are similar though more general than the diagnoses in the current study, including 111 with a diffuse connective tissue disease (ICD-9 code 710), 231 with inflammatory polyarthritis (ICD-9 714), and 348 with osteoarthritis (ICD-9 715)3. The MEPS EQ-5D mean and median EQ-5D values for diffuse connective tissue disease were 0.74 and 0.80, respectively. In the current study these values in SLE patients were 0.72 and 0.78. For inflammatory polyarthritis the MEPS values were 0.66 and 0.783, while in RA patients in our study the values were 0.73 and 0.78. Osteoarthritis (ICD-9 715), in contrast to NIRD, resulted in values of 0.70 and 0.793 compared with 0.73 and 0.78 in our study. Given the limited chronic disease sample of the MEPS, the general diagnostic categories, and the EQ-5D distribution, it is likely that the MEPS median values, which are very similar to those in the current study, are the most stable and appropriate for comparison.
There is no code for FM in the MEPS report3, but we observed values of 0.61 and 0.71 for FM. Of the 692 physical illness categories in the MEPS, only renal failure had values as low as those noted for FM. It is not surprising that FM has low EQ-5D values because the ACR criteria for FM24 select for severity by virtue of requiring the presence of widespread pain and many tender points.
Clarke, et al used the nonpreference-based EQ-VAS scale in SLE and found values of 0.65 in 634 patients without renal disease and about 0.68 in 81 with renal disease25. Among 269 US SLE patients the EQ-5D VAS was 0.6626. In the only study to investigate an unselected SLE sample with the EQ-5D, Aggarwal, et al recently reported SF-36 and EQ-5D data on 167 patients from an academic rheumatology center27. They found PCS, MCS, and EQ-5D mean (SD) scores of 35.6 (0.8), 51.9 (8.9), and 0.72 (0.19), respectively, and they found the EuroQol VAS score to be 68.7 (20.5). In the current study values for the 4 scores were 36.3 (11.5), 44.3 (11.8), 0.72 (0.21), and 0.64 (0.21). With respect to the 3 rheumatic diseases under study, we found essentially no difference between those with SLE and those with RA or NIRD. However, as shown in Table 2, the EQ-5D is sensitive to SLE damage and sociodemographic variables. Therefore, the values of the EQ-5D will vary somewhat from study to study, dependent on the SLE damage in the study population. In agreement with population studies10, we also found that the EQ-5D varied minimally according to age, education, and annual household income. There were too few men with SLE in our study to adequately examine gender effect.
Therefore, the SLE EQ-5D results can be summarized by saying that the EQ-5D mean and median are approximately 0.72 and 0.78; SLE, RA, and NIRD have the same level of EQ-5D health status.
If the EQ-5D is not ordinarily used in SLE except in the EQ-VAS format, the SF-36 is commonly used. In the Tri-Nation study, PCS and MCS scores ranged from 36.3 to 40.6 for PCS and from 43.5 to 43.8 for MCS25. In another study of 43 SLE patients the PCS and MCS scores were 38.6 and 43.1, respectively28. Studying 90 patients with SLE in 2004, Jolly and Utset noted scores of 35 and 4529. Results from the LUMINA studies showed mean PCS and MCS values of 36.7 and 46.630. In our study the PCS and MCS were 36.3 and 44.3. In general, one can say that PCS values in SLE are in the range of 35 to 40, or around the second x-line from the right in Figure 2.
As with the EQ-5D, there was essentially no difference in PCS scores among SLE, RA, and NIRD patients (Table 1), but there were differences in the individual domains. There were, however, differences in MCS scores, with the most abnormal scores being found in SLE and then RA. These result are consistent with findings of a Norwegian SLE/RA EQ-5D study4.
To place scores into a meaningful clinical perspective, we categorized them according to patient’s satisfaction with health (Tables 4a, 4b). In general, the mean SF-36 and EQ-5D scores fall just below the category “somewhat satisfied” (Figure 2). It is of particular interest that the “best” category (“very satisfied”) has a mean value for EQ-5D of 0.84 and the mean score for US adults aged 45–64 years is 0.8210. This helps to further validate the satisfaction categorizations. The best values for PCS and MCS were 45.4 and 50.1, respectively, for patients with SLE.
There are a number of limitations in the use of EQ-5D results. The EQ-5D is only one of a number of utility scales, and each scale produces a different utility31,32 and the utility weight differs according to whether the weight is determined by patients or the general populations33. In addition, the EQ-5D is less sensitive to change than the SF-36, as the EQ-5D tries to summarize health status in just 5 questions34,35. The EQ-5D is also very sensitive to outliers, given its long tail. With respect to the EQ-5D VAS and the EQ-5D, we found a stronger association between the PCS and EQ-5D (0.717) than with the PCS and EQ-5D VAS (0.607). One explanation for this difference is that the PCS and EQ-5D are based on specific questions, while the EQ-5D VAS acts more as a global scale in which “overall” QOL rather than item-specific QOL can predominate. We also found that association between EQ-5D VAS and EQ-5D was 0.607 compared with the stronger association between the PCS and EQ-5D of 0.717. We hypothesize that the PCS, as a longer and more representative measure, better identifies the overall QOL of the VAS scale.
Also, QOL is reduced with increasing age, disease duration, lower income, and educational attainment. In presenting our data, we did not adjust results for these factors because we wanted to observe results for the disorder in general. In addition, QOL is dependent on the severity of the population under study, and might be expected to vary somewhat across clinics and settings. Patients’ decisions to enroll and participate in our study, compared with non-enrollees in the community, may have introduced biases that influenced some of the results. In addition, patients in this study, as with most survey research, do not reflect the demographic characteristics of the community, being better educated, having higher household incomes, and being composed of fewer minorities.
In summary, QOL in SLE, RA, and NIRD is similar with respect to SF-36 PCS scores and EQ-5D results. SLE patients have the lowest MCS scores of the 3 disorders. Patients with FM have the lowest QOL scores, regardless of measure. QOL in SLE is predicted by damage, comorbidity, age, household income, and educational attainment. About 47% of SLE patients are somewhat (22.1%) or very satisfied (15.1%) with their health.
Footnotes
- Accepted for publication September 24, 2009.