Comparison of EuroQol-5D and Short Form-6D Utility Scores in Multiethnic Asian Patients with Psoriatic Arthritis: A Cross-sectional Study

Abstract

Objective. To compare EuroQol-5D (EQ-5D) and Short Form-6D (SF-6D) utility scores in multiethnic Asian patients with psoriatic arthritis (PsA).

Methods. Consecutive patients fulfilling the Classification Criteria for Psoriatic Arthritis attending a rheumatology outpatient clinic were recruited and completed the EQ-5D and SF-6D questionnaires. Comparisons were performed by score distribution, mean, median, and the Outcome Measures in Rheumatology filter: i.e., truth, discrimination, and feasibility.

Results. Eighty-six patients were enrolled (69 English-speaking and 17 Chinese-speaking; male:female ratio 0.91). The score distribution of SF-6D was normal, while that of EQ-5D was bimodal. A ceiling effect was observed in 20% of patients for EQ-5D and none for SF-6D. There were moderate correlations (Spearman’s rho = 0.59, p < 0.0001) between the 2 scores, but poor agreements on scatterplot, intraclass correlation (ICC 0.43 and standardized ICC 0.21), and Bland-Altman plots. EQ-5D generated lower utility scores than SF-6D in the poorer health subgroup. SF-6D had stronger correlation with the general health status and other external measures of health; and it distinguished better between good and poor general health status, with better effect size and relative efficiency statistics. EQ-5D demonstrated higher patient acceptability.

Conclusion. EQ-5D and SF-6D instruments generated different utility scores in PsA. SF-6D may be superior because of normal scaling distribution and the absence of ceiling and floor effects. SF-6D also had better construct validity and better discrimination of poor health status. More studies are required for cost-utility analysis in PsA.

Key Indexing Terms:

PSORIATIC ARTHRITIS
COMPARATIVE STUDY
COST BENEFIT
QUALITY-ADJUSTED LIFE-YEAR

Psoriatic arthritis (PsA) has deleterious effects on joints and skin, causing joint deformities, impaired physical function, and impaired health-related quality of life (HRQOL). The introduction of anti-tumor necrosis factor-α (anti-TNF) therapies has dramatically changed the treatment paradigms for PsA. Given that health resources are finite, this change highlights the importance of cost-utility analysis (CUA), the primary outcome of which is cost per quality-adjusted life-year (QALY). Indirect HRQOL measures, such as the EuroQol-5D (EQ-5D)¹ and the Short form-6D (SF-6D)², are commonly used to elicit health state values for calculating QALY. Both instruments measure health in terms of physical function, pain, and mental health. Also, both have a scoring function derived from the statistical modeling of preferences for multideficit health states elicited from the general population of the United Kingdom³. Both instruments classify a respondent’s self-reported health status according to a specific descriptive or classification system and assign a utility score. A utility score of 1 represents a state of perfect health and a utility score of 0 represents being dead.

Differences between these 2 instruments arise from a combination of differences in descriptive systems and valuations attached to the health states, and these may lead to different utility scores when applied to the same patient. These differences have been noted in other disease groups, including patients with rheumatic diseases^4,5,6,7,8,9. Although gaps have been noted in the existing literature, to date no conclusion has been drawn with regard to which instrument performs better. This highlights the necessity for such comparisons to be made in a wider spectrum of diseases and sociocultural contexts.

The differences between the EQ-5D and SF-6D have also been observed in PsA cohorts that have poor health status and have undergone anti-TNF therapies⁹. However, the difference between the 2 utility scores has not been demonstrated in the general population of patients with PsA. It should be noted that the SF-6D has an advantage in PsA because it can be derived from the Medical Outcomes Study Short Form-36 (SF-36), which has been commonly used and extensively validated in PsA cohorts^10,11,12. The aim of our study was to compare the utility scores EQ-5D and SF-6D in a population of multiethnic Asian patients who have PsA, particularly with regard to the aspects of truth, discrimination, and feasibility corresponding to the Outcome Measures in Rheumatology (OMERACT) filter¹³. We compared the 2 instruments for their distribution and agreement, construct validity, discriminant capacity, and acceptability.

MATERIALS AND METHODS

Data source and collection

From June 2010 through October 2011, we recruited consecutive patients with PsA [based on the Classification Criteria for Psoriatic Arthritis¹⁴] who attended an outpatient clinic at the rheumatology center at Singapore General Hospital. Patients of different ethnicities (Chinese, Malay, Indian, and others) were recruited. All participants completed identical and validated questionnaires, either the Singapore (English) or the Singapore (Simplified Chinese) version of the SF-36 Health Survey, version 2.0 (SF-36v2), and the EQ-5D, and provided sociodemographic data. The study was reviewed and approved by the Institutional Review Board of SingHealth. Before entry into the study, participants were informed of its nature and purpose and each participant signed an informed consent form.

Instruments

The EQ-5D is a standardized tool to measure health state. It has a 20 cm visual analog scale (EQ-VAS), which records the respondent’s self-rated health as a score from 0 to 100 (0 representing worst imaginable health state and 100 best imaginable health state), and a descriptive system comprising 5 health domains (mobility, self-care, usual activities, pain/discomfort, and anxiety/depression). The 5-domain descriptive system classifies 243 different health states using 3 levels of severity for each domain (no problems, some problems, extreme problems). The EQ-5D tariff was estimated using the time tradeoff (TTO) method in a sample of 3395 respondents from the UK general population¹. It consists of a set of numbers that indicates the level of HRQOL for each EQ-5D health state, on a scale from 1 (full health) to 0 [dead; range −0.594 to 1, where negative values are valued as worse than dead (WTD)]. The EQ-5D is commonly used in quantifying the influence of medical interventions on HRQOL. By comparing the difference in EQ-5D health states before and after treatment, analysts calculated the treatment effectiveness in QALY¹⁵. Both the English and Chinese versions of the EQ-5D have been validated in Singaporean patients with rheumatic diseases^16,17 and measurement equivalence has been demonstrated for the Singapore English and Chinese versions¹⁸. Favorable CUA for anti-TNF therapy in PsA has been reported¹⁹.

The SF-6D is a classification tool for describing health as derived from 7 of the 8 health domains that are covered by the SF-36v2 health survey. It has 6 multilevel domains: physical functioning, role participation (combined role-physical and role-emotional), social functioning, bodily pain, mental health, and vitality. Each of the 6 domains has 4–6 levels of response, thus the SF-6D describes 18,000 health states. A preference tariff was estimated using the standard gamble (SG) method to obtain utility values from the UK general population on 249 of the possible health states². The resulting SF-6D index, which ranges from 0.296 (worst health state) to 1.0 (best health state), was used in the assessment of QALY and the cost-effectiveness of various healthcare interventions. A few studies in PsA have demonstrated improvement in SF-6D scores with anti-TNF treatment beyond the minimal clinically important difference (MID) of 0.03^20,21. The English and Chinese versions of the SF-6D have been demonstrated to be equivalent in the Singaporean population including patients with rheumatic diseases²².

Statistical analysis

Collected data were entered into a Microsoft Excel spreadsheet and analyzed using SPSS software, version 17.0 (SPSS Inc.) and Stata/SE software version 11.0 (StataCorp LP). All statistical tests were 2-tailed and conducted at a 5% level of significance.

EQ-5D and SF-6D utility scores were calculated using the scoring algorithms developed by Dolan¹ and Brazier, et al², respectively. Missing data in SF-36v2 were estimated according to protocol. We compared the distribution, mean, median, and agreement between the utility scores. For distribution, we compared the mean ± SD and median [interquartile range (IQR)] utility scores generated by the 2 instruments. Ceiling and floor effects were considered present if > 15% of participants responded giving the highest and lowest possible scores²³. The within-subject differences in the 2 utility scores were compared by t test. A limit of 0.03 between scores was chosen based on the smallest estimate of the published results of MID for the EQ-5D and SF-6D^7,24. To assess the degree of agreement between the 2 utility scores, we used the interclass correlation coefficient (ICC; 2-way random-effects model with absolute agreement) and a Bland-Altman plot²⁵. An ICC > 0.7 suggests an acceptable level of agreement²⁶. As the lower bound of the 2 utility scores differs, we standardized the utility scores linearly to fit the range of 0–1 (based on the theoretical possible range).

For the aspect of truth, we compared the construct validity using the Spearman’s rank correlation between the 2 utility scores and the SF-6D summary scores, physical component summary (PCS), mental component summary (MCS), and SF-6D general health (SF-GH). The SF-GH is the participant’s response to the first question of the SF-36: “In general, would you say your health is — ?” and it is not included in the calculation of the SF-6D.

For the aspect of discrimination or discriminant capacity, we examined the discriminatory capacities of the 2 utility scores to distinguish between participants with contrasting health states. Participants with different levels of impairment were classified according to the SF-GH. The SF-GH was analyzed with the following categories: “excellent/very good,” “good,” and “fair/poor.” The ability to differentiate between SF-GH “excellent/very good” versus “good” and “good” versus “fair/poor” subgroups was calculated using 1-way ANOVA. The effect size was calculated as the standardized mean difference described by Cohen²⁷ (i.e., the difference in mean scores divided by the pooled SD). The effect size was categorized as small (0.2–0.5), moderate (> 0.5–0.8), or large (> 0.8). The influence of sociodemographic factors was evaluated by t test or 1-way ANOVA. The ability of the utility scores to detect difference between health status of SF-GH was tested by the relative effective statistics (RE). This is calculated as the ratio of the square of t statistics of the SF-6D utility score to the square of t statistics of the EQ-5D. An RE > 1.0 indicates that SF-6D is more efficient than the EQ-5D in detecting the difference. The reverse is true if the RE is < 1.0. We evaluated the RE of the 2 utility scores in differentiating health status according to SF-GH, “excellent/very good” versus “good” and “good” versus “fair and poor”²⁸. For “feasibility,” we reported the proportion of missing data for each utility score.

RESULTS

Characteristics of participants

Eighty-six participants (69 English-speaking and 17 Chinese-speaking) were enrolled into the study. Characteristics of participants in each ethnic group and in the total sample are shown in Table 1. The HRQOL among participants with PsA was much lower than that found among the healthy “normal” population: both the norm-based SF-36 PCS and MCS were below the norm mean of 50. The characteristics of PsA participants from different ethnicities were generally similar, except that the EQ-VAS was poorer among Indian participants with PsA.

View this table:

Table 1.

Participant characteristics and sociodemographics.

EQ-5D and SF-6D distribution and agreement

The score distribution for the SF-6D was normal (skewness = 0.27, kurtosis = −0.37, p = 0.10 by the Kolmogorov-Smirnov test), while that of the EQ-5D was bimodal (skewness = −1.62, kurtosis = 2.80, p < 0.001 by Kolmogorov-Smirnov test; Figure 1). The 3 most commonly reported EQ-5D profiles were 11111 (20%), 11121 (29.4%), and 11122 (10.6%), whereas the reported SF-6D profiles were spread across all states, none of which was reported by > 2 participants (2.4%). A ceiling effect was observed in EQ-5D (range −0.014 to 1.0), where 20% of participants responded with the highest possible score and 2.3% of participants had negative scores for EQ-5D corresponding to the WTD state. No ceiling or floor effects were observed for the SF-6D (range 0.355 to 1.0). The mean (± SD) EQ-5D and SF-6D utility scores were 0.74 (± 0.24) and 0.68 (± 0.13), respectively (p = 0.001). The median EQ-5D and SF-6D utility scores were 0.8 (IQR 0.09) and 0.64 (IQR 0.18). There was a mean difference of 0.05 (± 0.2) between the utility scores, which was higher than the MID of 0.03 that we chose for comparison. The paired utility scores are presented graphically as a scatterplot for the entire population (Figure 2); the Spearman product-moment correlation coefficient between EQ-5D and SF-6D was 0.59 (p < 0.0001). The deviation from the 45-degree line was evident, particularly in the low end of the utility scores. In Bland-Altman plots, EQ-5D scores were systemically lower than the SF-6D in subjects with lower averaged utility scores (Figure 2). Poor agreement between EQ-5D and SF-6D utility scores was demonstrated with the low ICC (0.43, 95% CI 0.23 to 0.59) for the entire population. The ICC was even lower when it was standardized (0.21, 95% CI −0.09 to 0.50). There were wide limits of agreement on the Bland-Altman plot (1.96 SD from −0.35 to 0.46; Figure 2).

Figure 1.

A. Score distribution of the EQ-5D was normal (skewness = 0.27, kurtosis = −0.37, p = 0.10, Kolmogorov-Smirnov test). B. Score distribution of the SF-6D was bimodal (skewness = −1.62, kurtosis = 2.80, p < 0.001, Kolmogorov-Smirnov test).

Figure 2.

A. Paired utility scores for the entire population; Spearman product-moment correlation coefficient between EQ-5D and SF-6D was 0.59 (p < 0.0001). Deviation from the 45-degree line can be seen, particularly in the low end of the utility scores. B. In Bland-Altman plot, EQ-5D scores were systemically lower than the SF-6D in subjects with lower averaged utility scores.

Aspect of truth

For construct validity, the Spearman’s rho correlation between the EQ-5D and SF-36v2 summary scores and SF-GH were moderate (0.37–0.45), while the correlations for the SF-6D with SF-GH status were good (0.57–0.84; Table 2). There were similar correlations between the 2 utility scores and the EQ-VAS. At the domain level, there were moderate correlations between the EQ-5D and SF-6D domains that measure similar constructs. The Spearman’s rho were 0.53 (p < 0.001) between SF-6D pain and EQ-5D pain/discomfort, and 0.48 (p < 0.001) between SF-6D mental health and EQ-5D anxiety/depression. SF-6D physical functioning correlated weakly with EQ-5D mobility, self-care, and usual activities (r = 0.33–0.39), which reflects that these domains measured different aspects of HRQOL.

View this table:

Table 2.

Correlation of EuroQol-5D (EQ-5D) and Short form-6D (SF-6D) with SF-general health and SF-36 summary scores.

Aspect of discrimination

The utility scores did not differ by sex, age, marital status, ethnicity, languages, or education level. For discriminant capacity, the SF-6D distinguished between participants with “better” or “poor” health status, with strong effect sizes. The EQ-5D distinguished participants with “good” versus “fair/poor” health status with only moderate effect size (Table 3). Considering the difference between “excellent/very good” and “good” of the SF-general health, the RE score was 0.98, implying that the SF-6D was as efficient as the EQ-5D in differentiating these 2 health status. Also, the RE was 1.95 for differentiating “good” from “fair/poor” of the SF-GH. This implied that SF-6D was 95% more efficient than EQ-5D in identifying patients with “good” or “poor” general health status.

View this table:

Table 3.

Discriminant capacity of EQ-5D and SF-6D utility scores.

Aspect of feasibility

The EQ-5D has 5 items, which is much fewer than the SF-6D, which is derived from the 36 items of the SF-36. The SF-6D individual items are rarely presented in isolation. The acceptability in terms of completion rate is higher in the EQ-5D. Missing raw data was 1.2% for the EQ-5D. It was 9.3% for SF-6D, and was then reduced to 3.5% by estimating the missing data according to the SF-36v2 protocol.

DISCUSSION

Using an appropriate and valid utility index is a major issue in cost-utility analysis. Therefore, it is important to understand the performance of different indirect utility instruments in various diseases. In this study, we presented and compared utility data in terms of distribution, agreement, and the OMERACT filter, which comprises truth, discrimination, and feasibility, in a cohort of multiethnic Asian patients with PsA recruited from a secondary and tertiary referral rheumatology center. We found a 0.05 difference in the utility scores generated by the 2 instruments, which is higher than the smallest published MID value of 0.03 for the SF-6D^7,24. In a review comparing 8 longitudinal studies across 11 patient groups, the mean MID for the SF-6D was 0.041 and the mean MID for the EQ-5D was 0.074²⁹. We acknowledge that the difference we found between the 2 utility scores was lower than the upper limits of MID, yet this difference was substantial.

Our study also demonstrated poor agreement and ICC with utility scores generated by the EQ-5D and SF-6D. Similar findings have been reported in cohorts of rheumatoid arthritis (RA)⁷, ankylosing spondylitis (AS)⁸, inflammatory arthritis⁹, and early arthritis^30,31. A common finding in comparison studies is that EQ-5D tends to generate lower utilities than SF-6D in subgroups with poorer health, and that the reverse is true in the healthier subgroups^8,32,33. Our study also revealed similar findings in a PsA cohort. There were more prominent deviations from the 45-degree line on the scatterplot and negative values for the difference between EQ-5D and SF-6D in the lower end of average score on a Bland-Altman plot. Adams, et al⁹ demonstrated similar phenomena in patients with inflammatory arthritis (345 RA and 159 PsA) before they started biological therapy and again 12 months later. At baseline, 12% of the PsA participants reported a negative utility score with EQ-5D, which corresponded to a status of WTD. Significantly lower utility scores with EQ-5D compared to SF-6D were observed, and the participant group with the WTD state might have directly contributed to the large difference (about 2-fold) in QALY gain for a given change in health status. Most criticisms about EQ-5D were not about the instrument itself but the preference-based values to the raw TTO scores and how the WTD status is handled^33,34. Using the same database, Adams, et al illustrated the influence of using a revised scoring method for EQ-5D in CUA analysis³⁵. They demonstrated that the revised EQ-5D has a lesser difference between the utility scores and the change in utility after biological treatment. However, that study did not conclude how this new EQ-5D scoring system may alter the utility estimates, and ultimately the results of an economic model³⁶. Although the HRQOL of our cohort was much worse than that of the general population, the general health status of our participants was better than that reported by Adams, et al, and the proportion of participants with the WTD state in our study cohort was only 2.3%. However, the large difference between the utilities generated by these 2 instruments was still present. This implies that the difference between the 2 utility scores is not just related to the preference-based weights and methods of handling the WTD status. In Singapore, where there is no primary healthcare that supports the care of chronic inflammatory arthritis, the majority of patients with PsA are cared for in secondary and tertiary centers. Hence, we believe that our sample represents the whole spectrum of illness in PsA.

It is well known that different indirect utility instruments may yield different utility scores^37,38. Many attributed this to the different methods by which these health preference utility scores were derived (TTO for EQ-5D and SG for SF-6D). The SF-6D describes more health states than does EQ-5D (18,000 vs 243 health states) and therefore may capture more health states at the extreme ends of the range and may capture smaller health changes³⁹. However, only a minority of these states have been valued by SG, and not all the states were valued by TTO when weights of the EQ-5D were calculated. This means that most of the health states carry utility values that were estimated from the utility function rather than being measured directly. Moreover, it should be emphasized that the utility scales are anchored at 0 and 1, while they are by no means representing variables lying on interval scales. The weak interval properties of various utility scores generated from indirect HRQOL instruments have been illustrated in a large-scale comparison⁴⁰. Indeed, the descriptive systems in different HRQOL differ widely in their coverage of different dimensions of HRQOL and thus the reported differences in utility scores are attributable, in part, to these differences.

There has been debate whether the valuation of a health state may be a better reflection of the true welfare value that is associated with health⁴¹, while criticisms are that utility scores and general health evaluations are basically measurements of different constructs. In AS, Boonen, et al⁸ have shown that the EQ-5D, SF-6D, and EQ-VAS correlated well with external health measurements, with only moderate agreement; the disagreement between EQ-5D and EQ-VAS was more prominent in the subgroup with poorer health. There is, however, growing interest using the general health measure as a composite measurement of disease activity in PsA. The PsA index, which consisted of patient global assessment, skin global assessment, and physician global assessment, was found to explain > 90% of the variance in the baseline scores of the Group for Research and Assessment of Psoriasis and Psoriatic Arthritis (GRAPPA) Composite Exercise project⁴², and was taken forward for further evaluation in the GRAPPA 2010 and OMERACT 11 meetings⁴³. In PsA, patient global assessment (PGA) has been shown to have reasonable reliability⁴⁴ and construct validity⁴⁵. We observed better ICC and limits of agreement for the SF-6D with EQ-VAS (data not shown), which may imply that the SF-6D provides a better reflection of general health. Although the EQ-VAS is a similar general health measurement, it has not been evaluated as a valid measure of general health status or composite disease activity score in PsA. Therefore, this finding requires further validation.

The measurement of HRQOL in PsA is under investigation, and there have been limited data on using indirect HRQOL measures for CUA in PsA. Our study adds information to the literature on the comparison of performance of these 2 utilities in PsA. In terms of “truth,” the SF-6D utility score performed better, with higher correlation with SF-36 summary scores and SF-GH. Caution in interpretation is needed, in that the SF-6D and summary scores were derived from the same instrument, the SF-36. However, SF-GH was not included in the calculation of the SF-6D. In terms of “discrimination,” the SF-6D utilities performed better in differentiating participants with poorer health status in terms of effect sizes and the RE scores. Gaujoux-Viala, et al³⁰ demonstrated similar systematic disagreement between EQ-5D and SF-6D in a prospective study on 813 patients with early arthritis, especially in patients with worse clinical outcomes. The SF-6D was shown to have better RE statistics³⁰. In their longitudinal study over 2 years, the same group of authors also recently demonstrated better responsiveness for improvement with the SF-6D in standardized response mean and effect sizes³¹.

For “feasibility,” the SF-6D adds more missing values than the EQ-5D, although the missing values could be minimized by substituting with the average from the other items in the same subscale. For distribution, the SF-6D was better in terms of having normal scaling distribution and the absence of ceiling and floor effects. However, further research is required to determine which instrument performs better.

There are several limitations to our study. First, the construct validity assessments are limited to the SF-GH. No other assessments of disease activity, such as joint count or Health Assessment Questionnaire data, were collected. Second, this was a cross-sectional study and the discrimination evaluation is limited to how the utility scores differentiate different health states, instead of a change in status with treatment over time. Third, we did not address the reliability for these utility scores in PsA. Fourth, our study cohort consisted of PsA patients with a long duration of illness, recruited from a single center, which limits the study’s generalizability to patients with early PsA. Our sample size was relatively small and this may affect the interpretation of ceiling and floor effects. Moreover, HRQOL measures are heavily influenced by comorbidities that may introduce inaccuracy. Finally, the scoring algorithms used for both instruments were developed from the UK general population, because no such algorithm was available in Singapore at the time of the study.

Despite these limitations, we have demonstrated in a cohort of patients with PsA that the SF-6D performed slightly better in terms of construct validity and discrimination than the EQ-5D, which is shorter and more feasible in clinical practice. The SF-6D also had normal distribution and lack of ceiling effects. However, the 2 instruments yielded different utility scores in PsA. This would have a great effect on QALY estimates, and it highlights the importance of choosing the appropriate instrument for cost-effectiveness evaluation. Additional research is needed to determine whether the EQ-5D or the SF-6D is the better instrument for cost-utility analysis in PsA.

Acknowledgment

The authors appreciate the support of the Duke-NUS/SingHealth Academic Medicine Research Institute and the editorial services of Jon Kilner, MS, MA, Pittsburgh, PA, USA, and Taara Madhavan, Associate, Duke-NUS Graduate Medical School, Singapore.

Accepted for publication January 9, 2013.

REFERENCES

1.
1. Dolan P
. Modeling valuations for EuroQol health states. Med Care 1997;35:1095–108.
2.
1. Brazier J,
2. Roberts J,
3. Deverill M
. The estimation of a preference-based measure of health from the SF-36. J Health Econ 2002;21:271–92.
3.
1. Kopec JA,
2. Willison KD
. A comparative review of four preference-weighted measures of health-related quality of life. J Clin Epidemiol 2003;56:317–25.
4.
1. Brazier J,
2. Roberts J,
3. Tsuchiya A,
4. Busschbach J
. A comparison of the EQ-5D and SF-6D across seven patient groups. Health Econ 2004;13:873–84.
5.
1. Conner-Spady B,
2. Suarez-Almazor ME
. Variation in the estimation of quality-adjusted life-years by different preference-based instruments. Med Care 2003;41:791–801.
6.
1. Gerard K,
2. Nicholson T,
3. Mullee M,
4. Mehta R,
5. Roderick P
. EQ-5D versus SF-6D in an older, chronically ill patient group. Appl Health Econ Health Policy 2004;3:91–102.
7.
1. Marra CA,
2. Woolcott JC,
3. Kopec JA,
4. Shojania K,
5. Offer R,
6. Brazier JE,
7. et al.
A comparison of generic, indirect utility measures (the HUI2, HUI3, SF-6D, and the EQ-5D) and disease-specific instruments (the RAQoL and the HAQ) in rheumatoid arthritis. Soc Sci Med 2005;60:1571–82.
8.
1. Boonen A,
2. van der Heijde D,
3. Landewé R
. How do the EQ-5D, SF-6D and the well-being rating scale compare in patients with ankylosing spondylitis? Ann Rheum Dis 2007;66:771–7.
9.
1. Adams R,
2. Walsh C,
3. Veale D,
4. Bresnihan B,
5. FitzGerald O,
6. Barry M
. Understanding the relationship between the EQ-5D, SF-6D, HAQ and disease activity in inflammatory arthritis. Pharmacoeconomics 2010;28:477–87.
10.
1. Husted JA,
2. Gladman DD,
3. Farewell VT,
4. Long JA,
5. Cook RJ
. Validating the SF-36 health survey questionnaire in patients with psoriatic arthritis. J Rheumatol 1997;24:511–7.
11.
1. Leung YY,
2. Ho KW,
3. Zhu TY,
4. Tam LS,
5. Kun EW,
6. Li EK
. Testing scaling assumptions, reliability and validity of Medical Outcomes Study Short-form 36 health survey in psoriatic arthritis. Rheumatology 2010;49:1495–501.
12.
1. Leung YY,
2. Tam LS,
3. Kun EW,
4. Ho KW,
5. Li EK
. Comparison of 4 functional indexes in psoriatic arthritis with axial or peripheral disease subgroups using Rasch analyses. J Rheumatol 2008;35:1613–21.
13.
1. Boers M,
2. Brooks P,
3. Strand CV,
4. Tugwell P
. The OMERACT filter for Outcome Measures in Rheumatology. J Rheumatol 1998;25:198–9.
14.
1. Taylor W,
2. Gladman D,
3. Helliwell P,
4. Marchesoni A,
5. Mease P,
6. Mielants H,
7. et al.
Classification criteria for psoriatic arthritis: Development of new criteria from a large international study. Arthritis Rheum 2006;54:2665–73.
15.
The PRET project: Preparation for the re-valuation of the EQ-5D tariff. Sheffield: The University of Sheffield; 2011. (Internet. Accessed January 25, 2013.) Available from: http://www.shef.ac.uk/scharr/sections/heds/mvh/pret
16.
1. Luo N,
2. Chew LH,
3. Fong KY,
4. Koh DR,
5. Ng SC,
6. Yoon KH,
7. et al.
Validity and reliability of the EQ-5D self-report questionnaire in Chinese-speaking patients with rheumatic diseases in Singapore. Ann Acad Med Singapore 2003;32:685–90.
17.
1. Luo N,
2. Chew LH,
3. Fong KY,
4. Koh DR,
5. Ng SC,
6. Yoon KH,
7. et al.
Validity and reliability of the EQ-5D self-report questionnaire in English–speaking Asian patients with rheumatic diseases in Singapore. Qual Life Res 2003;12:87–92.
18.
1. Luo N,
2. Chew LH,
3. Fong KY,
4. Koh DR,
5. Ng SC,
6. Yoon KH,
7. et al.
Do English and Chinese EQ-5D versions demonstrate measurement equivalence? An exploratory study. Health Qual Life Outcomes 2003;1:7.
19.
1. Bansback NJ,
2. Ara R,
3. Barkham N,
4. Brennan A,
5. Fraser AD,
6. Conway P,
7. et al.
Estimating the cost and health status consequences of treatment with TNF antagonists in patients with psoriatic arthritis. Rheumatology 2006;45:1029–38.
20.
1. Strand V,
2. Sharp V,
3. Koenig AS,
4. Park G,
5. Shi Y,
6. Wang B,
7. et al.
Comparison of health-related quality of life in rheumatoid arthritis, psoriatic arthritis and psoriasis and effects of etanercept treatment. Ann Rheum Dis 2012;71:1143–50.
21.
1. Saad A,
2. Ashcroft DM,
3. Watson KD,
4. Symmons DP,
5. Noyce PR,
6. Hyrich KL; and
7. on behalf of the British Society for Rheumatology Biologics Register
. Improvements in quality of life and functional status in patients with psoriatic arthritis receiving anti–tumor necrosis factor therapies. Arthritis Care Res 2010;62:345–53.
22.
1. Wee HL,
2. Cheung YB,
3. Fong KY,
4. Luo N,
5. Machin D,
6. Thumboo J
. Are English- and Chinese-language versions of the SF-6D equivalent? A comparison from a population-based study. Clin Ther 2004;26:1137–48.
23.
1. Veenhof C,
2. Bijlsma JW,
3. van den Ende CH,
4. van Dijk GM,
5. Pisters MF,
6. Dekker J
. Psychometric evaluation of osteoarthritis questionnaires: A systematic review of the literature. Arthritis Rheum 2006;55:480–92.
24.
1. Barton GR,
2. Sach TH,
3. Avery AJ,
4. Jenkinson C,
5. Doherty M,
6. Whynes DK,
7. et al.
A comparison of the performance of the EQ-5D and SF-6D for individuals aged > or = 45 years. Health Econ 2008;17:815–32.
25.
1. Bland JM,
2. Altman DG
. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986;1:307–10.
26.
1. Fayers PM,
2. Machin D
. Quality of life: The assessment, analysis and interpretation of patient-reported outcomes. Chichester: John Wiley & Sons Ltd.; 2007.
27.
1. Cohen J
. Statistical power for the behavioral sciences. 2nd ed. Hillsdale, NJ: Erlbaum Associates; 1988.
28.
1. Petrou S,
2. Hockley C
. An investigation into the empirical validity of the EQ-5D and SF-6D based on hypothetical preferences in general population. Health Econ 2005;14:1169–89.
29.
1. Walters SJ,
2. Brazier JE
. Comparison of the minimally important difference for two health state utility measures: EQ-5D and SF-6D. Qual Life Res 2005;14:1523–32.
30.
1. Gaujoux-Viala C,
2. Rat AC,
3. Guillemin F,
4. Flipo RM,
5. Fardellone P,
6. Bourgeois P,
7. et al.
Comparison of the EQ-5D and the SF-6D utility measures in 813 patients with early arthritis: Results from the ESPOIR cohort. J Rheumatol 2011;38:1576–84.
31.
1. Gaujoux-Viala C,
2. Rat AC,
3. Guillemin F,
4. Flipo RM,
5. Fardellone P,
6. Bourgeois P,
7. et al.
Responsiveness of EQ-5D and SF-6D in patients with early arthritis: Results from the ESPOIR cohort. Ann Rheum Dis 2012;71:1478–83.
32.
1. Seymour J,
2. McNamee P,
3. Scott A,
4. Tinelli M
. Shedding new light onto the ceiling and floor? A quantile regression approach to compare EQ-5D and SF-6D responses. Health Econ 2010;19:683–96.
33.
1. Marra CA,
2. Marion SA,
3. Guh DP,
4. Najafzadeh M,
5. Wolfe F,
6. Esdaile JM,
7. et al.
Not all “quality-adjusted life years” are equal. J Clin Epidemiol 2007;60:616.
34.
1. Patrick DL,
2. Starks HE,
3. Cain KC,
4. Uhlmann RF,
5. Pearlman RA
. Measuring preferences for health states worse than death. Med Decis Making 1994;14:9–18.
35.
1. Adams R,
2. Craig BM,
3. Walsh CD
. The impact of a revised EQ-5D population scoring on preference-based utility scores in an inflammatory arthritis cohort. Value Health 2011;14:921–7.
36.
1. Craig BM,
2. Busschbach JJ
. Toward a more universal approach in health valuation. Health Econ 2011;20:864–75.
37.
1. Conner-Spady B,
2. Suarez-Almazor ME
. Variation in the estimation of quality-adjusted life-years by different preference-based instruments. Med Care 2003;41:791–801.
38.
1. Wee HL,
2. Machin D,
3. Loke WC,
4. Li SC,
5. Cheung YB,
6. Luo N,
7. et al.
Assessing differences in utility scores: A comparison of four widely used preference-based instruments. Value Health 2007;10:256–65.
39.
1. Bryan S,
2. Longworth L
. Measuring health-related utility: Why the disparity between EQ-5D and SF-6D? Eur J Health Econ 2005;6:253–60.
40.
1. Hawthorne G,
2. Richardson J,
3. Day NA
. A comparison of the Assessment of Quality of Life (AQoL) with four other generic utility instruments. Ann Med 2001;33:358–70.
41.
1. Parkin D,
2. Devlin N
. Is there a case for using visual analogue scale valuations in cost-utility analysis? Health Econ 2006;15:653–64.
42.
1. Helliwell PS,
2. Fitzgerald O,
3. Strand CV,
4. Mease PJ
. Composite measures in psoriatic arthritis: A report from the GRAPPA 2009 annual meeting. J Rheumatol 2011;38:540–5.
43.
1. Coates LC,
2. Mumtaz A,
3. Helliwell PS,
4. Mease PJ,
5. Callis Duffin K,
6. Krueger GG,
7. et al.
Development of a disease severity and responder index for psoriatic arthritis (PsA) — Report of the OMERACT 10 PsA Special Interest Group. J Rheumatol 2011;38:1496–501.
44.
1. Cauli A,
2. Gladman DD,
3. Mathieu A,
4. Olivieri I,
5. Porru G,
6. Tak PP,
7. et al.
Patient global assessment in psoriatic arthritis: A multicenter GRAPPA and OMERACT study. J Rheumatol 2011;38:898–903.
45.
1. Leung YY,
2. Ho KW,
3. Zhu TY,
4. Tam LS,
5. Kun EW,
6. Li EK
. Construct validity of the modified numeric rating scale of patient global assessment in psoriatic arthritis. J Rheumatol 2012;39:844–8.

Main menu

User menu

Search

Comparison of EuroQol-5D and Short Form-6D Utility Scores in Multiethnic Asian Patients with Psoriatic Arthritis: A Cross-sectional Study

Abstract

MATERIALS AND METHODS

Data source and collection

Instruments

Statistical analysis

RESULTS

Characteristics of participants

EQ-5D and SF-6D distribution and agreement

Aspect of truth

Aspect of discrimination

Aspect of feasibility

DISCUSSION

Acknowledgment

REFERENCES

Content

Resources

Subscribers

More