Abstract
Objective. There are several new composite indices for assessing disease activity in psoriatic arthritis (PsA). Each may function as a disease state variable and a responder index. The aim of our study was to determine cutoffs for disease activity and response.
Methods. Data from the Group for GRAPPA Composite Exercise (GRACE) study were used to develop cutoffs using a number of different approaches. Voting on choice of cutoff was undertaken at the 2013 GRAPPA Annual Meeting in Toronto, Ontario, Canada.
Results. After voting, results for cutoffs for low/high disease activity for the Psoriatic ArthritiS Disease Activity Score (PASDAS), GRAppa Composite scorE (GRACE index), and Composite Psoriatic Disease Activity Index (CPDAI), respectively, were 3.2/5.4, 2.3/4.7, and 4/8. The measurement error for each composite score was estimated at 0.8, 1, and 2 for PASDAS, GRACE, and CPDAI, respectively.
Conclusion. Response criteria for the new composite indices have been developed. These now require further validation and testing in other datasets.
Psoriatic arthritis (PsA) is a heterogeneous disease that can manifest in several ways including arthritis, enthesitis, dactylitis, axial disease, and skin/nail involvement. For the last 12 years the primary outcome measure used in interventional studies has been the American College of Rheumatology 20% improvement (ACR20) criteria, a measure originally developed for rheumatoid arthritis (RA) that focuses on peripheral joint activity1. The ACR improvement criteria measure improvement in tender and swollen joint counts plus at least 3 of the following 5 measures: acute-phase reactant, patient global assessment of disease activity by visual analog scale (VAS), physician global (MD global) assessment of disease activity by VAS, pain by VAS, and physical function using the Health Assessment Questionnaire (HAQ). The ACR20, 50, and 70 scores refer to ≥ 20/50/70% improvements.
In addition, a number of studies have used the Disease Activity Score for 28 joints (DAS28)2. The DAS28 in RA measures 28-joint tender and swollen counts, patient global, and either erythrocyte sedimentation rate (ESR) or C-reactive protein (CRP). However, in PsA, the number of joints assessed should optimally include a 68-tender, 66-swollen joint count, which includes the distal interphalangeal (DIP) joints of the fingers. The 28-joint count excludes the DIP joints of the fingers, as well as the ankles and feet. Although the DAS28 has been shown to be capable of distinguishing between patients with PsA treated with anti-tumor necrosis factor agents from those receiving placebo, it was noted that 25% of the patients would not have been included in this study if a 28-joint count had been part of inclusion criteria3. Further, in cases of oligoarthritis, use of the DAS28 can misclassify 20% of cases, as shown in a cross-sectional dataset4.
The Psoriatic Arthritis Response Criteria (PsARC) were developed for a specific Veterans Administration study of sulfasalazine in PsA but have been used widely in subsequent clinical trials5. To achieve response, a patient had to achieve 2 of the following, 1 of which had to be a joint count, and no worsening of any measure: ≥ 30% improvement in tender or swollen joint count; and 1-point improvement (on 5-point Likert scale) on patient global or MD global.
Several other candidate composite measures have been proposed, some of which capture aspects of PsA other than the peripheral arthritis. These include measures developed in the GRAppa Composite scorE (GRACE project); the Psoriatic Arthritis Disease Activity Score (PASDAS) and the Arithmetic Mean of Desirability Function (AMDF)6; the Composite Psoriatic Disease Activity Index (CPDAI)7; and the Disease Activity score for PSoriatic Arthritis (DAPSA)8. Initial comparison of these measures was made in the development phase of the PASDAS and AMDF2 and other comparisons have been made using interventional trial data9.
Composite indices may function in different ways. Responder indices, such as ACR20 in RA, measure changes in disease states with treatment interventions. Disease activity indices, such as the DAS28 in RA, measure both disease activity at a single time point and changes in disease activity after treatment interventions, thereby functioning both as a static measure of disease activity and a responder index, with cutoffs for disease activity states and magnitude of response.
At the Group for Research and Assessment of Psoriasis and Psoriatic Arthritis (GRAPPA) annual meeting in Toronto, Ontario, Canada in July 2013, data were presented on the development of cutoffs for disease activity and response criteria for these new measures, and votes were taken to finalize the process.
METHODS
The GRACE study was a large observational study of 503 patients with PsA with data collected at 32 centers worldwide affiliated with GRAPPA. A large range of clinical data and patient-reported outcomes were collected at baseline, 3 months, 6 months, and 12 months. At each visit, treatment changes were noted, which were used as a surrogate for an active disease state. A change equated to additions of medication, dose increases of current medications, and/or changes to different medications. If treatments were changed because of an adverse event, cases were excluded from the “changed medication” group. Further descriptions and formulas for the composite measures are presented below.
Psoriatic Arthritis Disease Activity Score
The PASDAS is a weighted index comprising assessments of joints, function, acute-phase response, quality of life (QOL), and patient and physician MD by VAS. It is represented by the formula:
where LN = natural logarithm, PCS = physical component summary scale of SF36, CRP = C-reactive protein in mg/l, SF36 = Medical Outcomes Study Short Form-36. All VAS scores are 0–100 mm. Swollen joint count is 66 joints, and tender joint count 68. The score range of the PASDAS is 0–10, with worse disease activity represented by higher scores.
Arithmetic mean of the desirability function and GRACE index
The AMDF is a composite score comprising assessments of joints, skin, pain, function, and health-related QOL. Each domain was rated by physicians on a similar “desirability” scale to be able to combine all the items, and transformed into a 0–1 scale where 0 is a completely unacceptable state and 1 is normal. The variables transformed were (1) 68 tender joint count, (2) 66 swollen joint count, (3) HAQ, (4) Patient global assessment of disease activity by VAS, (5) Patient VAS for skin, (6) Patient VAS for joints, (7) Psoriasis Area and Severity Index (PASI), and (8) Psoriatic Arthritis QOL Index (PsAQoL).
The 8 transformed variables were combined using the arithmetic mean. For the purposes of this analysis, as agreed at the GRAPPA meeting in Washington, DC, in November 2012, the AMDF was transformed, and renamed, as follows:
This provides a score range of 0–10, where 0 is best and 10 is worst.
Composite Psoriatic Arthritis Disease Activity Index
The CPDAI measures disease activity in 5 domains: peripheral joints (68 tender and 66 swollen joints, and HAQ), skin [PASI and Dermatology Life Quality Index (DLQI)], enthesitis (Leeds Enthesitis Count and HAQ), dactylitis (number of tender dactylitic digits and HAQ), and spine (Bath Ankylosing Spondylitis Disease Activity Score and Ankylosing Spondylitis QOL index)7. Within each domain, severity was graded as 0 (none), 1 (mild), 2 (moderate), and 3 (severe), according to predefined cutoffs.
Disease Activity Score for Rheumatoid Arthritis
The DAS28 in RA includes 28-joint tender and swollen counts, patient global VAS score, and either ESR or CRP. The score is calculated using weighting of the components, and ranges between 0 and 10. The DAS28 was calculated as follows:
Disease Activity Index for Psoriatic Arthritis
The DAPSA measures disease activity in peripheral arthritis using 68 tender and 66 swollen peripheral joint counts, patient global VAS (0–10 scale), patient pain VAS (0–10 scale), and CRP. The composite score is a simple sum of the scores8.
Development of cutoffs for disease activity
As there is no single acceptable “gold standard” for low and high disease activity, 3 methods were used to estimate cutoffs: (1) physician and patient global scores; (2) score distribution method; and (3) receiver-operating curve (ROC) method. In addition, the interperiod correlation coefficient was used to estimate “e,” the measurement error.
The results were to be interpreted using a consensus approach with experts in the field. A voting system was used to arrive at cutoffs for each scoring system.
Physician and patient global scores
Physician and patient global scores were the external standards, with which the following cutoffs were used: < 10 low disease activity; ≥ 10 but < 30 moderate; ≥ 30 but < 60 high; ≥ 60 very high. Using these cutoffs, and the ROC curves generated with them, selection of the cutoff was made at the 90% specificity value, in order not to misclassify patients by keeping the false-positive rate low.
Score distribution method
An estimation of cutoffs for disease activity based on the distribution of scores of people in high and low disease activity was used, based on the methods described in the development of cutoffs for the DAS score10. The score distributions for the PASDAS, taken from the GRACE dataset, are shown in Figure 1. Considerable overlap can be seen in scores between the low and high disease activity states. For this reason, a 50th percentile cutoff for both low and high distributions was chosen.
Score distributions for PASDAS in GRACE study. The “change” group had its treatment changed or escalated for “active” disease. The “no change” group did not have treatment change and was therefore deemed to be in stable disease activity. The broken lines represent the 50th percentile values. PASDAS: Psoriatic ArthritiS Disease Activity Score; GRACE: GRAppa Composite scorE (index).
ROC method
This approach used a definition of high disease activity (in GRACE, the physician’s decision to escalate treatment) and constructed ROC curves from which the cutoff for high disease activity can be estimated, using a cutoff at 90% specificity (Figure 2).
Receiver-operating curve (ROC) for PASDAS with decision to change treatment as discriminator. The value of PASDAS at 90% specificity was 5.62 (at this point sensitivity was 40%, as indicated by the dotted line). Area under the curve was 0.78 (95% CI: 0.75–0.81). PASDAS: Psoriatic ArthritiS Disease Activity Score.
Estimation of measurement error
As with the DAS score, an estimate of measurement error (e) was obtained from the interperiod correlation coefficients for each measure using data from each of the assessment points in the GRACE study6. The value of e is represented by:
where sd = standard deviation of measure, and ro is derived from the regression of the interperiod correlation coefficients. A “good” response is represented by (2e).
Consensus approach and voting
The results of this exercise and choice of cutoff were debated at the GRAPPA Annual Meeting in 2013. After discussion, GRAPPA members were asked to vote on the following question: Should the cutoffs be based on (1) lower estimate (patient); (2) upper estimate (physician); or (3) middle point between these 2 (i.e., mean)?
RESULTS
The GRACE database obtained data at baseline, 3 months, 6 months, and 12 months. At each timepoint the physician was asked about treatment change, which therefore provided more data points than the baseline recruitment figures (n = 503). The total number of timepoints at which change was recorded varied according to outcome measures owing to missing data: PASDAS 1103 data points, GRACE 1377, CPDAI 1356, DAS28 1143, and DAPSA 1143.
Physician and patient global scores
For illustration, global scores are provided for the PASDAS in Table 1. The discrepancy between the cutoffs based on physician global scores and those based on patient global scores was resolved by selecting the patient global scores (as agreed at the GRAPPA meeting in November 2012). The process was repeated for all 5 measures (see column 4, Table 2).
Cutoffs for disease activity for PASDAS using physician and patient global scores.
Cutoffs for disease activity in composite measures.
Score distribution method
Percentiles were calculated for each measure according to the score distribution for active and inactive disease; they are shown in Table 3. Because of the overlap mentioned in Methods, the 50th percentile was chosen as the cutoff for each distribution, representing high and low disease activity, as shown in column 6, Table 3. The 50th percentile cutoffs are also shown in column 2, Table 2.
Percentiles for each score distribution.
ROC method
This approach used the definition of high disease activity (the decision to escalate treatment) to construct ROC curves from which the cutoff for high disease activity could be made. These are shown in column 3, Table 2.
Definition of response
Results of the regression of the interperiod correlation coefficients are shown in Table 4. The value of e is the measurement error (see Methods) and a good response is represented by (2e).
Results of regression using interperiod correlation coefficients.
Triangulation of results
Combining the various estimates of the cutoffs, together with the estimates of good response, the results are summarized for all 5 measures in Table 2. Note that the cutoffs determined for the DAS28 are different from those defined for RA. It is also clear from Table 2 that some of the estimates differ. Where discrepancies were found, the choice of cutoff was determined by consensus (see Methods). Voting results were as follows: patient-derived values, 29%; physician-derived, 12%; and a mean of those values, 59%. The mean value of the cutoff is shown in Table 2, column 5.
DISCUSSION
Data from the GRACE study have been used, in a manner similar to the development of the DAS (European League Against Rheumatism response criteria), to define activity states and response criteria for the new composite measures that take a more comprehensive account of PsA. A number of choices for cutoffs were available. A process of triangulation was undertaken to cross-validate estimates. The main discrepancies in the estimates occurred for the PASDAS and GRACE measures. The discrepancies resulted from the different methodological approaches — the patient-based methods (using the global VAS) gave estimates that were lower than the physician-based methods (using the decision to change treatment). In these discrepant cases, voting by GRAPPA members resulted in a majority for using the mean value of available estimates as cutoffs for disease activity and response.
What would PsA response criteria look like? These are indicated in Table 5, but require further evaluation in interventional studies. The defined low disease activity states may also be used as specific targets for treatment in PsA, both in clinical practice and clinical trials. No matter which of these new composite indices is used, it will be important to be able to report the values for individual domains, as will the single composite score; otherwise, differential treatment responses (e.g., between the skin and the joints) will be missed. The single composite score will retain the additional power provided by including all relevant domains, but it will still be appropriate to provide data on the component parts. In time, it is hoped that shorter versions of these indices that function equivalently to the parent index will be developed; however, further experience with the full composite index is required before this can be done. For the moment, GRAPPA members suggest using the PASDAS, GRACE, or CPDAI; future studies will help determine which of these is preferable.
Response criteria for the Psoriatic ArthritiS Disease Activity Score (PASDAS).
Response criteria for the GRAPPA Composite Exercise (GRACE).
Response criteria for the Composite Psoriatic Disease Activity Index (CPDAI).
Response criteria for Disease Activity Score (DAS28-CRP).
Response criteria for Disease Activity score for PSoriatic Arthritis (DAPSA).