Abstract
Objective. To investigate responsiveness, discrimination, and construct validity of a composite change index (CCI) for the assessment of single-joint involvement in inflammatory arthritis.
Methods. Evaluation of standardized response means (SRM), Guyatt effect size, and Spearman rank correlation coefficient in a randomized controlled trial investigating the effect of an intraarticular etanercept injection.
Results. The CCI showed a high SRM (1.68) and high Guyatt effect size (2.72). Both visual analog scale of pain and functionality had a moderate Guyatt effect size (2.06, 2.44) and high SRM (0.81, 0.97).
Conclusion. This study supports the use of the CCI as a single-joint assessment after single-joint intervention. Clinical trial registration: NTR-1210.
Several validated outcome measures have been developed to evaluate a clinical response to treatment in patients with polyarthritis, for example the Disease Activity Score at 28 joints (DAS28)1. However, such a measure is lacking for the assessment of a clinical response in patients with a single joint involved.
The absence of a validated outcome measure for single-joint interventions does not imply a lack of interest: an Outcome Measures in Rheumatology Clinical Trials (OMERACT) special interest group (SIG) was already established in 20042. The SIG assessed studies investigating patient-reported outcomes (PRO), clinical examination, and imaging outcomes in therapeutic studies in knee arthritis for their accordance with the OMERACT filter (domains: truth, discrimination, and feasibility)3. It was concluded that the development of outcome measures for the evaluation of single joints remains an important but difficult endeavor2.
Analysis of a gene therapy trial that investigated the effect of an intraarticular injection in a single joint has suggested that PRO have construct validity and are responsive. Clinical assessments (by the physician) have shown a high level of interobserver agreement, but were not sensitive in detecting changes over time4.
In previous trials investigating single-joint treatment effects, a composite score has been used that was specifically designed for single-joint interventions. This score combines 6 target-joint variables and reflects the change in variables over time5. The purpose of our study was to further investigate the responsiveness, discrimination, and construct validity of this composite change index (CCI).
MATERIALS AND METHODS
Clinical study
The CCI was evaluated in a multicenter, randomized double-blind placebo-controlled trial (the Netherlands National Trial Register: NTR-1210) comparing the efficacy of a single intraarticular etanercept (ETN) injection (25 mg) versus placebo in 32 patients with rheumatoid arthritis (RA)6 or psoriatic arthritis (PsA)7. Patients had mono- or oligo-arthritis, including at least arthritis of a knee, ankle, wrist, elbow, or metacarpophalangeal (MCP) joint, despite a stable dose of methotrexate and/or prednisone. Evaluations have been performed at baseline, once per week until 4 weeks, and at 6 weeks after the injection. With a few exceptions, each patient was seen by the same physician examiner over time. Patients were recruited from 3 sites in the Netherlands.
Primary outcome measure
Target joint improvement was assessed by the CCI, a composite score consisting of a 100-mm visual analog scale (VAS) for patient-reported target-joint pain, physician-assessed (PA) joint tenderness, PA joint swelling, and PA functional disability. In addition, both patient’s and physician’s global assessment of the effect of treatment on the target joint was measured. The calculation of the CCI is illustrated in Table 1. Successful treatment was defined as CCI ≥ 55.
Other outcome measures
General disease activity variables included the DAS28 [based both on C-reactive protein (CRP) and erythrocyte sedimentation rate (ESR)], a 66-tender and 68-swollen joint count, and a Ritchie Articular Index, as well as CRP (mg/l) and ESR (mm/h). The Health Assessment Questionnaire-Disability Index (HAQ-DI) was used to evaluate physical function.
Statistical analysis
The responsiveness, discriminative power, and construct validity of the CCI were assessed using baseline and Week 1 data because optimal clinical effect was seen at 1 week.
Responsiveness, discrimination, and construct validity
Responsiveness was determined by standardized response means (SRM)8 and Guyatt effect size, defined as the mean change in a score in the treatment group in a defined period of time divided by the SD of the change in the control group9. Discrimination between active treatment and placebo was analyzed by unpaired Student t tests and chi-square tests where appropriate. Construct validity was determined by comparing the CCI and its components with the HAQ score using Spearman rank correlation.
RESULTS
Study population
The characteristics of the trial, as well as the clinical outcomes, are described elsewhere10. In brief, 32 patients were randomized, and 22 received ETN, 9 placebo, and 1 did not receive treatment because of resolved arthritis symptoms. The CCI was assessed in 30 subjects, including 11 with RA (37%) and 19 with PsA (63%), divided equally between the 2 treatment groups. The injected joints included 26 knee joints (87%), 1 ankle joint (3%), and 3 MCP joints (10%). Intraarticular ETN treatment resulted in a rapid improvement of the CCI in comparison with placebo [mean (SD) change 4.3 (2.6) vs 1.5 (1.6), p < 0.001]. This clinical treatment effect was only transient and disappeared after 3 weeks.
Responsiveness, discrimination, and construct validity
Table 2 shows the statistics of sensitivity to change. The CCI had the highest SRM (1.68) in actively treated patients, as well as the highest Guyatt effect size (2.72). For all measures, SRM were higher for patients treated with ETN than for patients treated with placebo. Of the 4 separate CCI components, both patient-reported VAS pain and PA functionality of the target joint had a high SRM (0.81 and 0.97, respectively) and a high Guyatt effect size (2.06 and 2.44, respectively).
Interestingly, the DAS28, as well as the ESR and CRP, showed moderate to high Guyatt effect sizes, but other variables of general disease activity did not.
The SRM for CCI and its separate components were rather high. VAS pain (0.09) and function (0.35) showed the lowest SRM in the placebo group while patient’s and physician’s evaluation of effect showed the highest SRM (> 0.90).
For between-group discrimination, lowest p values for the comparison between ETN injections and placebo injections were found for the CCI (p = 0.007), but also for the separate CCI components, patient’s and physician’s evaluation of treatment efficacy (p = 0.011 and 0.023, respectively). The DAS28-CRP also showed an acceptable p value (0.026). The other general variables did not discriminate in this trial. Lastly, the CCI correlated moderately well with the HAQ (r = 0.55), as well as the VAS pain and the physician’s evaluation of the treatment effect (Table 3).
DISCUSSION
Our study demonstrates that the CCI as a composite score can be used to evaluate the efficacy of local interventions. Compared with all other tested outcome variables, it had the best level of responsiveness, as well as the best discrimination between active therapy and placebo. The CCI outperformed its separate components, which adds to its credibility: the signal-to-noise ratio of this index is better than that of its separate components, and it seems as if all components notably contribute to some extent to the index.
By incorporating PRO and PA outcomes, the CCI offers a more complete evaluation of single-joint responses. We found a few remarkable discrepancies in the placebo group that point to the importance of combining several variables into 1 index: VAS pain, for example, showed a far lower SRM in patients receiving placebo than “patient’s evaluation of effect” (both PRO) or “physician’s evaluation of effect” in the placebo group. These data suggest that an important placebo effect is operating in single-joint interventions. The low patient number in the placebo group, though, suggests that these data should be interpreted with some reservation.
Notably, joint swelling at baseline had far higher scores than joint tenderness because swelling rather than tenderness was a specific eligibility criterion. This difference may have contributed to a ceiling effect. Consequently, there was more room for improvement of swelling than of tenderness; the higher SRM for swelling than for tenderness in the active treatment group in comparison with the relatively low SRM in the placebo group may be a reflection of this. Further, knee joints, which were the most frequently targeted joints in our study, are particularly sensitive to injections with a tumor necrosis factor–blocking agent.
The CCI does not include systemic variables such as CRP, excluding the possibly confounding influence of persistent arthritis in other joints. Not surprisingly, most of the general variables had a worse performance in terms of evaluating a local treatment effect. However, high values for the Guyatt effect size were found for the DAS28, in addition to CRP and ESR separately, as well as a significant p value for the DAS28-CRP. These findings may reflect a systemic effect of ETN because of leakage from the joint space into the circulation10.
Our study included patients with at least 1 swollen joint, but patients with more than 1 swollen joint (in which the general indices are a better reflection of disease activity) were not excluded. From a principle point of view (face validity), the single-joint CCI rather than an index, such as DAS28, should be recommended to assess effects of a single-joint intervention.
The CCI has not been used extensively until now and has been mostly used for the evaluation of knee joints. Although the assessed variables are likely applicable to other single joints, this should be further investigated. In our study, results of knee, ankle, and MCP joints were combined. It should be noted, however, that results were comparable when the analysis was performed on only knee joints (data not shown).
In our study, the domain “feasibility” of the OMERACT filter was not evaluated. However, based on the study physicians’ opinions, the combination of 1 VAS scale, 2 questions, and a brief joint examination required little extra time and was easily performed during study visits. Further, most components are already incorporated in other standard clinical assessments.
Our study supports the use of the CCI as a single-joint assessment in future studies with single-joint interventions. It has shown good responsiveness, as well as good discrimination. With new local therapies evolving, this index may meet a previously unmet need.
Footnotes
Sponsored by Arthrogen BV. Dr. Aalbers, Dr. Vervoordeldonk, and Dr. Tak are affiliated with Arthrogen BV. Dr. Tak became an employee and shareholder of GlaxoSmithKline after completion of this study.
- Accepted for publication May 6, 2015.