Abstract
Objective. To assess changes following treatment and the reliability and responsiveness to change of the Outcome Measures in Rheumatology (OMERACT) Psoriatic Arthritis Magnetic Resonance Imaging Score (PsAMRIS) in a randomized controlled trial.
Methods. Forty patients with PsA randomized to either placebo or abatacept (ABA) had MRI of either 1 hand (n = 20) or 1 foot (n = 20) at baseline and after 6 months. Images were scored blindly twice by 3 independent readers according to the PsAMRIS (for synovitis, tenosynovitis, periarticular inflammation, bone edema, bone erosion, and bone proliferation).
Results. Inflammatory features improved numerically but statistically nonsignificantly in the ABA group but not the placebo group. Baseline intrareader intraclass correlation coefficients (ICC) were good (≥ 0.50) to very good (≥ 0.80) for all features in both hand and foot. Baseline interreader ICC were good (ICC 0.72–0.96) for all features, except periarticular inflammation and bone proliferation in the hand and tenosynovitis in the foot (ICC 0.25–0.44). Intrareader and interreader ICC for change scores varied. Guyatt’s responsiveness index (GRI) was high for inflammatory features in the hand and metatarsophalangeal joints (GRI −0.67 to −3.13; bone edema not calculable). Minimal change and low prevalence resulted in low ICC and GRI for bone damage.
Conclusion. PsAMRIS showed overall good intrareader agreement in the hand and foot, and inflammatory feature scores were responsive to change, suggesting that PsAMRIS may be a valid tool for MRI assessment of hands and feet in PsA clinical trials.
Joint involvement in psoriatic arthritis (PsA) is heterogeneous, and inflammation is often present in both axial and peripheral joints, including the small joints of the hands and feet1,2. Rapidly evolving treatment options have increased the requirement for developing efficient measures for assessing treatment response.
Magnetic resonance imaging (MRI) can assess inflammation as well as bone damage and has been used extensively as an outcome measure in rheumatoid arthritis (Rheumatoid Arthritis MRI Scoring System, RAMRIS, assessing synovitis, bone marrow edema, and bone erosion)3,4.
In 2009, the PsA MRI Score (PsAMRIS) for peripheral PsA was presented by the Outcome Measures in Rheumatology (OMERACT) MRI in arthritis working group. The scoring system assessed the joints of the fingers and was based on the same features as the rheumatoid arthritis instrument, RAMRIS, with the addition of tenosynovitis, periarticular inflammation, and bone proliferation5. In previous studies, the application of the PsAMRIS to the hand has showed overall moderate-to-high intrareader and interreader agreement and has been proved sensitive to change6,7,8. In the present multireader exercise, we used images from a randomized, placebo-controlled trial.
We assessed changes following treatment, intrareader and interreader reliability, and responsiveness to change of PsAMRIS features in the hand and foot of patients with PsA.
MATERIALS AND METHODS
Patients
In the primary study9, 170 patients with PsA were included. Key inclusion criteria were: fulfillment of the ClASsification for Psoriatic ARthritis (CASPAR) Study Group criteria, ≥ 3 swollen joints and ≥ 3 tender joints, active plaque psoriasis, disease duration of ≥ 3 months and inadequate response to 1 or more disease-modifying antirheumatic drugs (DMARD). The patients were randomized 1:1:1:1 to 3 different doses of abatacept (ABA) or placebo at days 1, 15, and 29 and then once every 28 days thereafter. Selection of patients and image acquisition in the OMERACT exercise is described in Figure 1. All patients gave their written informed consent prior to the study.
Image evaluation
Twenty paired sets of MRI of the hand (7 received 30/10 mg/kg, 6 received 10 mg/kg, and 7 received placebo) and 20 image sets of the foot (6 received 30/10 mg/kg, 8 received 10 mg/kg, and 6 received placebo) were used for this exercise. Each MR image set consisted of images acquired at 2 timepoints: Baseline and after 6 months of treatment (2 patients in the hand placebo group and 1 patient in the foot placebo group had followup MRI at 3 mos). The 1st–5th metacarpophalangeal (MCP) joint, 1st–5th proximal interphalangeal (PIP), and 2nd–5th distal interphalangeal (DIP) joints in the hand; or the 1st–5th metatarsophalangeal (MTP), and 1st interphalangeal (IP) joint in the foot were assessed. The DIP joints of the foot were considered unreadable because of small joint size. Each joint was scored according to the OMERACT PsAMRIS5 for synovitis (0–3), flexor tenosynovitis (0–3), periarticular inflammation (0–2), bone marrow edema (0–6), bone erosion (0–20), and bone proliferation (0–1). MR images were read twice by 3 readers (FG, MØ, PB), all experienced MRI readers familiar with the scoring system. A calibration session was performed, using similar images, in the evening before the exercise. All images were anonymized and randomized. Prior to the second reading, the images were reanonymized and rerandomized. The readers were blinded for treatment and patient data, but not for time order, as suggested by van Tuyl, et al10. The individual readers used the same computer workstations for both readings, and the MR images were assessed using OsiriX software version 4.1.2 (Pixmeo, Geneva, Switzerland).
Statistics
Status and change scores were described by calculating the mean score of the 2 individual readings, and were presented as the mean of the readers’ mean scores. Change over time was estimated using the Wilcoxon signed-rank test. Difference between treatment groups was assessed using the Mann-Whitney U test.
Intrareader and interreader reliability was assessed using single measure and average measure intraclass correlation coefficients (ICC), respectively. The smallest detectable change (SDC) was calculated for change scores11.
Responsiveness of the PsAMRIS features was estimated using Guyatt’s effect size (Guyatt’s responsiveness index; GRI), which is calculated by dividing the mean change score in the ABA group with the standard deviation of the change score in the placebo group12.
RESULTS
MRI scores at baseline and during treatment
Overall, the inflammatory variables showed numerical improvement in the ABA group but not the placebo group in the hand and foot. The synovitis score of the MTP joints in the ABA group was the only variable showing a statistically significant improvement in score: 1.1 (p = 0.04). Bone damage scores were overall unchanged in all groups (Table 1).
The Mann-Whitney U test showed no statistically significant differences between the placebo group and the ABA group (data not shown).
Reliability
In the hand, intrareader ICC for baseline scores were good to very good for all features except synovitis and bone proliferation in 1 reader (Table 2). Intrareader ICC for change scores were good to very good in all or some readers for the inflammatory features, and were poor for bone damage features. Interreader ICC for baseline scores were good to very good for all features, except periarticular inflammation and bone proliferation. Interreader ICC for change scores were very good for tenosynovitis (Figure 2) and bone marrow edema, and poor for the remaining features. SDC was low for all features, except intrareader SDC for synovitis and periarticular inflammation in 1 reader.
In the foot, intrareader ICC for baseline scores was good to very good for all or some readers for all features. Intrareader ICC for change scores was good to very good for all or some readers for synovitis, tenosynovitis, and bone erosion; and poor for the remaining features. Interreader ICC for baseline scores were good to very good for all features except tenosynovitis. Interreader ICC for change scores were good for synovitis, periarticular inflammation, and bone marrow edema, and poor for remaining features. SDC was low for all features.
When analyzed for separate joint areas, the ICC tended to improve with the size of the joint (data not shown).
Responsiveness
In the hand, GRI was excellent for total score, and MCP and PIP joint scores for tenosynovitis and for total scores, and MCP joint scores for periarticular inflammation (Table 3). GRI for synovitis was moderate to good whereas the GRI for the remaining features were poor or not calculable because no change was detected in the placebo group for that feature by any of the 3 readers (actually suggesting high responsiveness for some features, because improvement was observed in the ABA group).
In the foot, GRI was excellent for total score and MTP joint scores of synovitis, and for total scores of periarticular inflammation. GRI was good for bone erosion total score and moderate for tenosynovitis and bone erosion in the MTP joints. GRI of the remaining features were poor or not calculable.
DISCUSSION
This multireader exercise evaluating the OMERACT PsAMRIS in a randomized, placebo-controlled trial showed numerical improvements in the inflammatory features in patients treated with ABA but not with placebo. Bone damage did not change during the 6-month followup period. The baseline intrareader and interreader agreement and the responsiveness of inflammatory variables were high overall for some or all readers.
Synovitis in the MTP joints of the foot was the only feature showing a statistically significant improvement during ABA therapy. However, the PsAMRIS features of inflammation all showed numerical, although statistically nonsignificant, improvements after 6 months of ABA therapy, both in hands and feet. This indicates that a larger sample size than the current 13–14 actively treated patients would likely have resulted in more statistically significant PsAMRIS score improvements.
In our present study, we analyzed feet and hands separately. By analyzing both groups together in a posthoc analysis (data not shown), statistically significant improvement was seen in total synovitis score in the patients treated with ABA, supporting that higher sample sizes would provide statistically significant differences. In agreement with the present study, our previous observational study of patients receiving tumor necrosis factor-α (TNF-α) inhibitor8 showed improvements of PsAMRIS tenosynovitis, synovitis, and periarticular inflammation in the hand of patients receiving TNF-α inhibitors. It should be mentioned that the purpose of the present study was not to detect a statistically significant change in PsAMRIS score because the sample size was very small.
Intrareader reliability assessed by ICC for the hand was generally good for all features, but not for some or all readers regarding change scores of periarticular inflammation, bone marrow edema, bone erosion, and bone proliferation. A probable explanation for these low ICC values on change scores for these features is that the change was low, making the ICC sensitive to minor disagreements within or between readers. For the bone damage features, a low ICC was to be expected because these features did not show any change over the 6 months. The low SDC for these features suggests a good reliability and supports the proposed reason for low ICC.
Interreader reliability for baseline periarticular inflammation and bone proliferation was low for the hand, which suggests that there is less consensus about how to score these PsA-specific features compared to the features included in the RAMRIS, which have been summarized in an atlas13,14,15.
The ICC for the foot showed a pattern similar to that of the hand. However, the intrareader ICC for periarticular inflammation and bone proliferation was generally lower than in the hand. This may be explained by the smaller structures in the toes and less experience by the readers in scoring MR images of the foot.
Responsiveness was high for the inflammatory features of PsAMRIS in the hand and the MTP joints of the foot, despite bone marrow edema not being calculable. For many PsAMRIS features, calculation of GRI was not possible owing to zero variance in the placebo group. However, this illustrates that none of the 3 readers detected any change in the placebo group, while a change in score was recorded in the ABA group (e.g., bone marrow edema as seen in Table 1), suggesting responsiveness even though GRI was not calculable.
Overall, the DIP joints of the hand showed low responsiveness to change. This may be explained by poor image quality and difficulties scoring the features as a result of small anatomical size. Good visualization of DIP joints is relevant considering that the DIP joints of the hands and feet are commonly involved clinically in PsA2. The DIP and 2nd–4th PIP joints of the foot are smaller than the hand and were not considered readable in this trial. In addition, the responsiveness of all the features of the IP joint of the foot were poor or not calculable. Higher image resolution based on increased magnetic field strength and smaller voxel size may be needed to increase the accuracy of the scorings of the small anatomical structures of the hand and foot.
The course of changes, reproducibility, and responsiveness to change in the foot were comparable to the hand, which supports further validation of the PsAMRIS on the foot.
This randomized controlled trial showed numerical improvements in the inflammatory variables of PsAMRIS during ABA but not placebo therapy. For all or some readers, the PsAMRIS showed high intrareader and interreader agreement for most baseline scores, and the inflammatory variables showed high responsiveness, especially for synovitis and tenosynovitis. The PsAMRIS was tested for the first time in the foot, providing results similar to those of the hand. The OMERACT PsAMRIS may be a valuable tool for assessing the MRI features in the hand and foot of PsA patients in clinical trials.
Acknowledgment
We thank Bristol Myers Squibb for making the MR images available.