Abstract
Objective. To investigate whether adalimumab (ADA) reduces whole-body (WB-) magnetic resonance imaging (MRI) indices for inflammation in the entheses, peripheral joints, sacroiliac joints, spine, and the entire body in patients with axial spondyloarthritis (axSpA).
Methods. An investigator-initiated, randomized, placebo-controlled, double-blinded 48-week followup trial included 49 patients with axSpA, who had Bath Ankylosing Spondylitis Disease Activity Index (BASDAI) ≥ 4.0 despite treatment with nonsteroidal antiinflammatory drugs and a clinical indication for tumor necrosis factor inhibitor treatment. Patients were randomized to subcutaneous ADA 40 mg or placebo every other week for 6 weeks; thereafter, all patients received ADA. Conventional MRI and WBMRI were performed at weeks 0, 6, 24, and 48. The primary WBMRI endpoint was the proportion of patients with an improvement in WBMRI total inflammation index above the smallest detectable change (SDC) at Week 6.
Results. The primary WBMRI endpoint (improvement of SDC > 2.3) was met in 11 (44%) patients in the ADA group and 3 (13%) patients in the placebo group (p = 0.025, Fisher’s exact test). The primary conventional MRI endpoint, the minimally important change in Spondyloarthritis Research Consortium of Canada Spine MRI Inflammation Index at Week 6, was achieved by 9 (36%) patients in the ADA group and 4 (17%) patients in the placebo group (p = 0.20). The primary clinical endpoint, BASDAI reduction > 50% or 2.0 at Week 24, was attained by 32 (65%) patients.
Conclusion. ADA provided significant reductions in WBMRI indices of peripheral, axial, and whole-body inflammation in patients with axSpA. WBMRI is promising for objective assessment and monitoring of peripheral and axial disease activity in future clinical trials.
Patients with axial spondyloarthritis (axSpA) have various patterns of involvement of sacroiliac joints (SIJ), spine, and peripheral joints and entheses that can be visualized on magnetic resonance imaging (MRI). Therefore, MRI can provide an objective measure of the inflammatory activity in different parts of the body. MRI scoring methods have been developed to assess inflammation in SIJ and the spine1,2, and MRI axial inflammation decreases during anti-tumor necrosis factor (TNF) treatment3,4,5.
Whole-body (WB-) MRI potentially allows assessment of the entire musculoskeletal system in 1 scan and may thus provide an objective assessment of the inflammatory activity of the whole patient6,7,8. MRI inflammatory lesions outside the axial skeleton improve during anti-TNF treatment9,10. Therefore, it is of considerable interest how WBMRI indices perform as outcome measures in a randomized clinical trial setting.
The aim was to investigate whether WBMRI indices of inflammation in both axial and peripheral joints and entheses are reduced by adalimumab (ADA) in patients with axSpA.
MATERIALS AND METHODS
Study design
This was a randomized, double-blind, placebo-controlled, investigator-initiated trial conducted at 5 rheumatology outpatient clinics in Denmark from February 2010 to March 2014. Patients were assigned to ADA 40 mg or placebo SC every other week for 6 weeks by site-wise randomization in blocks of 2:2. From Week 6 onward, patients in both groups received ADA. At Week 24, clinical responders [decrease in Bath Ankylosing Spondylitis Disease Activity Index (BASDAI) of 50% or 2.0] continued ADA while nonresponders were allowed treatment with other drugs following local treatment guidelines. AbbVie provided study medicine for the first 24 weeks of the study. Patients and all study personnel were blinded to treatment allocation. The study was approved by the Danish Health Authority and the Regional Ethics Committees in the Capital Region of Denmark (ethics approval number H1-2013-118). All patients gave written informed consent. (ClinicalTrials.gov, NCT01029847.)
Inclusion and exclusion criteria
Eligible patients were adults aged 18–85 years who fulfilled the Assessment of Spondyloarthritis international Society (ASAS) criteria for axSpA11, had sacroiliitis as assessed by radiography or MRI, BASDAI ≥ 4.0 despite treatment with nonsteroidal antiinflammatory drugs, and clinical indication for anti-TNF treatment.
Exclusion criteria were glucocorticoid administration within 4 weeks before inclusion, drug or alcohol abuse, contraindications to MRI or anti-TNF treatment, or otherwise being unable to fulfill the investigational program owing to physical or psychological causes. Conventional disease-modifying antirheumatic drugs were allowed if dosage had been stable for 4 weeks before inclusion.
Study procedures and outcome measures
Clinical examination, patient-reported outcomes, adverse events, and blood and urine samples were collected at baseline and at weeks 2, 6, 8, 12, 18, 24, 36, and 48. The C-reactive protein assay had a lower detection limit of 0.3 mg/l (Sentinel Diagnostics).
Physical examination included Bath Ankylosing Spondylitis Metrology Index and thorax expansion. There were 70/68 peripheral joints assessed for tenderness and swelling: temporomandibular, sternoclavicular and acromioclavicular joints, shoulder, elbow, wrist, first carpometacarpal, first to fifth metacarpophalangeal, all interphalangeal joints of the hands, hip (only assessed for tenderness), knee, ankle and tarsal joints, first to fifth metatarsophalangeal, first interphalangeal and second to fifth proximal interphalangeal joints of the feet. There were 33 entheses assessed for tenderness: first and seventh costosternal joints, anterior superior iliac spine, posterior superior iliac spine, iliac crest, ischial tuberosity, spinous process of L5, supraspinatus tendon insertion into humerus, medial and lateral epicondyles of humerus, greater femoral trochanter, medial femoral condyle, quadriceps tendon insertion into patella, patellar ligament insertion into patella, patellar ligament insertion into the tibial tuberosity, calcaneal (Achilles) tendon insertion into calcaneus, and plantar aponeurosis insertion into calcaneus.
MRI was performed at baseline before treatment initiation and at weeks 6, 24, and 48. Because dedicated SIJ sequences were not performed at Week 6, SIJ at Week 6 could only be assessed as part of the WBMRI images. Radiographs of the spine and SIJ were obtained at baseline and at Week 48.
MRI acquisition
WBMRI was performed in a Philips 3.0 Tesla scanner using 6 separate imaging stations with a whole-body quadrature coil: (1) coronal and sagittal images of cervical spine/shoulders; (2) coronal images of thoracic spine; (3) coronal images of lumbar spine and SIJ; (4) coronal images of hips and hands; (5) coronal images of knees; and (6) coronal and axial images of ankles and feet. Short-tau inversion recovery (STIR) sequences had slice thickness of 3 mm for hips and hands (resolution 1.2 × 1.2 mm), and 5 mm for all other stations (resolution 1.8 × 2.6 mm, repetition time 5257–16,832 ms, echo time 70–83, and inversion time 200 ms). Pre- and post-gadolinium (Gd) T1-weighted (T1W) spin-echo sequences without fat saturation had slice thickness of 3–5 mm (repetition time 733–1374 ms and echo time 7.6 ms, resolution 1.1 × 1.1 to 1.5 × 1.5 mm).
Conventional MRI of SIJ and spine was performed on the same scanner using appropriate surface coils and with a slice thickness of 4 mm. The spine was imaged in 3 parts by sagittal T1W spin-echo (repetition time 518 ms, echo time 8 ms) and STIR (repetition time 4990–8530, echo time 80, inversion time 120). The SIJ were imaged by semicoronal T1W spin-echo (repetition time 699, echo time 20) and STIR (repetition time 4818, echo time 60, inversion time 200).
Evaluation of WBMRI
All WBMRI images covering both the axial and peripheral musculoskeletal system were anonymized and read in chronological order by an experienced musculoskeletal radiologist (IE) who was blinded to conventional MRI images, radiography, and clinical data. Bone marrow edema (BME) was assessed on STIR sequences, with post-Gd T1W sequences used for reference only. Synovitis and entheseal soft tissue inflammation were assessed based on both STIR and post-Gd T1W sequences12. Forty-eight peripheral joints were scored at each side (right and left) for synovitis and BME: glenohumeral, acromioclavicular, sternoclavicular, wrist, carpometacarpal, first to fifth metacarpophalangeal joints, all interphalangeal joints of the hands, hip, knee, ankle, tarsometatarsal, and first metatarsophalangeal joints. Fourteen peripheral entheses, supraspinatus tendon insertion at humerus, iliac crest, ischial tuberosity, pubic symphysis, greater femoral trochanter, medial femoral condyle, and calcaneal Achilles tendon insertion were scored bilaterally for entheseal BME and entheseal soft tissue inflammation. BME and soft tissue inflammation in the immediate surroundings of the symphysis may represent either symphysitis or enthesitis. The pubic symphysis was scored separately for left and right side; inflammation more laterally on the inferior side of the pubic bones was not scored. The medial femoral condyle was scored for inflammation at the insertion of the collateral ligaments, not at the adductor tubercle; BME below the cartilage of the femorotibial joint was assigned to the knee joint. WBMRI images of SIJ were scored for BME in each of 4 quadrants (upper/lower part of iliac side and upper/lower part of sacral side) in the cartilaginous parts of the joints. The spine was scored for BME in each discovertebral unit from C2/C3 to L5/S1; spine scores were based on BME in the vertebral bodies, while inflammatory lesions in the posterior structures of the spine (e.g., facet joints and costotransverse joints) were not scored. The scoring system was 0 (absent), 1 (mild/moderate), or 2 (severe).
Eight sets of WBMRI images at baseline and at Week 24 were reanonymized and scored again in known chronology to assess intrarater reliability.
Evaluation of conventional MRI of spine and SIJ
All conventional MR images were read in chronological order and scored according to the Spondyloarthritis Research Consortium of Canada (SPARCC) Sacroiliac Joint MRI Inflammation Index and the SPARCC Spine MRI Inflammation Index1,2 by an experienced axSpA reader (SJP), blinded to WBMRI, radiography, and clinical data.
Definitions of WBMRI indices
WBMRI peripheral joint inflammation index was defined as the sum of scores for synovitis and BME for the abovementioned peripheral joints. WBMRI enthesis inflammation index was defined as the sum of scores for entheseal BME and entheseal soft tissue inflammation for the abovementioned entheses. WBMRI axial BME index was defined as the sum of scores for spine and SIJ BME. WBMRI total BME index was defined as the sum of scores for BME from peripheral joints, entheses, spine, and SIJ. WBMRI total inflammation index was defined as the sum of scores for synovitis, BME, and soft tissue inflammation from peripheral joints and entheses, spine, and SIJ (the primary WBMRI outcome measure).
Endpoints
The primary WBMRI outcome measure was improvement in WBMRI total inflammation index above smallest detectable change (SDC) at Week 6. The primary conventional MRI outcome measure was SPARCC Spine MRI Inflammation Index minimally important change (improvement ≥ 5) at Week 613. The primary clinical endpoint used for assessment of treatment response during the trial was improvement in BASDAI of ≥ 50% or ≥ 2.0 at Week 24. Secondary endpoints included ASAS partial remission, ASAS20, ASAS40, and ASAS5/6 responses, and changes in WBMRI indices.
Statistical analysis
Intention-to-treat analysis with nonresponder imputation of missing data was used for binary outcomes. Missing data for continuous outcomes were imputed with last observation carried forward. Two patients had no dedicated conventional SIJ scans at baseline, and the first available SIJ scores were used as the baseline scores. Standardized response mean (SRM) was calculated as the overall mean change score divided by SD of this change score, while Guyatts responsiveness index (GRI) was calculated as the mean change score in the ADA group divided by SD of the change score in the placebo group at Week 6; values ≥ 0.8 were judged to represent high responsiveness14.
The 95% CI for risk difference (difference between the proportions of patients with the outcome of interest) were calculated with continuity correction because of the small sample size. The statistical analyses included Fisher’s exact test, Mann-Whitney U test, Wilcoxon signed-rank test, and Pearson correlation. ANCOVA analysis with group allocation and baseline WBMRI total inflammation score as covariates and change in WBMRI total inflammation score as outcome was performed as a posthoc secondary analysis to take differences in baseline values into account. Also, logistic regression with WBMRI total inflammation responder status at Week 6 as outcome and baseline WBMRI total inflammation score and group allocation as covariates was performed as posthoc secondary analysis. Similar analyses were performed for SPARCC Spine Inflammation Index.
Intrarater reliability was calculated using a 2-way random effects model, ICC(3,1), based on absolute agreement. ICC ≥ 0.6 were considered to represent good reliability, and ≥ 0.8 very good reliability. SDC was calculated from baseline to Week 24 as 1.96 × (SDdifference in change scores)/√2, where SD is the standard deviation of the differences in change score between 2 separate scorings performed by the same reader (IE)15. All statistical analyses were performed using R version 3.4.0.
RESULTS
Study participants
Fifty patients were included from February 2010 to March 2013 (Figure 1). The intention-to-treat analyses were based on 49 patients because all data from 1 patient were unavailable as a result of an administrative error.
Baseline characteristics were similar among the randomization groups, except that the ADA group had more men, longer symptom duration (Table 1), higher baseline WBMRI peripheral joint inflammation, WBMRI enthesis inflammation, and WBMRI total inflammation indices (Table 2). Ten patients discontinued the study (Figure 1). Four patients stopped ADA because of adverse events; during the trial, 1 serious adverse event judged as probably not causally related to ADA was observed.
Primary imaging endpoints
At Week 6, the WBMRI total inflammation index had decreased in 11 (44%) patients in the ADA group and 3 (13%) patients in the placebo group; risk difference was 32% (95% CI 4%–59%, p = 0.025; Figure 2 and Figure 3). The primary conventional MRI endpoint of SPARCC Spine MRI Index minimally important change at Week 6 was met by 9 (36%) of the patients treated with ADA and 4 (17%) of the patients treated with placebo; risk difference was 19% (95% CI −9% to 47%, p = 0.20).
Intrarater reliability was very good or good for most WBMRI indices (Table 3). SDC ranged from 1.0 to 2.7 for the different WBMRI indices. SDC for the WBMRI total inflammation index was 2.3. In posthoc secondary analysis using logistic regression adjusted for baseline WBMRI total inflammation index, the difference between groups in total inflammation index responders was still significant (p = 0.048). SPARCC Spine minimally important change was also not significant when adjusted for baseline SPARCC Spine (p = 0.16).
Clinical endpoints at weeks 6, 24, and 48
The clinical endpoint of reduction in BASDAI of 50% or 2.0 at Week 6 was met by 13 (52%) patients in the ADA group and 3 (13%) in the placebo group (p = 0.005). Thereafter, all patients received ADA, and at Week 24 this endpoint was met by 18 (72%) patients in the ADA group and 14 (58%) in the placebo group (p = 0.38). Among the 32 patients who were clinical responders at Week 24, there were 27 (84%) who were still clinical responders at Week 48. Two clinical nonresponders at Week 24 were judged by the treating physician as having improved significantly and continued ADA; both were clinical responders at Week 48. Six clinical nonresponders at Week 24 were switched to etanercept; 2 of these were clinical responders at Week 48. A total of 31 (63%) patients were clinical responders at Week 48.
Secondary imaging endpoints
The WBMRI enthesis inflammation index decreased significantly in the ADA group (mean change −0.9) compared with the placebo group (+0.4) at Week 6 (Table 2). In the ADA group, WBMRI enthesis BME index and enthesis soft tissue inflammation index both decreased at Week 6, while they were largely unchanged in the placebo group.
WBMRI axial BME index
WBMRI total BME index and total inflammation index decreased significantly at Week 6 in the ADA group, as well as at weeks 24 and 48 in both groups in comparison to baseline. At Week 6, changes in these indices differed significantly between groups. No significant changes were observed in WBMRI peripheral joint synovitis index or peripheral joint BME index.
In posthoc secondary analysis, WBMRI total inflammation index at Week 6 improved significantly in the ADA group compared to the placebo group when adjusted for baseline WBMRI total inflammation index (p = 0.011). Change in SPARCC Spine was significant at Week 6 in ANCOVA analysis with adjustment for baseline SPARCC Spine (p = 0.013). Overall, the SPARCC SIJ minimally important change (≥ 2.5) at Week 24 was attained by 19 (39%) patients.
Other secondary endpoints
At Week 6, ASAS partial remission was reached by 5 (20%) patients in the ADA group and none in the placebo group (p = 0.05). Moreover, ASAS20 was reached by 13 (52%) patients in the ADA group and 4 (17%) patients in the placebo group (p = 0.02); ASAS40 was reached by 12 (48%) in the ADA group and 1 (4%) in the placebo group (p = 0.001); and ASAS5/6 was reached by 13 (52%) patients in the ADA group and 1 (4%) patient in the placebo group (p < 0.001).
Responsiveness and construct validity of WBMRI indices
Responsiveness as assessed by GRI at Week 6 was good for WBMRI enthesis BME, axial BME index, total BME index, and total inflammation index, while the other indices had lower responsiveness. A similar pattern was seen with SRM at weeks 6, 24, and 48, where the 2 most composite indices, WBMRI total BME index and total inflammation index, had good or moderate responsiveness in the range of 0.65 to 0.86, while the individual components had somewhat lower responsiveness (Table 3). At baseline, the Pearson correlation coefficient between WBMRI SIJ score and SPARCC SIJ MRI Index was 0.75, while the correlation between WBMRI spine score and SPARCC Spine MRI Index was 0.80.
DISCUSSION
Our present study, using WBMRI indices in a randomized controlled trial setting, documents the effect of TNF inhibitor therapy on the objective inflammatory load in patients with axSpA. A beneficial effect of ADA compared to placebo was seen already at Week 6 both for axial BME, peripheral entheseal inflammation, and total whole-body inflammation. Our study confirms the known antiinflammatory axial effect of ADA, but the effect of TNF inhibitor therapy on WBMRI inflammation indices has, to our knowledge, not previously been shown. The intrarater reliability of WBMRI assessments was generally good, especially for the most comprehensive indices (total BME index and total inflammation index) and the method offers significant potential as a measure of the total inflammatory burden in patients with arthritis.
Interestingly, peripheral enthesitis represented a significant part of the WBMRI total inflammation index, even though the patients were selected by the presence of sacroiliitis by imaging. The WBMRI enthesis inflammation index decreased significantly more in the ADA group compared to the placebo group at Week 6. The frequent finding of MRI enthesitis is in line with anatomical investigations that point to the synovioentheseal complex as the initial site of inflammation in SpA16,17, and for many years, it has indeed been known that MRI may visualize enthesitis and a range of other causes of regional pain18,19,20,21.
The SDC was relatively low (2.2–2.3) for WBMRI total inflammation index and total BME index. MRI scans were read in known chronology; if they were read in random order, SDC would possibly be somewhat larger. The highest ICC among the WBMRI indices were found for the WBMRI total inflammation index (status score 0.93, change score 0.85) and total BME index (status score 0.96, change score 0.91), and these composite scores had overall good responsiveness. Overall, this suggests that further work on WBMRI as a sensitive objective outcome measure is justified and that the WBMRI total inflammation index, total BME index, and enthesis inflammation index seem to be most promising. The indices should be validated in independent cohorts.
Slice thickness was 3–5 mm of the STIR sequences, which, because of partial volumen averaging, makes it hard to confidently judge the anatomical location of small high-signal areas in small structures and at tissue boundaries. Improved resolution may allow for better ICC as well as greater sensitivity, but longer scan time was judged unfeasible.
We chose simple addition of scores to obtain the indices, well aware that other weightings of the scores may lead to higher correlations with patient-reported outcomes. Future studies may explore different ways of weighting scores according to the size or functional importance of the affected anatomical structures.
The ABILITY-1 trial demonstrated the efficacy of ADA in patients with nonradiographic axSpA4. Compared with the ADA arm of the ABILITY-1 trial at Week 12, the composite endpoints such as ASAS40 and Ankylosing Spondylitis Disease Activity Score major improvement were reached by a larger proportion of patients already at Week 6 in the ADA arm of our present trial. Our baseline scores for SPARCC Spine MRI Index and SPARCC SIJ MRI Index were about twice as high as in ABILITY-1, and because MRI inflammation is a known predictor for treatment response to TNF inhibitors in patients with axSpA22, this difference may explain the higher clinical response rates in our study.
Based on the current data, WBMRI total inflammation index can discriminate between the treatment and placebo groups in a randomized controlled trial setting with a total of 50 patients with SpA randomized to the 2 treatment groups. Other properties are also important for an outcome measure to be potentially useful, such as face and content validity. The comprehensive WBMRI approach would seem to have an advantage rather than only assessing the SIJ and spine, particularly in patients with widespread disease located in many different anatomical structures. Regarding feasibility, the more comprehensive WBMRI approach demands longer scan times, as well as image analysis and scoring, but novel MRI scanners and software with ensuing improvements in image quality and scan time, as well as the possible development of automated image analysis software, may improve the feasibility of this approach in the future. The sagittal scan planes used in our trial are not optimal for visualizing all subsets of peripheral joints and entheses; a consensus proposal for scan planes and pathology to be assessed using WBMRI has recently been published23. In our study, we chose to score entheses on WBMRI that are part of existing clinical enthesitis indices and that were judged to be possible to evaluate with the scan planes performed and the current image quality.
Our investigator-initiated trial shows that ADA reduces WBMRI indices of inflammation in both axial and peripheral joints and entheses, in parallel with improvements in conventional clinical measures of disease activity. To our knowledge, this study is the first to document significant improvements in axial and peripheral inflammatory disease activity assessed with WBMRI in a randomized, double-blind, placebo-controlled trial in patients with axSpA. Our present study encourages further development and validation of WBMRI indices as future outcome measures in peripheral SpA and axSpA.
Footnotes
The work was supported by AbbVie. AbbVie had no role in the study design or in the collection, analysis, or interpretation of the data, the writing of the manuscript, or the decision to submit the manuscript for publication; publication of this article was not contingent upon approval by AbbVie. SK had a grant from The Danish Rheumatism Association (A3866).
- Accepted for publication November 17, 2017.