The Journal of Rheumatology
Research Article

Limited Reliability of Radiographic Assessment of Sacroiliac Joints in Patients with Suspected Early Spondyloarthritis

Alice Ashouri Christiansen, Oliver Hendricks, Dorota Kuettel, Kim Hørslev-Petersen, Anne Grethe Jurik, Steen Nielsen, Kaspar Rufibach, Anne Gitte Loft, Susanne Juhl Pedersen, Louise Thuesen Hermansen, Mikkel Østergaard, Bodil Arnbak, Claus Manniche and Ulrich Weber
The Journal of Rheumatology January 2017, 44 (1) 70-77; DOI: https://doi.org/10.3899/jrheum.160079
Alice Ashouri Christiansen
From the King Christian 10th Hospital for Rheumatic Diseases, Gråsten; Hospital of Southern Jutland, Aabenraa; Institute of Regional Health Research, University of Southern Denmark, Odense; Department of Radiology, and Department of Rheumatology, Aarhus University Hospital, Aarhus; Research Department, Spine Centre of Southern Denmark, Hospital Lillebaelt Middelfart, Middelfart; Department of Internal Medicine, Hospital Lillebaelt Vejle, Vejle; Copenhagen Center for Arthritis Research (COPECARE), Center for Rheumatology and Spine Diseases, Rigshospitalet – Glostrup, Glostrup; Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark; Rufibach rePROstat, Biostatistical Consulting and Training, Basel, Switzerland.
  • For correspondence: achristiansen@gigtforeningen.dk

Abstract

Objective. To determine the reproducibility of evaluation of sacroiliac joint (SIJ) radiographs among readers with varying levels of experience, and to identify potential drivers of disagreement in classification among 5 predefined radiographic lesion types.

Methods. The study sample consisted of 104 consecutive patients aged 18–40 with low back pain ≥ 3 months of duration who met the Assessment of SpondyloArthritis international Society (ASAS) definition for a positive SIJ magnetic resonance image, or were HLA-B27–positive and had ≥ 1 spondyloarthritis (SpA)-related clinical/laboratory feature according to the ASAS classification criteria for axial SpA. Seven blinded readers (2 musculoskeletal radiologists, 5 rheumatologists) classified pelvic radiographs according to the modified New York criteria (mNY) and recorded presence/absence of 5 lesion types in both SIJ: erosion, sclerosis, ankylosis, joint space widening, and joint space narrowing. Reproducibility of mNY classification among 21 reader pairs was assessed and potential drivers of disagreement were identified among 5 lesion types. A generalized linear mixed logistic regression model served to analyze to what extent discordance in lesion type was associated with discrepant mNY classification.

Results. Mean κ values (percent concordance) were 0.39 (84.1%) for mNY classification over 21 reader pairs, 0.46 (79.8%) between 2 musculoskeletal radiologists, and 0.55 (86.5%) and 0.36 (77.9%) between the most experienced rheumatologist and the 2 radiologists. Erosion showed the lowest agreement (25%) among patients with discordant classification and gave the highest OR of 13.5 for disagreement.

Conclusion. Reproducibility of radiographic SIJ classification in an SpA inception cohort was only fair to at best moderate among 7 readers with varying levels of experience, questioning the applicability of mNY in early SpA.

Key Indexing Terms:
  • SPONDYLOARTHRITIS
  • RADIOGRAPHIC SACROILIITIS
  • INTERREADER AGREEMENT
  • MODIFIED NEW YORK CRITERIA

Radiographic evaluation of sacroiliac joints (SIJ) according to the modified New York criteria (mNY)1 is the gold standard in the classification of axial spondyloarthritis (axSpA) and may affect treatment decisions in this chronic inflammatory condition. However, several studies have consistently shown limited agreement among trained readers in radiographic classification of SIJ, with κ values around 0.5 (references 2,3,4). The limited reproducibility of SIJ evaluation on pelvic radiographs of patients suspected of having SpA was also featured at a public hearing of the US Food and Drug Administration5. Two interventional trials in patients with nonradiographic axSpA used the mNY, assessed by local rheumatology and radiology readers from different sites, as an inclusion criterion. A posthoc analysis by trained central readers resulted in the reclassification of 36% and 37% of the patients regarding fulfillment of the radiographic mNY6,7.

These concerns about low reliability of radiographic mNY were confirmed by a report highlighting at best moderate reproducibility of SIJ evaluation on pelvic radiographs by rheumatologist and radiologist readers, which even put the role of radiographic sacroiliitis for classification of axSpA into question3. However, possible data-driven explanations for the marked variability in interpretation of SIJ radiographs are scarce. We therefore hypothesized that certain radiographic lesion types contained in the radiographic mNY such as erosion, sclerosis, or joint space variation may contribute more to interreader disagreement than others.

The objectives of our study in an SpA inception cohort recruited from primary care were (1) to determine the reproducibility of radiographic SIJ classification according to the mNY among 7 rheumatology and radiology readers with varying levels of experience in imaging in SpA, and (2) to identify potential drivers of disagreement in classification among 5 predefined radiographic lesion types according to the mNY.

MATERIALS AND METHODS

Patients

Our study sample was recruited from the cohort Spines of Southern Denmark, which has been described in detail elsewhere8,9,10. Briefly, the cohort consisted of 1037 patients aged 18–40 years referred to the Spine Centre of Southern Denmark, Middelfart, for evaluation of low back pain of 2–12 months’ duration that was refractory to treatment in primary care.

All referred patients were screened according to a standardized protocol, which included a clinical visit, back pain questionnaires, laboratory testing [HLA-B27, high-sensitivity C-reactive protein (CRP)], and magnetic resonance imaging (MRI) of the SIJ and the entire spine. Patients with back pain of ≥ 3 months’ duration, who either fulfilled the Assessment of SpondyloArthritis international Society (ASAS) criteria for a positive SIJ MRI11 or were HLA-B27–positive with at least 1 concomitant clinical or laboratory feature suggestive of SpA according to ASAS classification criteria for axSpA12 were referred for clinical evaluation by 1 of 3 specialists in rheumatology (AGL, LHH, or OH). ASAS concomitant clinical or laboratory features suggestive of SpA were inflammatory back pain according to ASAS criteria13, arthritis, heel enthesitis, uveitis, dactylitis, psoriasis, inflammatory bowel disease, good response to nonsteroidal antiinflammatory drugs, family history of SpA, and elevated CRP.

Our study sample consisted of 104 patients in whom a diagnosis of axSpA was considered possible by the clinical rheumatologic assessment, and in whom pelvic radiographs of sufficient technical quality were available. Among the 104 patients, 92 met the ASAS criteria for a positive SIJ MRI and 12 were HLA-B27–positive showing ≥ 1 clinical or laboratory SpA feature. Eighty-one patients (77.9%) met the ASAS criteria for axSpA: 56 (53.8%) through the imaging arm only (MRI-only), 8 (7.7%) through the clinical arm only, and 17 (16.3%) through both arms. Twenty-three patients (22.1%) did not meet the ASAS criteria for axSpA: 19 (18.3%) with a positive SIJ MRI only and 4 (3.8%) being HLA-B27–positive with only 1 SpA feature.

The study was approved by the Danish Data Protection Agency and by the Ethics Committee of the Region of Southern Denmark (project ID S-20110029). All participating patients gave written informed consent.

Evaluation of SIJ radiographs

SIJ radiographs were obtained according to local protocols used in daily routine in 6 radiology departments in Denmark. Among the 104 SIJ radiographs, 88 (84.6%) were standard anteroposterior pelvic radiographs, 14 (13.5%) were radiographs of the lumbar spine including the SIJ, and 2 examinations (2.0%) consisted of oblique SIJ projections. All 104 digital SIJ radiographs were centrally anonymized and randomized. Seven readers (2 musculoskeletal radiologists, 5 rheumatologists) blinded to clinical, biochemical, and MRI data independently assessed the SIJ radiographs in random order on electronic workstations. First, the readers classified the SIJ radiographs according to the mNY that was considered met if there was at least bilateral grade 2 or unilateral grade 3 sacroiliitis1. Second, the readers recorded the presence/absence of 5 radiographic lesion types in both SIJ as described in the mNY: erosion, sclerosis, ankylosis, joint space widening (JSW), and joint space narrowing (JSN). Erosion and sclerosis were recorded per 4 joint surfaces, i.e., on the sacral and the iliac side of the right and left SIJ, respectively, whereas ankylosis, JSW, and JSN were reported separately per right and left SIJ, respectively. We followed the definitions of SIJ grades and radiographic lesion types as stated in the mNY1: grade 0 = normal, grade 1 = suspicious changes, grade 2 = minimum abnormality (small localized areas with erosion or sclerosis, without alteration in the joint width), grade 3 = unequivocal abnormality (moderate or advanced sacroiliitis with erosion, evidence of sclerosis, widening, narrowing, or partial ankylosis), and grade 4 = severe abnormality or total ankylosis. SIJ scores and radiographic lesions were entered into a standardized electronic data sheet identical to the one used during reader calibration.
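The classification rule described above (at least bilateral grade 2 or unilateral grade 3 sacroiliitis) can be written as a short function. This is a minimal sketch for illustration only; the function name `meets_mny` and its arguments are hypothetical and are not taken from the study's analysis code.

```python
def meets_mny(right_grade: int, left_grade: int) -> bool:
    """Return True if the radiographic mNY criterion is met.

    Rule as stated in the text: at least bilateral grade 2
    or unilateral grade 3 sacroiliitis. Grades run from 0 to 4.
    """
    for grade in (right_grade, left_grade):
        if grade not in range(5):
            raise ValueError(f"SIJ grade must be 0-4, got {grade}")
    # Bilateral grade >= 2: the less affected joint is at least grade 2.
    bilateral_grade2 = min(right_grade, left_grade) >= 2
    # Unilateral grade >= 3: the more affected joint is at least grade 3.
    unilateral_grade3 = max(right_grade, left_grade) >= 3
    return bilateral_grade2 or unilateral_grade3
```

Under this rule, a unilateral grade 2 finding (for example, `meets_mny(2, 0)`) is classified as mNY-negative, whereas `meets_mny(2, 2)` and `meets_mny(3, 0)` are mNY-positive.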

Reader calibration

The 7 readers consisted of 2 senior musculoskeletal radiologists having more than 20 years each of experience in interpretation of pelvic radiographs (AGJ, SN), and of 3 senior and 2 junior staff rheumatologists from 1 institution (King Christian 10th Hospital for Rheumatic Diseases, Gråsten, Denmark). The 2 radiologists came from different institutions and were not involved previously in shared imaging research. One of the rheumatologist readers (UW), who had more than 10 years of research experience in conventional and tomographic imaging in SpA, was responsible for calibration of the reader team.

All 7 readers were calibrated by reference images of pelvic radiographs covering all mNY grades. The reference images were derived from clinical practice in patients with various stages of SpA to best match the original grading description, which lacks standardized and validated lesion definitions. The definitions of the 5 grades were adopted from the original description of the mNY1, which was based on the Atlas of Standard Radiographs in Arthritis14. Because of their longstanding experience in scoring SIJ on pelvic radiographs, the 2 musculoskeletal radiologists did not participate in the additional calibration for the rheumatologists. The 5 rheumatologists had three 2-h calibration sessions and independently performed a training readout. The first session consisted of an introduction to the scoring method, a review of the relevant literature, and a group discussion of 10 pelvic radiographs. This was followed by an independent evaluation of 15 pelvic radiographs by each rheumatologist according to the same scientific protocol that was later used in the main study. SIJ scores and radiographic lesion types reported in this training readout were evaluated in a second calibration session. A third calibration session with group discussion of another 10 pelvic radiographs served to refine the reference images set. All pelvic radiographs used in the training sessions were unrelated to the main study.

Descriptive analysis

Categorical demographic, clinical, and laboratory variables were described as proportion of subjects showing these features, and continuous variables as median [interquartile range (IQR)]. We expressed the presence of single radiographic features and fulfillment of the mNY as mean proportion of study subjects over 7 readers, and as mean proportions stratified according to level of reader experience. To determine the frequency of advanced sacroiliitis in our sample, we calculated the proportion of study subjects showing SIJ scores > 2 in the right and left SIJ separately. Presence of erosion and sclerosis was defined as ≥ 1 lesion on ≥ 1 of the 4 joint surfaces on both sides, while ankylosis, JSW, and JSN were defined as ≥ 1 lesion in ≥ 1 of the 2 joints, respectively. The frequency of the 5 radiographic lesions was calculated as mean proportion of patients having each lesion type over 7 readers, and as proportion of each lesion type among mNY-positive and mNY-negative study subjects for all 7 readers individually. Finally, we calculated the frequency of ≥ 2 concomitant lesion types per patient.

Interreader agreement

Interreader agreement for classification according to the mNY and for the 5 radiographic features was assessed by means of 2 × 2 tables and calculating percent agreement (total; positive/negative) and by Cohen’s κ (reference 15). Interreader agreement for the ordinal SIJ grades for both sides separately was evaluated by weighted Cohen’s κ. Agreement was interpreted according to Landis and Koch16 as slight (κ < 0.2), fair (0.2 ≤ κ < 0.4), moderate (0.4 ≤ κ < 0.6), substantial (0.6 ≤ κ < 0.8), and almost perfect (0.8 ≤ κ < 1.0). The computations were made for each reader pair and for all readers jointly as mean value over all 21 reader pairs. For the pairwise κ values, a 95% bootstrap CI based on 1000 bootstrap replications was provided. We additionally compared 5 selected reader pairs regarding agreement: the 2 musculoskeletal radiologists, the most experienced rheumatologist versus each of the 2 musculoskeletal radiologists, and the 2 senior and the 2 junior rheumatologists. The proportion of concordant single grades according to mNY among ≥ 2 readers (any reader pair) and ≥ 4 readers (majority of readers) was described for the right and left SIJ separately.
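The agreement measures described above can be sketched in a few lines of Python. This is an illustrative sketch, not the study's R code: all function names are hypothetical, ratings are assumed to be coded 0/1, and the percentile method for the bootstrap CI is an assumption, because the text does not state which bootstrap interval was used.

```python
import math
import random
from itertools import combinations
from statistics import mean

def cohen_kappa(a, b):
    """Unweighted Cohen's kappa for two readers' binary (0/1) ratings."""
    n = len(a)
    if n == 0 or n != len(b):
        raise ValueError("rating lists must be non-empty and equally long")
    p_obs = sum(x == y for x, y in zip(a, b)) / n      # observed agreement
    pa, pb = sum(a) / n, sum(b) / n                    # positive rates
    p_exp = pa * pb + (1 - pa) * (1 - pb)              # chance agreement
    if p_exp == 1.0:
        return math.nan  # kappa is undefined if both readers never vary
    return (p_obs - p_exp) / (1 - p_exp)

def landis_koch(kappa):
    """Agreement category using the cutoffs quoted in the text."""
    if kappa < 0.2:
        return "slight"
    if kappa < 0.4:
        return "fair"
    if kappa < 0.6:
        return "moderate"
    if kappa < 0.8:
        return "substantial"
    return "almost perfect"

def n_reader_pairs(n_readers):
    """Number of distinct reader pairs, e.g. 7 readers give 21 pairs."""
    return len(list(combinations(range(n_readers), 2)))

def mean_pairwise_kappa(ratings):
    """Mean kappa over all reader pairs; ratings maps reader -> 0/1 list."""
    pairs = combinations(sorted(ratings), 2)
    return mean(cohen_kappa(ratings[r1], ratings[r2]) for r1, r2 in pairs)

def bootstrap_kappa_ci(a, b, n_boot=1000, level=0.95, seed=0):
    """Percentile bootstrap CI for kappa (interval type is an assumption)."""
    rng = random.Random(seed)
    n = len(a)
    stats = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]     # resample patients
        k = cohen_kappa([a[i] for i in idx], [b[i] for i in idx])
        if not math.isnan(k):
            stats.append(k)
    if not stats:
        raise ValueError("kappa undefined in every bootstrap sample")
    stats.sort()
    alpha = (1 - level) / 2
    lo = stats[int(alpha * len(stats))]
    hi = stats[min(len(stats) - 1, int((1 - alpha) * len(stats)))]
    return lo, hi
```

With these cutoffs, a κ of 0.39 falls in the "fair" category and a κ of 0.46 in the "moderate" category. The sketch covers the unweighted κ only; the weighted κ used for the ordinal SIJ grades is not shown.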

Candidate lesion types driving discrepancies in mNY classification

To assess the relative contribution of each of the 5 lesion types to disagreement in mNY classification, we first identified patients with discrepant mNY classification for each reader pair. Among these, we computed the proportion of patients with concordance for each radiographic lesion type for all reader pairs.
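For a single reader pair and a single lesion type, the two steps described above could look like the following sketch. The function name and the 0/1 coding of the inputs are assumptions made for illustration.

```python
def lesion_concordance_among_discrepant(mny_a, mny_b, lesion_a, lesion_b):
    """Proportion of mNY-discrepant patients on whom two readers agree
    about one lesion type.

    All four arguments are per-patient 0/1 lists in the same patient order:
    mNY classification and lesion presence for reader A and reader B.
    Returns None when the pair has no discrepant mNY classifications.
    """
    # Step 1: patients for whom the two readers disagree on mNY.
    discrepant = [i for i, (x, y) in enumerate(zip(mny_a, mny_b)) if x != y]
    if not discrepant:
        return None
    # Step 2: among those, count patients with matching lesion calls.
    concordant = sum(lesion_a[i] == lesion_b[i] for i in discrepant)
    return concordant / len(discrepant)
```

Repeating this calculation for every reader pair and every lesion type gives one proportion per pair and lesion, which can then be summarized across the 21 pairs.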

Finally, a generalized linear mixed logistic regression model was computed to estimate the relative effect size of each individual radiographic lesion type. Results were expressed as OR for disagreement in mNY classification with 95% CI. P values ≤ 0.05 were considered significant.

All computations were done with R (R Core Team, version 3.1.1).

RESULTS

Descriptive analysis

Of the 104 patients, 38.5% were men and 33.7% were HLA-B27–positive (Table 1). Median age was 33.0 years. Over all 7 readers, a mean proportion of 15.7% of the patients met the mNY, and 8.1% showed mNY grades 3 or 4 (Table 1). Sclerosis and erosion were the 2 most frequent lesions reported in 50.1% and in 25.7% of the patients, respectively. The 3 more experienced readers scored more lesions of all types than the 4 less experienced readers, and they also considered more patients to be mNY-positive (21.5% vs 11.3%). Patients with erosion concomitantly showed sclerosis in 93.5%, JSN in 48.5%, JSW in 27.8%, and ankylosis in 19.5%. The distribution of the 5 lesion types among mNY-positive and -negative patients for all 7 readers individually is shown in Figure 1. Among the 5 radiographic lesion types, erosion and sclerosis showed the largest variation between individual readers. The most frequent constellation when reporting erosion in mNY-negative patients was unilateral grade 2 sacroiliitis (data not shown). Both more and less experienced readers reported joint space alterations in a small minority of subjects classified as mNY-negative.

Figure 1.

Distribution of 5 lesion types among mNY-positive and -negative patients for all 7 readers individually. mNY: modified New York criteria; Rad1: musculoskeletal radiologist 1; Rad2: musculoskeletal radiologist 2; Rh1: most experienced rheumatologist; Rh2, Rh3: 2 senior rheumatologists; Rh4, Rh5: 2 junior rheumatologists.

Table 1.

Patient characteristics and distribution of radiographic features. Values are n (%) unless otherwise specified.

Interreader agreement

Kappa (percent) agreement for mNY classification was 0.39 (84.1%) over 7 readers, 0.46 (79.8%) between 2 musculoskeletal radiologists, and 0.55 (86.5%) and 0.36 (77.9%) between the most experienced rheumatologist and each of the 2 musculoskeletal radiologists, respectively (Table 2). Among the rheumatologists less experienced in radiographic SIJ assessment, agreement between the 2 senior and the 2 junior rheumatologists was 0.34 (84.6%) and 0.27 (87.5%), respectively. Among the 5 radiographic lesion types, ankylosis showed the highest (κ 0.34) and JSW the lowest agreement (κ 0.12) over 21 reader pairs. Reliability among all 21 reader pairs for standard pelvic versus lumbar spine radiographs was in the agreement category “fair” as defined above with mean κ values of 0.39 and 0.33, respectively.

Table 2.

Agreement of all 21 reader pairs and among selected reader pairs: Cohen’s κ values (95% CI; upper row) and percent agreement (positive/negative; lower row).

Candidate lesion types driving discrepancies in mNY classification

Among the 21 reader pairs, 15.9% of the patients had discrepant mNY classification. Among patients with discordant mNY classification, erosion was the lesion with the lowest interreader agreement: the proportion of mNY-discrepant patients with concordance for erosion was only 0.25 (IQR 0.09; Figure 2). The assessment of the effect size of each of the 5 lesion types showed that erosion was the strongest driver of discordance in mNY classification. Erosion was associated with statistically significant 13.5× higher odds (95% CI 9.1–20.1) for discrepant mNY classification (Table 3).
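As a quick plausibility check on the reported OR and its CI, the sketch below assumes a Wald-type interval that is symmetric on the log-odds scale. That assumption is not stated in the text, and the function name is hypothetical. Under it, the geometric mean of the CI limits should recover the point estimate, and the width of the interval gives the implied standard error of the log OR.

```python
import math

def wald_summary_from_ci(ci_low, ci_high, z=1.959964):
    """Back out the log-scale midpoint and standard error from a 95% CI.

    Assumes the CI was built as exp(log(OR) +/- z * SE), which is an
    assumption about the reporting, not something stated in the source.
    """
    midpoint = math.sqrt(ci_low * ci_high)            # exp of log-scale mean
    se_log_or = (math.log(ci_high) - math.log(ci_low)) / (2 * z)
    return midpoint, se_log_or

# Reported values for erosion: OR 13.5 (95% CI 9.1-20.1).
midpoint, se_log_or = wald_summary_from_ci(9.1, 20.1)
```

For the reported limits, the midpoint comes out at about 13.5 and the implied standard error of the log OR at about 0.20, which is consistent with the point estimate given in the text.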

Figure 2.

Proportion of patients with concordant lesion types among patients with discordant mNY classification for all 21 reader pairs. The horizontal bands within the boxes represent the medians. mNY: modified New York criteria.

Table 3.

Relative contribution of 5 predefined lesion types to disagreement in modified New York criteria classification (generalized linear mixed model).

Figure 3 shows a pelvic radiograph in which 4 of 7 readers considered the mNY as being met and 5 of 7 readers scored erosion.

Figure 3.

Pelvic radiograph illustrating disagreement in modified New York criteria (mNY) and erosion among 7 radiology and rheumatology readers with varying experience in imaging in spondyloarthritis (SpA). Radiograph is of a 36-year-old man suspected of having early SpA. The ASAS criteria for axSpA were met, as well as the ASAS criteria for a positive sacroiliac joint magnetic resonance image. Four of 7 readers (2 radiologists and 2 rheumatologists) considered the mNY met, and 5 of 7 readers (2 radiologists and 3 rheumatologists) reported presence of erosion. Sclerosis and joint space irregularities show mainly in the middle portion of both SIJ, while the lowest third, which represents mostly the cartilaginous joint compartment and is usually involved first by inflammation, seems to be less affected. The right joint displays additionally a lumbosacral transitional anomaly with an accessory joint between the transverse process of the fifth lumbar vertebra and the basis of the sacrum. ASAS: Assessment of SpondyloArthritis international Society; axSpA: axial SpA; SIJ: sacroiliac joint.

DISCUSSION

Our study on the reliability of radiographic SIJ classification according to the mNY in an SpA inception cohort suggests that SIJ erosion may be the primary driver of interreader disagreement. The only fair to at best moderate level of concordance for mNY (κ 0.39) among 7 radiology and rheumatology readers with varying experience in imaging in SpA was even slightly lower than the moderate agreement (κ 0.54) reported between 2 central readers in another axSpA inception cohort3.

The limited reproducibility of radiographic SIJ classification according to the mNY is well documented. However, the characteristics of a given study sample may affect the level of interreader agreement. Previous reports suggest that the higher the proportion of patients with ankylosing spondylitis (AS) in a given study sample, the better the concordance in radiographic mNY. A study from Turkey applying the radiographic mNY in patients with Behçet disease recorded pre/post-training κ agreement of 0.32/0.19, 0.32/0.36, and 0.44/0.41 for 3 reader pairs (1 radiologist and 2 rheumatologists, respectively)2. Our study with 15.7% mNY-positive patients showed a κ concordance of 0.39 among 7 radiology and rheumatology readers. In a report on patients with inflammatory back pain suggestive of axSpA, 21.1%/26.6% (central/local reading) had obvious sacroiliitis; concordance for radiographic mNY by κ values was 0.54 among 2 trained central rheumatologist readers and 0.55 for central versus local radiologist and rheumatologist reading3. A study assessing the rate of radiographic sacroiliitis progression over 2 years consisted of a high proportion of 54.8% patients with AS4. Interreader agreement between 2 trained rheumatology readers blinded to time sequence was moderate at baseline with a κ value of 0.57, but increased to a substantial κ of 0.67 at followup together with a progression rate from nonradiographic SpA to AS of 11.6% over 2 years. The highest interobserver concordance was reported in a study of 217 patients with AS who all met the mNY17. Kappa values between 2 trained rheumatologist readers were 0.68, 0.69, and 0.66 at baseline, 1-year followup, and 2-year followup.

Our agreement for classification according to the radiographic mNY (κ 0.39) was higher than for any single one of the 5 radiographic lesion types (κ values 0.12–0.34). This is in line with the above-mentioned report on inflammatory back pain patients, in which κ values for the single lesion types ranged from 0.12 to 0.44 as opposed to agreement of 0.54 for mNY3.

A potential source of disagreement is the lack of standardized and validated definitions for each of the radiographic lesion types contained in the original description1,14, which were used in our study. However, it remains to be shown whether an attempt to standardize and validate lesion definitions might facilitate agreement in view of the broad morphologic spectrum of the radiographic lesion types.

All lesions except sclerosis contributed to discordant mNY classification, but erosion was the main driver of disagreement. Technical issues such as bowel overlapping the SIJ or various radiographic SIJ projections can only partially explain this finding because they also affect recognition of other radiographic lesions such as sclerosis or joint space variation. Our results need to be confirmed in other cohorts of patients with clinically suspected axSpA because erosion is widely regarded as a key lesion indicating radiographic sacroiliitis.

Our SpA inception cohort recruited from primary care with low back pain of ≥ 3 months’ duration showed a low frequency of HLA-B27 and male sex. Multiple studies in other early axSpA cohorts have shown lower proportions of male sex and HLA-B27 positivity when compared with AS18,19,20,21,22. However, these cohorts were not or not entirely based on recruitment from primary care, and usually excluded patients with just suspected SpA, which may explain the higher prevalence of male sex and HLA-B27 positivity, when compared to ours. A Dutch cohort of patients with suspected axSpA similar to ours and also recruited from primary care23 showed an even lower proportion of HLA-B27 positivity of 20% among patients meeting the ASAS criteria for axSpA. Our cohort reflects daily routine in which young patients with treatment-refractory back pain referred from primary care with suspected early SpA often need to be followed over time before a final diagnosis can be made. However, pelvic radiographs are often performed as 1 element of the rheumatologic evaluation in such a clinical setting of suspected early SpA, despite the limited evidence of whether they may enhance confidence in a diagnosis of early SpA.

The mNY derived from a cohort of 183 HLA-B27–positive patients with AS, their HLA-B27–positive or –negative first-degree relatives, and population controls1 may not be directly applicable to chronic back pain patients clinically suspected of having axSpA. Further, there are no normative data regarding frequency and morphology of the 5 radiographic mNY lesion types in healthy controls, mechanical back pain patients, subjects with increased physical activity, or multiparous women. A back pain cohort from chiropractic practices in Canada with a recruitment mode similar to ours but with older patients showed degenerative SIJ changes in 35.2% of 142 women ages 18–60 years, which might be a factor leading to reader disagreement in low grade sacroiliitis in women24. In our study, sclerosis was the most frequently reported lesion type by all readers among patients classified as not meeting the radiographic mNY.

A Dutch report on radiographic assessment of sacroiliitis by 100 rheumatologists and 23 radiologists showed only modest sensitivity and specificity for sacroiliitis and sizable intraobserver variation25. Evaluation of the same image set after 3–6 months upon individual training and workshops did not improve performance. However, no pairwise analysis among all possible reader pairs was performed as in our study, but the scores of an expert panel (2 rheumatologists, 1 epidemiologist, and 1 radiologist) served as gold standard. Future studies with pairwise analysis of all possible reader pairs involving both radiologists and rheumatologists are needed to determine whether training and calibration in recognition of various radiographic SIJ lesion types, especially erosion, might improve agreement in classification according to the radiographic mNY.

Our SIJ radiographs were acquired according to local protocols in 6 radiology centers resulting in 84.6% standard anteroposterior pelvic radiographs, the remaining being lumbar spine radiographs including the SIJ and 2 oblique SIJ projections. The different visualizations of the SIJ might have had an effect on reproducibility. The lack of a full calibration of the 2 musculoskeletal radiologists may have affected interreader agreement as well. However, both limitations regarding imaging protocols and reader calibration reflect the conditions in daily routine. Another potential limitation is that κ statistics inherently perform less well in cases of skewed distribution of the variables under observation26,27,28,29, as with our relatively low prevalence of mNY grades 3–4 of only 8.0%.
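The effect of a skewed distribution on κ can be illustrated with a small calculation. The sketch below is a simplification under stated assumptions: it treats 2 hypothetical readers who each classify the same share of patients as positive, and it plugs in the study's mean values (84.1% agreement, 15.7% mNY-positive), although the reported κ of 0.39 is a mean over 21 reader pairs with differing positive rates. The function names are hypothetical.

```python
def expected_agreement(p_pos_a, p_pos_b):
    """Chance agreement for two readers with the given positive rates."""
    return p_pos_a * p_pos_b + (1 - p_pos_a) * (1 - p_pos_b)

def kappa_from_agreement(p_obs, p_exp):
    """Cohen's kappa from observed and chance-expected agreement."""
    return (p_obs - p_exp) / (1 - p_exp)

p_obs = 0.841  # mean percent agreement for mNY reported in the Results

# Skewed case: both readers call 15.7% of patients mNY-positive.
kappa_skewed = kappa_from_agreement(p_obs, expected_agreement(0.157, 0.157))

# Balanced case: both readers call 50% of patients positive.
kappa_balanced = kappa_from_agreement(p_obs, expected_agreement(0.5, 0.5))
```

With the same observed agreement of 84.1%, the skewed case gives a κ of about 0.40 and the balanced case a κ of about 0.68. The illustration shows why a high percent agreement can coexist with only fair κ values when few patients are classified as positive.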

Reproducibility of SIJ classification according to the mNY in an SpA inception cohort was only fair to at best moderate among 7 radiology and rheumatology readers with varying experience in imaging in SpA. Erosion was the main driver of discordant classification. These findings question the applicability of the radiographic mNY in back pain patients clinically suspected of having early axSpA, particularly in healthcare settings where access to SIJ MRI is readily available.

Acknowledgment

The authors thank Laila Dungart and Henning Jakobsen from the Radiology Department at King Christian 10th Hospital for Rheumatic Diseases, Gråsten, Denmark, for anonymization and randomization of the pelvic radiographs; Lone Holm Hansen (LHH) for clinical evaluation of patients at Hospital Lillebaelt, Vejle, Denmark; Charlotte Drachmann and Lis Schubert at King Christian 10th Hospital for Rheumatic Diseases, Gråsten, Denmark, for high-sensitivity C-reactive protein and HLA-B27 analysis; Tue Secher Jensen at the Spine Centre of Southern Denmark, Denmark, for his role in the conception and design of the Spines of Southern Denmark Cohort; and the radiologic departments at these Danish hospitals for kindly providing the radiographs used in this study: Hospital Lillebaelt, Vejle; Odense University Hospital; Odense University Hospital at Svendborg Hospital; Hospital South West Jutland; Hospital of Nykøbing Falster; and King Christian 10th Hospital for Rheumatic Diseases, Gråsten.

Footnotes

  • Dr. Rufibach is founder and owner of Rufibach rePROstat and is an employee of F. Hoffmann-La Roche, Basel, Switzerland. The Hospital of Southern Jutland, University of Southern Denmark, Hospital Lillebaelt, Vejle, and Knud og Edith Eriksens Mindefond funded Dr. Christiansen’s salary during the course of a PhD program, including this study.

  • Accepted for publication August 31, 2016.

REFERENCES

  1. van der Linden S, Valkenburg HA, Cats A. Evaluation of diagnostic criteria for ankylosing spondylitis. A proposal for modification of the New York criteria. Arthritis Rheum 1984;27:361–8.
  2. Yazici H, Turunç M, Ozdoğan H, Yurdakul S, Akinci A, Barnes CG. Observer variation in grading sacroiliac radiographs might be a cause of ‘sacroiliitis’ reported in certain disease states. Ann Rheum Dis 1987;46:139–45.
  3. van den Berg R, Lenczner G, Feydy A, van der Heijde D, Reijnierse M, Saraux A, et al. Agreement between clinical practice and trained central reading in reading of sacroiliac joints on plain pelvic radiographs. Results from the DESIR cohort. Arthritis Rheumatol 2014;66:2403–11.
  4. Poddubnyy D, Rudwaleit M, Haibel H, Listing J, Märker-Hermann E, Zeidler H, et al. Rates and predictors of radiographic sacroiliitis progression over 2 years in patients with axial spondyloarthritis. Ann Rheum Dis 2011;70:1369–74.
  5. Deodhar A, Reveille JD, van den Bosch F, Braun J, Burgos-Vargas R, Caplan L, et al. The concept of axial spondyloarthritis: joint statement of the spondyloarthritis research and treatment network and the Assessment of SpondyloArthritis international Society in response to the US Food and Drug Administration’s comments and concerns. Arthritis Rheumatol 2014;66:2649–56.
  6. U.S. Food and Drug Administration, Department of Health & Human Services. Arthritis Advisory Committee Meeting: sBLA 125057/323: adalimumab for the treatment of active nonradiographic axial spondyloarthritis in adults with objective signs of inflammation by elevated C-reactive protein (CRP) or magnetic resonance imaging (MRI), who have had an inadequate response to, or are intolerant to, a nonsteroidal anti-inflammatory drug [Internet. Accessed August 31, 2016.] Available from: www.fda.gov/downloads/AdvisoryCommittees/CommitteesMeetingMaterials/Drugs/ArthritisAdvisoryCommittee/UCM361563.pdf
  7. U.S. Food and Drug Administration, Department of Health & Human Services. Arthritis Advisory Committee Meeting: sBLA 125160/215: Cimzia (certolizumab) for the treatment of active axial spondyloarthritis, including patients with ankylosing spondylitis [Internet. Accessed August 31, 2016.] Available from: www.fda.gov/downloads/AdvisoryCommittees/CommitteesMeetingMaterials/Drugs/ArthritisAdvisoryCommittee/UCM361565.pdf
  8. Arnbak B, Jensen TS, Egund N, Zejden A, Hørslev-Petersen K, Manniche C, et al. Prevalence of degenerative and spondyloarthritis-related magnetic resonance imaging findings in the spine and sacroiliac joints in patients with persistent low back pain. Eur Radiol 2016;26:1191–203.
  9. Arnbak B, Hendricks O, Hørslev-Petersen K, Jurik AG, Pedersen SJ, Østergaard M, et al. The discriminative value of inflammatory back pain in patients with persistent low back pain. Scand J Rheumatol 2016;45:321–8.
  10. Arnbak B, Grethe Jurik A, Hørslev-Petersen K, Hendricks O, Hermansen LT, Loft AG, et al. Associations between spondyloarthritis features and magnetic resonance imaging findings: a cross-sectional analysis of 1,020 patients with persistent low back pain. Arthritis Rheumatol 2016;68:892–900.
  11. Rudwaleit M, Jurik AG, Hermann KG, Landewé R, van der Heijde D, Baraliakos X, et al. Defining active sacroiliitis on magnetic resonance imaging (MRI) for classification of axial spondyloarthritis: a consensual approach by the ASAS/OMERACT MRI group. Ann Rheum Dis 2009;68:1520–7.
  12. Rudwaleit M, van der Heijde D, Landewé R, Listing J, Akkoc N, Brandt J, et al. The development of Assessment of SpondyloArthritis international Society classification criteria for axial spondyloarthritis (part II): validation and final selection. Ann Rheum Dis 2009;68:777–83.
  13. Sieper J, van der Heijde D, Landewé R, Brandt J, Burgos-Vargas R, Collantes-Estevez E, et al. New criteria for inflammatory back pain in patients with chronic back pain: a real patient exercise by experts from the Assessment of SpondyloArthritis international Society (ASAS). Ann Rheum Dis 2009;68:784–8.
  14. Kellgren JH, Jeffrey MR. The epidemiology of chronic rheumatism; volume 2: atlas of standard radiographs of arthritis. Oxford: Blackwell Scientific Publications; 1963:36–40.
  15. Conger AJ. Integration and generalization of kappas for multiple raters. Psychol Bull 1980;88:322–8.
  16. Landis JR, Koch GG. An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers. Biometrics 1977;33:363–74.
  17. Spoorenberg A, de Vlam K, van der Linden S, Dougados M, Mielants H, van de Tempel H, et al. Radiological scoring methods in ankylosing spondylitis. Reliability and change over 1 and 2 years. J Rheumatol 2004;31:125–32.
  18. Rudwaleit M, Haibel H, Baraliakos X, Listing J, Märker-Hermann E, Zeidler H, et al. The early disease stage in axial spondylarthritis: results from the German Spondyloarthritis Inception Cohort. Arthritis Rheum 2009;60:717–27.
  19. Ciurea A, Scherer A, Exer P, Bernhard J, Dudler J, Beyeler B, et al; Rheumatologists of the Swiss Clinical Quality Management Program for Axial Spondyloarthritis. Tumor necrosis factor alpha inhibition in radiographic and nonradiographic axial spondyloarthritis: results from a large observational cohort. Arthritis Rheum 2013;65:3096–106.
  20. van den Berg R, de Hooge M, van Gaalen F, Reijnierse M, Huizinga T, van der Heijde D. Percentage of patients with spondyloarthritis in patients referred because of chronic back pain and performance of classification criteria: experience from the Spondyloarthritis Caught Early (SPACE) cohort. Rheumatology 2013;52:1492–9.
  21. Moltó A, Paternotte S, van der Heijde D, Claudepierre P, Rudwaleit M, Dougados M. Evaluation of the validity of the different arms of the ASAS set of criteria for axial spondyloarthritis and description of the different imaging abnormalities suggestive of spondyloarthritis: data from the DESIR cohort. Ann Rheum Dis 2015;74:746–51.
  22. Kiltz U, Baraliakos X, Karakostas P, Igelmann M, Kalthoff L, Klink C, et al. The degree of spinal inflammation is similar in patients with axial spondyloarthritis who report high or low levels of disease activity: a cohort study. Ann Rheum Dis 2012;71:1207–11.
  23. van Hoeven L, Luime J, Han H, Vergouwe Y, Weel A. Identifying axial spondyloarthritis in Dutch primary care patients, ages 20–45 years, with chronic low back pain. Arthritis Care Res 2014;66:446–53.
  24. O’Shea FD, Boyle E, Salonen DC, Ammendolia C, Peterson C, Hsu W, et al. Inflammatory and degenerative sacroiliac joint disease in a primary back pain cohort. Arthritis Care Res 2010;62:447–54.
  25. van Tubergen A, Heuft-Dorenbosch L, Schulpen G, Landewé R, Wijers R, van der Heijde D, et al. Radiographic assessment of sacroiliitis by radiologists and rheumatologists: does training improve quality? Ann Rheum Dis 2003;62:519–25.
  26. Feinstein AR, Cicchetti DV. High agreement but low kappa: I. The problems of two paradoxes. J Clin Epidemiol 1990;43:543–9.
  27. Cicchetti DV, Feinstein AR. High agreement but low kappa: II. Resolving the paradoxes. J Clin Epidemiol 1990;43:551–8.
  28. Vach W. The dependence of Cohen’s kappa on the prevalence does not matter. J Clin Epidemiol 2005;58:655–61.
  29. Flight L, Julious SA. The disagreeable behaviour of the kappa statistic. Pharm Stat 2015;14:74–8.

In this issue

The Journal of Rheumatology
Vol. 44, Issue 1
1 Jan 2017
Limited Reliability of Radiographic Assessment of Sacroiliac Joints in Patients with Suspected Early Spondyloarthritis
Alice Ashouri Christiansen, Oliver Hendricks, Dorota Kuettel, Kim Hørslev-Petersen, Anne Grethe Jurik, Steen Nielsen, Kaspar Rufibach, Anne Gitte Loft, Susanne Juhl Pedersen, Louise Thuesen Hermansen, Mikkel Østergaard, Bodil Arnbak, Claus Manniche, Ulrich Weber
The Journal of Rheumatology Jan 2017, 44 (1) 70-77; DOI: 10.3899/jrheum.160079

Keywords

SPONDYLOARTHRITIS
RADIOGRAPHIC SACROILIITIS
INTERREADER AGREEMENT
MODIFIED NEW YORK CRITERIA

Copyright © 2022 by The Journal of Rheumatology Publishing Co. Ltd.
Print ISSN: 0315-162X; Online ISSN: 1499-2752