Abstract
Objective. To evaluate the relative level of influence of usual investigations in early arthritis on the diagnosis of rheumatoid arthritis (RA).
Methods. Patients: those included in the ESPOIR early arthritis cohort, a national cohort of patients with grade ≥ 2 synovitis for > 6 weeks and < 6 months. The diagnostic properties of variables assessed at baseline were measured against the diagnosis of RA defined by American College of Rheumatology criteria (at any timepoint between inclusion and 12-month followup) and expert opinion. Various models, including (1) clinical data; (2) clinical + radiographic data (plain radiographs); (3) addition of rheumatoid factor (RF) and/or anti-cyclic citrullinated peptide (anti-CCP); and (4) addition of HLA-DR typing, were assessed by comparing areas under the curves for ROC curves.
Results. Of 731 patients studied, 372 (50.9%) satisfied criteria for RA at 1 year. In univariate analysis, sensitivity was highest for distal articular presentation (94.6%), presence of IgM RF (69.4%), pain on metatarso-phalangeal squeeze test (66.1%), and presence of anti-CCP (65.6%); whereas specificity was highest for nodules (100%), HLA typing: shared-epitope double dose (95.9%), radiographic erosions (86.5%), and anti-CCP antibodies (86.4%). The most efficient model included swollen joint count, morning stiffness, erosions, RF, and anti-CCP. Adding rheumatoid nodules, C-reactive protein, or HLA-DR information was not contributive.
Conclusion. In addition to the clinical variables and radiographs, RF and/or anti-CCP are the single variables of interest that are contributive for the diagnosis of RA.
- RHEUMATOID ARTHRITIS
- DIAGNOSIS
- CRITERIA
- HLA
- RECEIVER-OPERATING CHARACTERISTIC CURVE
Rheumatoid arthritis (RA) is a frequent chronic inflammatory disease that can lead to severe morbidity1. It has been shown that early initiation of disease-modifying therapy is an important prognostic factor2,3. To this end, early diagnosis is important4. To date, the most widely validated5 and most frequently used criteria for the diagnosis of RA are the 1987 American College of Rheumatology (ACR) classification criteria6, pending further appraisal of the recently presented ACR/EULAR criteria7.
Many elements can contribute to the diagnosis of RA. These include (1) history and clinical examination, which have no specific cost but are time-consuming; (2) imaging to detect structural damage, such as through widely used standard radiographs; (3) biologic signs of autoimmunity: rheumatoid factor (RF) and anti-cyclic citrullinated protein (anti-CCP); and (4) genetics, such as human leukocyte antigen (HLA) typing. Although studies have confirmed the individual value of each of these elements8,9,10,11,12,13,14,15, to our knowledge there are few data in early arthritis regarding “diagnostic strategies,” i.e., assessing the relative influence on diagnosis of the different diagnostic elements. The only published studies in this regard are the Leiden strategies16,17,18. The relative results of different diagnostic elements is an important issue for several reasons; the first is expense: although clinical examination has no specific cost beyond the salary of the rheumatologist but takes time, biological tests such as autoimmunity tests and especially genetic typing, are costly. Radiographs, also costly, are justified not only for diagnosis but also as a predictive factor and for an ulterior comparison during followup19. The second reason to assess diagnostic tests is the need for rapid diagnosis in early arthritis2,3,4. Performing unnecessary diagnostic tests may lead to delay in diagnosis. Therefore determination of the most efficacious, but also most effective diagnostic strategy in early arthritis is important.
The objective of our study was to determine the relative diagnostic value of clinical, laboratory, and genetic elements for the diagnosis of RA, in early arthritis, using data from the French ESPOIR early arthritis cohort.
MATERIALS AND METHODS
Study design
The ESPOIR cohort (in French, the study and followup of early undifferentiated arthritis) is an ongoing, 10-year followup, multicenter early arthritis cohort20. With approval of the Montpellier University ethical committee, 16 university hospital rheumatology departments provided patients, covering a large part of the country. Clinical, laboratory, and imaging data were collected at baseline, then every 6 months for the first 2 years, then once a year. Data analyzed in the present study pertain to baseline and the first year of followup.
Participants
The inclusion criteria were the following: patient provided signed informed consent, was age 18–70 years, had 2 or more swollen joints, with a duration > 6 weeks and < 6 months, used no previous disease-modifying drugs and no steroids, and had no definite diagnosis of a disease other than RA or undifferentiated arthritis20. Thus, the ESPOIR cohort consists of both early undifferentiated inflammatory arthritis and recently developed RA.
Definition of outcome
The “gold standard” for the diagnosis of RA was the following: cumulative fulfillment of ACR classification criteria for RA6, i.e., ≥ 4 elements present out of a possible 7; AND investigator’s visual analog scale score (≥ 75/100) supporting a diagnosis of RA. The presence of each element of the ACR criteria was assessed cumulatively at baseline and at the 2 subsequent visits at 6 and 12 months of followup21,22.
Data collection
At baseline, an exhaustive data collection was performed according to recommendations in early arthritis23, including the following elements: (1) Demographic variables: age, sex, ethnicity, symptom duration. (2) Clinical history: mode of onset (constant vs intermittent), duration of morning stiffness, extraarticular symptoms. (3) Clinical examination: number and localization of painful and swollen joints (out of 28), with joints of the hands, wrists, and feet classified as distal articular presentation, induced pain by metatarso-phalangeal squeeze test, presence of nodules. (4) Radiographs: hand and wrist anteroposterior radiographs and feet anteroposterior and oblique views were taken at baseline and analyzed in each center by the investigator, in accord with usual practice. Patients’ radiographs were analyzed as: specific erosions (on hands and/or feet radiographs), yes/no. (5) Biology: acute-phase reactants, erythrocyte sedimentation rate (ESR) and C-reactive protein (CRP: positive cutoff 10 mg/l); IgM and IgA RF (ELISA, Marini, Paris, France; positive cutoff 9 IU/ml); anti-CCP2 antibodies (ELISA, DiaSorin, Antony, France; positive cutoff 50 U/ml). (6) HLA-DR typing: HLA-DRB1* genotypes were determined in each center, and analyzed as HLA-DRB1*01, and/or *04: presence of single dose or double dose of these alleles15.
Statistical analysis
Descriptive analysis: percentages were given for qualitative variables and mean and standard deviation for quantitative variables. Comparisons between patients with and without diagnosis of RA (as defined above) were performed using chi-square, Fisher’s exact test, or Student t test as appropriate.
Univariate analysis of diagnostic elements: all the diagnostic elements were put in binary form (using common clinical cutoffs or median value). For each element, the sensitivity, specificity, and accuracy were assessed with an exact 95% confidence interval (Clopper-Pearson method24).
Combination of diagnostic elements: 7 predetermined models were considered, combining different elements. Each model corresponded to a strategy. The first model included only clinical elements. In the second model, the radiographs were added. The third, fourth, and fifth models included not only the clinical signs and radiographs but also various biological variables (respectively, ESR and CRP, then adding RF or anti-CCP). The sixth model included all items from the fourth and fifth models, i.e., CRP, RF, and anti-CCP. The seventh and last model was the most complete, including HLA typing. For each model, the corresponding elements were introduced in a logistic regression model to predict the diagnosis of RA only if the p value in univariate analysis was < 0.20. A descending stepwise process was applied to keep only the relevant variables. The goodness-of-fit was checked using the Hosmer-Lemeshow statistic.
Evaluation of the diagnostic abilities of the 7 models: For each model and for each patient, a diagnostic score equal to the sum of the regression coefficients present in the patient at baseline was assessed. The global diagnostic ability of these scores was assessed by nonparametric receiver-operating characteristic (ROC) curves. The areas under the curve (AUC) and 95% confidence intervals were calculated25. AUC were compared using the nonparametric method of Delong, et al26 for paired data. The optimal cutoff was determined minimizing the number of misclassified patients27. For indicative purposes, sensitivity and specificity were assessed at these cutoffs.
For the model considered as optimal, points for a simplified prediction rule were derived from the regression coefficient and the validity of the simplified rule was assessed by comparing AUC of ROC curves.
In all the analyses, results were considered significant if p < 0.05. Analysis was performed using SAS version 9.1 and S-Plus version 8.
RESULTS
Patients’ characteristics
In all, 813 patients were included in the ESPOIR cohort. For the purpose of this study, the 731 patients who had complete data regarding the ACR diagnostic criteria for RA6 after 1 year were analyzed.
Patients’ characteristics are shown in Table 1. Mean (± SD) age at inclusion was 48 ± 12 years, 77% were female, 92% were Caucasian; mean synovitis duration at baseline was 149 ± 183 days. Symptom onset was rapid in 51%, insidious in 41%, and by flares in 8%. The mean swollen joint count was 7.2 ± 5.3 at inclusion.
Univariate analysis of the diagnostic elements
After 1 year, 372 patients (50.9%) fulfilled the definition of RA. Patients’ baseline characteristics and the sensitivity and specificity of variables of interest to predict the diagnosis of RA are summarized in Tables 1 and 2. In univariate analysis, sensitivity was highest for distal articular presentation (94.6%), presence of IgM RF (69.4%), pain on metatarso-phalangeal squeeze test (66.1%), presence of anti-CCP (65.6%), and morning stiffness > 30 minutes (65.6%); whereas specificity was highest for nodules (100%), HLA typing: shared-epitope double dose (95.9%), radiographic erosions (86.5%), and anti-CCP antibodies (86.4%).
Evaluation of diagnostic strategies
The variables composing the 7 models of increasing complexity (from clinical data only to clinical + radiographic + immunology + HLA typing data) are shown in Table 3, with the corresponding AUC of ROC curves, and the sensitivity, specificity and accuracy at the optimal cutoffs. Figure 1 shows the AUC increased slightly (but significantly, p = 0.01) from model 1 to model 2, but not from model 2 to model 3 (p = 0.51). Thus, radiographs appeared to be of diagnostic value whereas acute-phase reactants were not. An important gap was observed between models 3 and 4 (where RF was added): the AUC increased from 0.70 to 0.81 (p < 0.01), and acute-phase reactants disappeared from the model since they did not bring additional information in the multivariate model. Diagnostic properties of models including RF, anti-CCP, or RF + anti-CCP were globally similar (Table 3). The information brought by the HLA-DR typing was not contributive to the diagnosis, on top of the other tests, since the AUC of model 7 was not different from that of model 6 (p = 0.53).
Sensitivity was low for the clinical model, model 1 (51.3%, Table 3). It increased when radiographs were added (model 2: 60.4%), and by adding RF (from model 3 to model 4, 63.1% to 81.4%, respectively). Sensitivity was highest for model 4 comprising RF, and decreased when substituting anti-CCP for RF (model 5: 68.5%); the more complete models 6 and 7 did not reach the level of sensitivity of model 4 (73.6% and 70.2%, respectively). Specificity varied in the opposite way: it decreased slightly between the clinical model and the model with RF (models 1 to 4: 74.1% to 67.4%, respectively). It was higher for models 5, 6, and 7, which include anti-CCP (84.0%, 81.5%, and 87.5%).
Optimal model to predict RA
The model that may be considered optimal to predict RA was model 6, which includes clinical and radiographic data, as well as a combination of RF and anti-CCP (Table 4). A simplified score was derived from the regression coefficients, and Table 5 shows the observed percentage of patients who experienced progression in relation to the calculated prediction score. The simplified prediction model had good diagnostic properties: the AUC of the ROC curve was 0.84 (95% CI 0.81–0.87), therefore there was no loss of discriminative ability compared to model 6.
DISCUSSION
The comparative diagnostic values of clinical, biological, and radiological elements in early arthritis have been compared. Results indicate that the most effective combination for the diagnosis of early RA is the association of certain elements of anamnesis and clinical examination, with radiographs, RF, and/or anti-CCP. The added diagnostic value of assessing acute-phase reactants and HLA-DR typing was not evident in this study. RF and anti-CCP showed similar diagnostic properties; the addition of the 2 tests added slightly to the diagnostic properties and may be proposed where possible.
In clinical practice, patients presenting with early arthritis frequently have an undifferentiated disease that may progress to RA, or they may have a more benign disease course. The ACR criteria have been criticized for their low discriminative ability in patients presenting with recent-onset arthritis5. The recently presented ACR/EULAR criteria will hopefully have better discrimination in early disease. However, assessment of the value of each diagnostic element in early arthritis to identify patients who will progress to RA is needed, since recommendations strongly suggest that treatment is effective in the early phase of arthritis, before the disease is established23.
The small number of patients with erosive disease and of patients with nodules in our study influences the diagnostic capacities of these elements; however, our results are in keeping with those from other early RA cohorts28.
Erosions were searched for only with plain radiographs. Magnetic resonance imaging is a promising tool in this field29 and has been integrated into a prediction rule30. However, erosions seen on plain radiographs remain the gold standard and plain radiographs are the tool usually available in clinical practice. Further, the radiographic analyses could be discussed as the radiographs were analyzed globally (erosions yes/no) for the purposes of this study; thus, no complex scores were used31. This is also in keeping with daily practice situations. However, it is subject to potential bias, as the clinician may in fact score “typical erosion” when the diagnosis is evident11. Other predictive models have shown the importance of radiographic erosions in diagnosis16.
The gold standard used here for the diagnosis of RA should be discussed. The association of the ACR criteria assessed cumulatively22 with expert opinion aims at enhancing the diagnostic properties of the ACR criteria, as these properties are low in the context of early RA5. Using the ACR criteria as part of the definition may lead to incorporation bias, which results when the index test (prediction model) forms part of the reference standard, as is the case here since several significant variables are part of the ACR criteria. This may cause overestimation of the discriminative ability of the model32. To partly solve this problem, it was decided to associate expert opinion to the ACR criteria, although incorporation bias still exists in this case. Anti-CCP antibodies in our study have a rather low specificity (86%); however, this is in keeping with other studies where, for example, specificities of 88%30, 89%17, and 92%33 have been reported. Longer followup of the ESPOIR cohort is under way and will allow confirmation of the diagnoses.
This study has major strengths. The ESPOIR cohort is a national cohort of early arthritis20. Because of its entry criteria (more than 2 swollen joints for 6 weeks to 6 months), which are close to clinical practice, because of its large number of participants, and because of the extensive data collection at each visit, this cohort is well adapted to the objective of our study, with a good representation of patients with early arthritis. An early arthritis cohort such as ESPOIR may be better adapted to assess diagnostic values than an undifferentiated arthritis cohort excluding patients with RA32 since it corresponds to real-life situations. Further, the statistical analysis based on AUC of ROC curves is an interesting technique to compare the diagnostic properties of different strategies and may be used even when correlation between the items exists, as is the case here34. However, patients were treated during followup as deemed appropriate by their physicians, since ESPOIR is an observational cohort, and this could potentially modify the natural history of the disease, which should be taken into account. On the other hand, ESPOIR mimics natural conditions closely because of its observational design, which leads to better generalizability of the results.
To our knowledge, 2 other studies have assessed diagnostic capacities of various items in early RA16,17. In our study, the importance of the swollen joint count and of morning stiffness for diagnosis has been confirmed, as have radiographs and RF, whereas HLA typing, once again, was not of high diagnostic value16. HLA typing may, however, be of interest in individual cases, for example in certain patients with anti-CCP-negative early arthritis. The first Leiden study also found radiographs to be important16, but this was not evidenced in the second Leiden study17. In both of these models, RF and anti-CCP antibodies were both independent predictors, as in the present study.
We have assessed the sequential diagnostic value of various items for the diagnosis of RA using ROC curves. Results indicate the best diagnostic strategy involves clinical variables, radiographs, and RF/anti-CCP. Further followup of the ESPOIR cohort and of other early arthritis cohorts will allow longer-term determination of outcome and prognostic studies in this early arthritis population.
Acknowledgment
We thank S. Martin for the centralized immunology testing, and the ESPOIR steering committee and all the ESPOIR investigators for active patient recruitment.
Footnotes
-
The ESPOIR cohort was supported by an unrestricted grant from Merck Sharp and Dohme. Two additional grants from INSERM supported the biological database. The French Society of Rheumatology, Abbott, Amgen, and Wyeth supported the ESPOIR cohort study.
- Accepted for publication July 27, 2010.
APPENDIX
List of study collaborators. Investigators of the French Early Arthritis Cohort ESPOIR: F. Berenbaum, Paris-Saint Antoine; M.C. Boissier, Paris-Bobigny; A. Cantagrel, Toulouse; B. Combe, Montpellier; M. Dougados, Paris-Cochin; P. Fardelonne, Amiens; B. Fautrel, P. Bourgeois, Paris-La Pitié; R.M. Flipo, Lille; P. Goupille, Tours; F. Liote, Paris-Lariboisière; X. Le Loet, Rouen; X. Mariette, Paris Bicetre; O. Meyer, Paris Bichat; A. Saraux, Brest; T. Schaeverbeke, Bordeaux; and J. Sibilia, Strasbourg.
REFERENCES
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
- 16.
- 17.
- 18.
- 19.
- 20.
- 21.
- 22.
- 23.
- 24.
- 25.
- 26.
- 27.
- 28.
- 29.
- 30.
- 31.
- 32.
- 33.
- 34.