TY - JOUR T1 - Reproducibility of Joint Swelling Assessments in Long-lasting Rheumatoid Arthritis: Influence on Disease Activity Score-28 Values (SEA-Repro Study Part I) JF - The Journal of Rheumatology JO - J Rheumatol SP - 932 LP - 937 DO - 10.3899/jrheum.090879 VL - 37 IS - 5 AU - THIERRY MARHADOUR AU - SANDRINE JOUSSE-JOULIN AU - GÉRARD CHALÈS AU - LAURENT GRANGE AU - CÉCILE HACQUARD AU - DAMIEN LOEUILLE AU - JÉRÉMIE SELLAM AU - JEAN-DAVID ALBERT AU - JACQUES BENTIN AU - ISABELLE CHARY VALCKENAERE AU - MARIA-ANTONIETTA d’AGOSTINO AU - FABIEN ETCHEPARE AU - PHILIPPE GAUDIN AU - CHRISTOPHE HUDRY AU - MAXIME DOUGADOS AU - ALAIN SARAUX Y1 - 2010/05/01 UR - http://www.jrheum.org/content/37/5/932.abstract N2 - Objectives. To evaluate the reproducibility of clinical synovitis assessments in rheumatoid arthritis and the effect of variability on the Disease Activity Score-28 (DAS28). Methods. Seven healthcare professionals from different cities examined the same patients with active non-early rheumatoid arthritis (RA; duration > 4 yrs), for whom a treatment change was being considered. There was no training session and the examination was to be performed as quickly as possible. The healthcare professionals assessed the 28 joints of the DAS28 in 7 patients (196 joints), then reexamined the same 28 joints in 4 of these 7 patients (112 joints), who had been rendered unrecognizable. Then 7 sonographers examined each of the 7 patients twice, using B-mode and power Doppler ultrasound (PD). The reference standards were presence of synovitis according to at least 50% of clinical examiners and 50% of sonographers. Agreement was assessed by Cohen’s kappa statistic. Results. Intraobserver reliability ranged from 0.31 (least experienced research technician) to 0.77 (most experienced physician). Interobserver reliability ranged from 0.18 to 0.62. The largest difference between the lowest and the highest swollen joint counts in the same patient was 15, and the greatest variation in the DAS28 score was 0.92. Agreement between clinical and sonographic reference standards was 0.46, 0.37, and 0.36 for B-mode, PD, and both, respectively. Conclusion. Clinical inter- and intraobserver reliability is highly dependent on the examiner. Consequences on the DAS28 score can be substantial. Agreement with sonography is poor when both B-mode and PD are used but seems better, although low, when B-mode is used alone. ER -