RT Journal Article SR Electronic T1 Ultrasound as an Outcome Measure in Gout. A Validation Process by the OMERACT Ultrasound Working Group JF The Journal of Rheumatology JO J Rheumatol FD The Journal of Rheumatology SP jrheum.141294 DO 10.3899/jrheum.141294 A1 Terslev, Lene A1 Gutierrez, Marwin A1 Schmidt, Wolfgang A. A1 Keen, Helen I. A1 Filippucci, Emilio A1 Kane, David A1 Thiele, Ralf A1 Kaeley, Gurjit A1 Balint, Peter A1 Mandl, Peter A1 Delle Sedie, Andrea A1 Hammer, Hilde Berner A1 Christensen, Robin A1 Möller, Ingrid A1 Pineda, Carlos A1 Kissin, Eugene A1 Bruyn, George A. A1 Iagnocco, Annamaria A1 Naredo, Esperanza A1 D’Agostino, Maria Antonietta YR 2015 UL http://www.jrheum.org/content/early/2015/08/26/jrheum.141294.abstract AB Objective To summarize the work performed by the Outcome Measures in Rheumatology (OMERACT) Ultrasound (US) Working Group on the validation of US as a potential outcome measure in gout. Methods Based on the lack of definitions, highlighted in a recent literature review on US as an outcome tool in gout, a series of iterative exercises were carried out to obtain consensus-based definitions on US elementary components in gout using a Delphi exercise and subsequently testing these definitions in static images and in patients with proven gout. Cohen’s κ was used to test agreement, and values of 0–0.20 were considered poor, 0.20–0.40 fair, 0.40–0.60 moderate, 0.60–0.80 good, and 0.80–1 excellent. Results With an agreement of > 80%, consensus-based definitions were obtained for the 4 elementary lesions highlighted in the literature review: tophi, aggregates, erosions, and double contour (DC). In static images interobserver reliability ranged from moderate to almost perfect, and similar results were found for the intrareader reliability. In patients the intraobserver agreement was good for all lesions except DC (moderate). The interobserver agreement was poor for aggregates and DC but moderate for the other components. Conclusion These first steps in evaluating the validity of US as an outcome measure for gout show that the reliability of the definitions ranged from moderate to excellent in static images and somewhat lower in patients, indicating that a standardized scanning technique may be needed, before testing the responsiveness of those definitions in a composite US score.