TY - JOUR T1 - Identification of Axial Spondyloarthritis Patients in a Large Dataset: The Development and Validation of Novel Methods JF - The Journal of Rheumatology JO - J Rheumatol DO - 10.3899/jrheum.181005 SP - jrheum.181005 AU - Jessica A. Walsh AU - Shaobo Pei AU - Gopi Penmetsa AU - Jared Lareno Hansen AU - Grant W. Cannon AU - Daniel O. Clegg AU - Brian C. Sauer Y1 - 2019/03/15 UR - http://www.jrheum.org/content/early/2019/09/11/jrheum.181005.abstract N2 - Objective Observational axial spondyloarthritis (axSpA) research in large datasets has been limited by a lack of adequate methods for identifying patients with axSpA, because there are no billing codes in the United States for most subtypes of axSpA. The objective of this study was to develop methods to accurately identify patients with axSpA in a large dataset. Methods The study population included 600 chart-reviewed veterans, with and without axSpA, in the Veterans Health Administration between January 1, 2005, and June 30, 2015. AxSpA identification algorithms were developed with variables anticipated by clinical experts to be predictive of an axSpA diagnosis [demographics, billing codes, healthcare use, medications, laboratory results, and natural language processing (NLP) for key SpA features]. Random Forest and 5-fold cross validation were used for algorithm development and testing in the training subset (n = 451). The algorithms were additionally tested in an independent testing subset (n = 149). Results Three algorithms were developed: Full algorithm, High Feasibility algorithm, and Spond NLP algorithm. In the testing subset, the areas under the curve with the receiver-operating characteristic analysis were 0.96, 0.94, and 0.86, for the Full algorithm, High Feasibility algorithm, and Spond NLP algorithm, respectively. Algorithm sensitivities ranged from 85.0% to 95.0%, specificities from 78.0% to 93.6%, and accuracies from 82.6% to 91.3%. Conclusion Novel axSpA identification algorithms performed well in classifying patients with axSpA. These algorithms offer a range of performance and feasibility attributes that may be appropriate for a broad array of axSpA studies. Additional research is required to validate the algorithms in other cohorts. ER -