Identifying Patients With Axial Spondyloarthritis in Large Datasets: Expanding Possibilities for Observational Research

Jessica A. Walsh; Shaobo Pei; Gopi K. Penmetsa; Rebecca S. Overbury; Daniel O. Clegg; Brian C. Sauer

doi:10.3899/jrheum.200570

Abstract

Objective Observational research of axial spondyloarthritis (axSpA) is limited by a lack of methods for identifying diverse axSpA phenotypes in large datasets. Algorithms were previously designed to identify a broad spectrum of patients with axSpA, including patients not identifiable with diagnosis codes. The study objective was to estimate the performance of axSpA identification methods in the general Veterans Affairs (VA) population.

Methods A patient sample with known axSpA status (n = 300) was established with chart review. For feasibility, this sample was enriched with veterans with axSpA risk factors. Algorithm performance outcomes included sensitivities, positive predictive values (PPV), and F1 scores (an overall performance metric combining sensitivity and PPV). Performance was estimated with unweighted outcomes for the axSpA-enriched sample and inverse probability weighted (IPW) outcomes for the general VA population. These outcomes were also assessed for traditional identification methods using diagnosis codes for the ankylosing spondylitis (AS) subtype of axSpA.

Results The mean age was 54.7 and 92% were male. Unweighted F1 scores (0.59–0.74) were higher than IPW F1 scores (0.48–0.65). The full algorithm had the best overall performance (F1_IPW 0.65). The Early Algorithm was the most inclusive (sensitivity_IPW 0.90, PPV_IPW 0.38). The traditional method using ≥ 2 AS diagnosis codes from rheumatology had the highest PPV (PPV_IPW 0.84, sensitivity_IPW 0.34).

Conclusion The axSpA identification methods demonstrated a range of performance attributes in the general VA population that may be appropriate for various types of studies. The novel identification algorithms may expand the scope of research by enabling identification of more diverse axSpA populations.

Key Indexing Terms:

Observational research in large populations is important for studying uncommon outcomes and diseases in real-world settings¹. In the field of axial spondyloarthritis (axSpA), there is a dearth of big data research due to challenges with identifying axSpA patients in large datasets. With advancements in imaging and treatment, it has become apparent that many patients with axSpA were previously unrecognized with traditional concepts of disease^2,3. Despite our broader understanding of axSpA, big data axSpA research continues to be constrained by outdated axSpA definitions, since International Classification of Diseases, 9th and 10th revisions (ICD-9 and ICD-10, respectively) billing codes exist only for a single axSpA phenotype: ankylosing spondylitis (AS)^4,5,6,7,8. As such, around half of the approximately 3.3 million Americans with axSpA have been excluded from big data axSpA research^9,10, and data are sparse with important outcomes such as mortality, comorbidities, and treatment patterns in real-world axSpA populations^8,11.

Even within the narrower AS disease spectrum, traditional methods for identifying patients are limited. AS research in large datasets typically relies on AS ICD codes^12,13,14,15. However, AS ICD codes may not perform well for case identification in the United States¹⁶. Other methods may improve accuracy, such as supplementing AS ICD codes with chart review; however, studies requiring chart review have limited sample sizes since chart review is impractical for thousands of patients. Further, chart review is not possible in many large datasets since access to patient-level data is restricted. Other approaches may improve specificity, such as requiring an AS medication or a rheumatology visit in conjunction with AS diagnosis codes, but these approaches fail to capture important subsets of AS patients (i.e., untreated patients or patients receiving AS care from a primary care provider).

To address limitations with axSpA patient identification in large datasets, we developed algorithms that use structured and unstructured data to identify axSpA patients in national Veterans Health Administration (VHA) datasets¹⁷. These algorithms performed well in a sample of veterans enriched with risk factors for axSpA (area under the curve [AUC] 0.86–0.96)¹⁸. A similar approach was applied to a Boston-based population enriched with axSpA patients, and the results were similar (AUC 0.80–0.93)¹⁹. However, the axSpA-enriched populations are expected to be different from general (unenriched) populations, and the ability of the algorithms to accurately identify patients with axSpA in the general population was unknown. The goal of this study was to estimate the performance of axSpA identification algorithms in the general Veteran population.

MATERIALS AND METHODS

Data sources. This study used historical data from veterans enrolled in the VHA. The data source was the Corporate Data Warehouse (CDW), a national repository of data from the VHA medical record system (VistA) and other VHA clinical and administrative systems². Data were housed and analyzed within the Veterans Affairs Informatics and Computing Infrastructure (VINCI)²⁰. This research was conducted in compliance with the Declaration of Helsinki, with the approval of the University of Utah Institutional Review Board (IRB_00052363).

AxSpA identification methods. Five methods for identifying axSpA in large datasets were assessed (Table 1). These included 3 novel methods developed by our team: the Full Algorithm, the High Feasibility Algorithm, and the Early Algorithm. We also assessed 2 traditional methods that have been used in axSpA epidemiologic studies including ≥ 2 AS ICD codes ≥ 7 days apart from any source (AS codes, any specialty) and ≥ 2 AS ICD codes ≥ 7 days apart from a rheumatology encounter (AS codes, rheumatology). Details about algorithm development were previously published^18,21. In brief, the Full Algorithm is the most comprehensive, with 3 natural language processing (NLP) models²² and 46 coded variables. The High Feasibility Algorithm included the 16 top-ranked coded variables. The Early Algorithm is similar to the Full Algorithm, except SpA ICD codes and rheumatology visits were not included, to enhance identification of earlier disease or less classic phenotypes. Random forest and 5-fold cross-validation were used to develop and test the algorithms^23,24,25,26.

View this table:

Table 1.

Methods for identifying axial SpA.

Population. To assess performance of the identification methods, an independent sample of 300 veterans with known axSpA status (axSpA vs no axSpA) was established with chart review. Since it was not feasible to chart review the tens of thousands of patients that would have been required to randomly capture a sufficient number of axSpA patients, we enriched this chart review sample with veterans at elevated risk for axSpA.

We used the Full Algorithm to estimate axSpA risk. Random forest cutoff scores from the Full Algorithm were used to quantify axSpA risk and assign each veteran to a risk quartile (i.e., cutoff scores of 0.75–1.00 for the highest risk quartile, 0.50–0.75 for the second highest, 0.25–0.50 for the third highest, and 0.0–0.25 for the lowest). Veterans were selected from each risk quartile for chart review. Rheumatologist chart reviewers used annotation software (eHOST²⁷) to review and classify the 300 sampled veterans as having or not having axSpA, according to expert opinion and our previously published axSpA chart review guidelines^18,21.

Inverse probability weighting for estimating algorithm performance in the general population. The performance estimates in axSpA-enriched populations are subject to partial verification bias²⁰. Verification bias occurs when the reference standard has not been carried out in all patients because of ethical or practical reasons²⁸ (i.e., the algorithms were developed in an axSpA-enriched sample rather than the general Veterans Affairs [VA] population). Verification is known to lead to biased accuracy estimates. To correct for verification bias and understand how the algorithms are expected to function in the general VA population, we applied inverse probability weighting (IPW). IPW enables calculating statistics standardized to a target population (general VHA population) that is different from the population in which the data were collected (axSpA-enriched sample)²⁹. General population-level statistics were calculated by applying sampling weights based on the distribution of patients within each risk quartile (Supplementary Table 1, available with the online version of this article) and bootstrapping to obtain 95% CI.

Other statistics. Means, percentages, SD, and 95% CI were used to compare the subset of randomly selected veterans to the sample selected for chart review. Algorithm performance was evaluated with sensitivity, positive predictive values (PPV), and F1 scores, with and without IPW. The F1 score is an overall measure of performance that considers both the PPV (precision) and sensitivity (recall)³⁰ . F1 is useful for assessing performance in low-prevalence conditions. F1 scores were used to select the optimal cutoff scores for determining a positive vs negative outcome with the algorithms. As expected in a low-prevalence disease, specificity, negative predictive value, accuracy, and receiver-operator curve analysis were not useful for differentiating between axSpA identification methods, since the differences between algorithms with each of these outcomes was very small.

RESULTS

Population. We randomly selected 150,000 veterans participating in the national VHA system between January 1, 2007, and June 30, 2017 (Figure 1). After excluding Veterans with < 2 outpatient encounters ≥ 7 days apart, there were 79,826 remaining veterans. They were classified into quartiles for axSpA risk determined by probabilities calculated from the Full Algorithm. Sixty-one patients were classified in the first (highest risk) quartile, 65 in the second quartile, 276 in the third quartile, and 79,424 in the fourth quartile. All patients in the first and second quartiles were selected for chart review. From the third quartile, 74 of the 276 patients were randomly selected for chart review, and from the fourth quartile, 100 of 79,424 patients were randomly selected for chart review. In total, 300 patients were selected for chart review.

Figure 1.

Selection of patient sample for chart review. axSpA: axial spondyloarthritis; VHA: Veterans Health Administration.

Patient features were compared between veterans from the general population (with ≥ 2 encounters in the VHA system ≥ 7 days apart) vs patients selected to the chart review study sample (Table 2). Compared to the general VA population, the chart review patients were younger (54.7 vs 58.1 yrs) and had a higher comorbidity burden (Charlson comorbidity index 2.6 vs 2.1). The percentage of veterans with a clinically available positive HLA-B27 test result was higher in the chart review sample than the general VA population (8.0% vs 0.09%). Low back pain was more common in the chart review sample than the general VA population (12.0% vs 7.1%), and exposure to disease-modifying antirheumatic drugs (DMARD) was higher in the chart review sample (nonbiologic: 21.3% vs 1.6%, biologic: 22.0% vs 0.5%). The chart review patients were enrolled in the VA for a longer duration (7.7 vs 6.5 yrs). The chart review patients had more encounters within the VA system per year than the general VA population (37.9 vs 26.9), where VA system encounters were defined as provider visits and ancillary services (pharmacy, laboratory, radiology, physical therapy, social work, nutrition, chiropractic services, etc.). Per chart review, 30.3% of veterans in the chart review patients were classified as having axSpA. According to the 5 axSpA identification methods, 0.06–0.51% were classified as having axSpA in the general VA population vs 16.3–52.7% in the chart review sample.

View this table:

Table 2.

Characteristics and axSpA classifications of the general VHA population and the chart review sample.

Performance of identification methods. For the Full Algorithm, Early Algorithm, and High Feasibility Algorithm, random forest cutoff scores (for classifying patients as having or not having axSpA) were evaluated with sensitivity, PPV, and F1 scores (exemplified with the Full Algorithm in Figure 2). We selected optimal cutoff scores of 0.50 for all 3 algorithms. These scores were applied to subsequent analyses.

Figure 2.

Random forest cutoff scores for full algorithm. PPV: positive predictive value.

The unweighted sensitivities of the 5 identification methods ranged from 0.45 to 0.92 (Figure 3). The unweighted PPV ranged from 0.53 to 0.84, and the unweighted F1 scores ranged from 0.59 to 0.74. With IPW, the weights assigned to each risk quartile were proportional to the number of patients from the general veteran population in each risk quartile (first quartile 1.0, second quartile 1.0, third quartile 3.7, fourth quartile 794.2; Supplementary Table 1, available with the online version of this article). After applying IPW, the sensitivities of the 5 identification methods ranged from 0.34 to 0.90 (Figure 4). The Early Algorithm had statistically higher sensitivity (nonoverlapping 95% CI) compared to the High Feasibility Algorithm, AS codes (any specialty), and AS Codes (rheumatology). The PPV ranged from 0.38–0.84. The AS codes (rheumatology) had statistically higher PPV than the other 4 identification methods. F1 scores ranged from 0.48 to 0.65. The Full Algorithm had the numerically highest F1 score, but the F1 scores for all methods were statistically similar.

Figure 3.

Performance of identification methods in the axSpA-enriched population, as measured by unweighted sensitivity, PPV, and F1 scores. AS: ankylosing spondylitis; axSpA: axial spondyloarthritis; PPV: positive predictive value.

Figure 4.

Performance estimates of identification methods for the general veteran population, as measured with inverse probability weighted sensitivity, PPV, and F1 scores. AS: ankylosing spondylitis; PPV: positive predictive value.

DISCUSSION

We developed novel methods for identifying axSpA patients in national VHA datasets. Previous validation work with axSpA identification algorithms demonstrated excellent performance in axSpA-enriched populations^18,19. This study is important for understanding how the algorithms are estimated to perform in the general veteran population with future studies. The differences in baseline population characteristics (axSpA-enriched vs general VHA) and outcomes (unweighted vs IPW) demonstrate the need to account for verification bias that can occur when the reference population differs from the target population. We addressed verification bias with IPW and found lower algorithm performance estimates with IPW outcomes than unweighted outcomes. While the unweighted outcomes reflect algorithm performance in the axSpA-enriched population (reference population), the IPW outcomes reflect algorithm performance in the general VA population (target population).

The 5 axSpA identification methods evaluated with IPW for use in the general VA population demonstrate various performance attributes. While there is no consensus on optimal methods for axSpA cohort identification, the use of ≥ 2 AS diagnosis codes from any specialty is commonly used^14,15,31,32. This method has the advantages of being simple and inclusive of patients who may not be attending rheumatology clinics. However, our data suggest that the performance of this method is suboptimal for the general VA population; the 55% sensitivity demonstrated that approximately one-half of axSpA veterans remained unidentified by this method, and the 54% PPV demonstrated that approximately one-half of veterans identified by this method did not have axSpA. A higher PPV (66%) was reported in 102 chart-reviewed patients in a Kaiser Permanente study, but sensitivity could not be determined, since patients without AS diagnosis codes were not reviewed¹⁵.

The highest PPV (84%) occurred with the other traditional method evaluated in this study (≥ 2 AS diagnosis codes from a rheumatology clinic). This PPV was in the range of other studies reporting a PPV of 81%²⁹ and 100%³³. The sensitivity estimate, available in the latter study, was higher than in our study (82% vs 34%, respectively). This difference was likely due to the inclusion of all axSpA phenotypes in our study vs only patients classified as AS in the previous study. The method using ≥ 2 AS codes from rheumatology may be appropriate for studies for which a high confidence in axSpA diagnosis is prioritized over inclusivity of diverse axSpA patients. An example of such a study may involve comparing outcomes in axSpA vs mechanical back pain patients.

In contrast to the method requiring AS diagnosis codes from rheumatology, the Early Algorithm had a high sensitivity (90%) and lower PPV (38%). This was expected since the Early Algorithm was designed to enhance identification of patients with early disease and less classic phenotypes. This identification method is unique in that neither SpA diagnosis codes nor rheumatology encounters are considered in the determination of axSpA risk. The Early Algorithm may be particularly useful for screening studies designed to identify undiagnosed or untreated axSpA patients.

The Full Algorithm had the numerically highest overall performance (F1 scores: 0.65 vs 0.48–0.54). This was expected since the Full Algorithm included the largest number of variables. The Full Algorithm may be considered the best method for studies in which sensitivity and PPV are equally prioritized.

The High Feasibility Algorithm was designed to be less resource-intensive than the Full Algorithm and more accurate than traditional axSpA identification methods. Unfortunately, the High Feasibility Algorithm did not perform better than traditional methods. Since the High Feasibility Algorithm is more resource intensive than the traditional methods, it is unlikely to be used for axSpA identification in large datasets.

Strengths of this study include the development of a method for identifying axSpA patients who are at high risk for being missed with alternative identification methods. In particular, the Early Algorithm does not consider SpA diagnosis codes or rheumatology visits when calculating axSpA risk for classification assignments. Thus, this method may enable previously impractical research in patients with early disease or less classic phenotypes. Other strengths include extensively studied methodology for the algorithm development, including deep evaluations of NLP model performance, comparisons of different methods for algorithm development (e.g., random forest vs lasso vs K-nearest neighbor, vs a combination of these methods called SuperLearner), and well-characterized chart review processes completed by rheumatologists specializing in axSpA^18,21,26. The methods developed with this research will enable further refinement of the algorithms (i.e., additional NLP variables) and identification of patients at high risk of undiagnosed or unaddressed axSpA who may be recommended for a rheumatologic evaluation.

In summary, there are important unmet needs for better axSpA identification methods for real-world research across the broad spectrum of axSpA patients. There are growing opportunities to advance observational research with large data resources and bioinformatics advancements. This study demonstrates that these resources can be successfully used to improve research methods in axSpA. Further, the novel identification algorithms evaluated in this study may expand the scope of observational research in diverse axSpA populations.

Footnotes

The authors declare no conflicts of interest.
Full Release Article. For details see Reprints and Permissions at jrheum.org.
This study was supported by the Marriott Daughters Foundation and Rheumatology Research Foundation.

Accepted for publication August 13, 2020.

REFERENCES

1.↵
1. Lee CH,
2. Yoon HY.
Medical big data: promise and challenges. Kidney Res Clin Pract 2017;36:3-11.
OpenUrl CrossRef PubMed
2.↵
1. Garg N,
2. van den Bosch F,
3. Deodhar A.
The concept of spondyloarthritis: where are we now? Best Pract Res Clin Rheumatol 2014;28:663-72.
OpenUrl CrossRef PubMed
3.↵
1. Lubrano E,
2. De Socio A,
3. Perrotta FM.
Unmet needs in axial spondyloarthritis. Clin Rev Allergy Immunol 2018;55:332-9.
OpenUrl
4.↵
1. Walsh JA,
2. Adejoro O,
3. Chastek B,
4. Park Y.
Treatment patterns of biologics in US patients with ankylosing spondylitis: descriptive analyses from a claims database. J Comp Eff Res 2018;7:369-80.
OpenUrl
5.↵
1. Deodhar A,
2. Mittal M,
3. Reilly P,
4. Bao Y,
5. Manthena S,
6. Anderson J, et al.
Ankylosing spondylitis diagnosis in US patients with back pain: identifying providers involved and factors associated with rheumatology referral delay. Clin Rheumatol 2016;35:1769-76.
OpenUrl CrossRef PubMed
6.↵
1. Lu MC,
2. Koo M,
3. Lai NS.
Incident spine surgery in patients with ankylosing spondylitis: a secondary cohort analysis of a nationwide, population-based health claims database. Arthritis Care Res 2018;70:1416-20.
OpenUrl
7.↵
1. Wysham KD,
2. Murray SG,
3. Hills N,
4. Yelin E,
5. Gensler LS.
Cervical Spinal Fracture and Other Diagnoses Associated With Mortality in Hospitalized Ankylosing Spondylitis Patients. Arthritis Care Res 2017;69:271-7.
OpenUrl
8.↵
1. Wang R,
2. Ward MM.
Epidemiology of axial spondyloarthritis: an update. Curr Opin Rheumatol 2018;30:137-43.
OpenUrl CrossRef PubMed
9.↵
1. Baraliakos X,
2. Braun J.
Non-radiographic axial spondyloarthritis and ankylosing spondylitis: what are the similarities and differences? RMD Open;1 Suppl 1:e000053.
10.↵
1. Reveille JD,
2. Weisman MH.
The epidemiology of back pain, axial spondyloarthritis and HLA-B27 in the United States. Am J Med Sci 2013;345:431-6.
OpenUrl CrossRef PubMed
11.↵
1. Walsh JA,
2. Zhou X,
3. Clegg DO,
4. Teng C,
5. Cannon GW,
6. Sauer B.
Mortality in American Veterans with the HLA-B27 gene. J Rheumatol 2015;42:638-44.
OpenUrl Abstract/FREE Full Text
12.↵
1. Walsh J,
2. Hunter T,
3. Schroeder K,
4. Sandoval D,
5. Bolce R.
Trends in diagnostic prevalence and treatment patterns of male and female ankylosing spondylitis patients in the United States, 2006-2016. BMC Rheumatol 2019;3:39.
OpenUrl
13.↵
1. Walsh JA,
2. Song X,
3. Kim G,
4. Park Y.
Evaluation of the comorbidity burden in patients with ankylosing spondylitis treated with tumour necrosis factor inhibitors using a large administrative data set. J Pharm Health Serv Res 2018;9:115-21.
OpenUrl
14.↵
1. Sloan VS,
2. Sheahan A,
3. Stark JL,
4. Suruki RY.
Opioid use in patients with ankylosing spondylitis is common in the United States: outcomes of a retrospective cohort study. J Rheumatol 2019;46:1450-7.
OpenUrl Abstract/FREE Full Text
15.↵
1. Curtis JR,
2. Harrold LR,
3. Asgari MM,
4. Deodhar A,
5. Salman C,
6. Gelfand JM, et al.
Diagnostic prevalence of ankylosing spondylitis using computerized health care data, 1996 to 2009: underrecognition in a US health care setting. Perm J 2016;20:15-151.
OpenUrl CrossRef PubMed
16.↵
1. Zhao SS,
2. Ermann J,
3. Xu C,
4. Lyu H,
5. Tedeschi SK,
6. Liao KP, et al.
Comparison of comorbidities and treatment between ankylosing spondylitis and non-radiographic axial spondyloarthritis in the United States. Rheumatology 2019;58:2025-30.
OpenUrl
17.↵
1. Walsh JA,
2. Rozycki M,
3. Yi E,
4. Park Y.
Application of machine learning in the diagnosis of axial spondyloarthritis. Curr Opin Rheumatol 2019;31:362-36.
OpenUrl PubMed
18.↵
1. Walsh JA,
2. Pei S,
3. Penmetsa G,
4. Hansen JL,
5. Cannon GW,
6. Clegg DO, et al.
Identification of axial spondyloarthritis patients in a large dataset: the development and validation of novel methods. J Rheumatol 2020;47:42-9.
OpenUrl Abstract/FREE Full Text
19.↵
1. Zhao SS,
2. Hong C,
3. Cai T,
4. Xu C,
5. Huang J,
6. Ermann J, et al.
Incorporating natural language processing to improve classification of axial spondyloarthritis using electronic health records. Rheumatology 2020;59:1059-65.
OpenUrl
20.↵
1. O’Sullivan JW,
2. Banerjee A,
3. Heneghan C,
4. Pluddemann A.
Verification bias. BMJ Evid Based Med 2018;23:54-5.
OpenUrl Abstract/FREE Full Text
21.↵
1. Walsh JA,
2. Pei S,
3. Penmetsa GK,
4. Leng J,
5. Cannon GW,
6. Clegg DO, et al.
Cohort identification of axial spondyloarthritis in a large healthcare dataset: current and future methods. BMC Musculoskelet Disord 2018;19:317.
OpenUrl
22.↵
1. Walsh JA,
2. Shao Y,
3. Leng J,
4. He T,
5. Teng CC,
6. Redd D, et al.
Identifying axial spondyloarthritis in electronic medical records of US veterans. Arthritis Care Res 2017;69:1414-20.
OpenUrl
23.↵
1. Breiman, L.
(2001) Random forests. Machine Learning 2001; 45:5-32.
OpenUrl CrossRef
24.↵
1. Touw WG,
2. Bayjanov JR,
3. Overmars L,
4. Backus L,
5. Boekhorst J,
6. Wels M, et al.
Data mining in the life sciences with Random Forest: a walk in the park or lost in the jungle? Brief Bioinform 2013; 14:315-26.
OpenUrl CrossRef PubMed
25.↵
1. Lin WJ,
2. Chen JJ.
Class-imbalanced classifiers for high-dimensional data. Brief Bioinform 2013;14:13-26.
OpenUrl CrossRef PubMed
26.↵
1. Rodríguez JD,
2. Pérez A,
3. Lozano JA.
Sensitivity analysis of kappa-fold cross validation in prediction error estimation. IEEE Trans Pattern Anal Mach Intell 2010;32:569-75.
OpenUrl PubMed
27.↵
1. Leng CJ,
2. South B,
3. Shuying S.
eHOST: The Extensible Human Oracle Suite of Tools. [Internet. Accessed December 15, 2020]. Available from: orbit.nlm.nih.gov/browse-repository/software/nlp-information-extraction/62-ehost-the-extensible-human-oracle-suite-of-tools
28.↵
1. de Groot JA,
2. Janssen KJM,
3. Zwinderman AH,
4. Bossuyt PMM,
5. Reitsma JB,
6. Karel Moons KGM.
Correcting for partial verification bias: a comparison of methods. Ann Epidemiol 2011;21:139-48.
OpenUrl CrossRef PubMed
29.↵
1. Peters CB,
2. Hansen JL,
3. Halwani A,
4. Cho ME,
5. Leng J,
6. Huynh T, et al.
Validation of algorithms used to identify red blood cell transfusion related admissions in veteran patients with end stage renal disease. EGEMS 2019;7:23.
OpenUrl
30.↵
1. Sasaki Y.
The truth of the F-measure. Teach Tutor mater 2007;1:1-5.
OpenUrl
31.↵
1. Walsh JA,
2. Pei S,
3. Penmetsa GK,
4. Sauer BC,
5. Patil V,
6. Walker JH, et al.
Treatment patterns with disease-modifying antirheumatic drugs in U.S. veterans with newly diagnosed rheumatoid arthritis, psoriatic arthritis, or ankylosing spondylitis. J Manag Care Spec Pharm 2019;25:1218-28.
OpenUrl
32.↵
1. Wu LC,
2. Leong PY,
3. Yeo KJ,
4. Li TY,
5. Wang YH,
6. Chiou JY, et al.
Celecoxib and sulfasalazine had negative association with coronary artery diseases in patients with ankylosing spondylitis: a nation-wide, population-based case-control study. Medicine 2016;95:e4792.
OpenUrl
33.↵
1. Singh JA,
2. Holmgren AR,
3. Krug H,
4. Noorbaloochi S.
Accuracy of the diagnoses of spondylarthritides in veterans affairs medical center databases. Arthritis Rheum 2007; 15;57:648-55.
OpenUrl
34.
1. Quan H,
2. Sundararajan V,
3. Halfon P,
4. Fong A,
5. Burnand B,
6. Luthi JC, et al.
Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data. Med Care 2005;43:1130-9.
OpenUrl CrossRef PubMed

ONLINE SUPPLEMENT

Supplementary material accompanies the online version of this article.

In this issue

Download PDF

Bookmark this article

Keywords

ANKYLOSING SPONDYLITIS

COHORT STUDY

EPIDEMIOLOGY

SPONDYLOARTHRITIS

Cited By...

More in this TOC Section

Show more Spondyloarthritis

Keywords

[1] 1.↵
Lee CH,
Yoon HY.
Medical big data: promise and challenges. Kidney Res Clin Pract 2017;36:3-11.
OpenUrl CrossRef PubMed

[2] Lee CH,

[3] Yoon HY.

[4] 2.↵
Garg N,
van den Bosch F,
Deodhar A.
The concept of spondyloarthritis: where are we now? Best Pract Res Clin Rheumatol 2014;28:663-72.
OpenUrl CrossRef PubMed

[5] Garg N,

[6] van den Bosch F,

[7] Deodhar A.

[8] 3.↵
Lubrano E,
De Socio A,
Perrotta FM.
Unmet needs in axial spondyloarthritis. Clin Rev Allergy Immunol 2018;55:332-9.
OpenUrl

[9] Lubrano E,

[10] De Socio A,

[11] Perrotta FM.

[12] 4.↵
Walsh JA,
Adejoro O,
Chastek B,
Park Y.
Treatment patterns of biologics in US patients with ankylosing spondylitis: descriptive analyses from a claims database. J Comp Eff Res 2018;7:369-80.
OpenUrl

[13] Walsh JA,

[14] Adejoro O,

[15] Chastek B,

[16] Park Y.

[17] 5.↵
Deodhar A,
Mittal M,
Reilly P,
Bao Y,
Manthena S,
Anderson J, et al.
Ankylosing spondylitis diagnosis in US patients with back pain: identifying providers involved and factors associated with rheumatology referral delay. Clin Rheumatol 2016;35:1769-76.
OpenUrl CrossRef PubMed

[18] Deodhar A,

[19] Mittal M,

[20] Reilly P,

[21] Bao Y,

[22] Manthena S,

[23] Anderson J, et al.

[24] 6.↵
Lu MC,
Koo M,
Lai NS.
Incident spine surgery in patients with ankylosing spondylitis: a secondary cohort analysis of a nationwide, population-based health claims database. Arthritis Care Res 2018;70:1416-20.
OpenUrl

[25] Lu MC,

[26] Koo M,

[27] Lai NS.

[28] 7.↵
Wysham KD,
Murray SG,
Hills N,
Yelin E,
Gensler LS.
Cervical Spinal Fracture and Other Diagnoses Associated With Mortality in Hospitalized Ankylosing Spondylitis Patients. Arthritis Care Res 2017;69:271-7.
OpenUrl

[29] Wysham KD,

[30] Murray SG,

[31] Hills N,

[32] Yelin E,

[33] Gensler LS.

[34] 8.↵
Wang R,
Ward MM.
Epidemiology of axial spondyloarthritis: an update. Curr Opin Rheumatol 2018;30:137-43.
OpenUrl CrossRef PubMed

[35] Wang R,

[36] Ward MM.

[37] 9.↵
Baraliakos X,
Braun J.
Non-radiographic axial spondyloarthritis and ankylosing spondylitis: what are the similarities and differences? RMD Open;1 Suppl 1:e000053.

[38] Baraliakos X,

[39] Braun J.

[40] 10.↵
Reveille JD,
Weisman MH.
The epidemiology of back pain, axial spondyloarthritis and HLA-B27 in the United States. Am J Med Sci 2013;345:431-6.
OpenUrl CrossRef PubMed

[41] Reveille JD,

[42] Weisman MH.

[43] 11.↵
Walsh JA,
Zhou X,
Clegg DO,
Teng C,
Cannon GW,
Sauer B.
Mortality in American Veterans with the HLA-B27 gene. J Rheumatol 2015;42:638-44.
OpenUrl Abstract/FREE Full Text

[44] Walsh JA,

[45] Zhou X,

[46] Clegg DO,

[47] Teng C,

[48] Cannon GW,

[49] Sauer B.

[50] 12.↵
Walsh J,
Hunter T,
Schroeder K,
Sandoval D,
Bolce R.
Trends in diagnostic prevalence and treatment patterns of male and female ankylosing spondylitis patients in the United States, 2006-2016. BMC Rheumatol 2019;3:39.
OpenUrl

[51] Walsh J,

[52] Hunter T,

[53] Schroeder K,

[54] Sandoval D,

[55] Bolce R.

[56] 13.↵
Walsh JA,
Song X,
Kim G,
Park Y.
Evaluation of the comorbidity burden in patients with ankylosing spondylitis treated with tumour necrosis factor inhibitors using a large administrative data set. J Pharm Health Serv Res 2018;9:115-21.
OpenUrl

[57] Walsh JA,

[58] Song X,

[59] Kim G,

[60] Park Y.

[61] 14.↵
Sloan VS,
Sheahan A,
Stark JL,
Suruki RY.
Opioid use in patients with ankylosing spondylitis is common in the United States: outcomes of a retrospective cohort study. J Rheumatol 2019;46:1450-7.
OpenUrl Abstract/FREE Full Text

[62] Sloan VS,

[63] Sheahan A,

[64] Stark JL,

[65] Suruki RY.

[66] 15.↵
Curtis JR,
Harrold LR,
Asgari MM,
Deodhar A,
Salman C,
Gelfand JM, et al.
Diagnostic prevalence of ankylosing spondylitis using computerized health care data, 1996 to 2009: underrecognition in a US health care setting. Perm J 2016;20:15-151.
OpenUrl CrossRef PubMed

[67] Curtis JR,

[68] Harrold LR,

[69] Asgari MM,

[70] Deodhar A,

[71] Salman C,

[72] Gelfand JM, et al.

[73] 16.↵
Zhao SS,
Ermann J,
Xu C,
Lyu H,
Tedeschi SK,
Liao KP, et al.
Comparison of comorbidities and treatment between ankylosing spondylitis and non-radiographic axial spondyloarthritis in the United States. Rheumatology 2019;58:2025-30.
OpenUrl

[74] Zhao SS,

[75] Ermann J,

[76] Xu C,

[77] Lyu H,

[78] Tedeschi SK,

[79] Liao KP, et al.

[80] 17.↵
Walsh JA,
Rozycki M,
Yi E,
Park Y.
Application of machine learning in the diagnosis of axial spondyloarthritis. Curr Opin Rheumatol 2019;31:362-36.
OpenUrl PubMed

[81] Walsh JA,

[82] Rozycki M,

[83] Yi E,

[84] Park Y.

[85] 18.↵
Walsh JA,
Pei S,
Penmetsa G,
Hansen JL,
Cannon GW,
Clegg DO, et al.
Identification of axial spondyloarthritis patients in a large dataset: the development and validation of novel methods. J Rheumatol 2020;47:42-9.
OpenUrl Abstract/FREE Full Text

[86] Walsh JA,

[87] Pei S,

[88] Penmetsa G,

[89] Hansen JL,

[90] Cannon GW,

[91] Clegg DO, et al.

[92] 19.↵
Zhao SS,
Hong C,
Cai T,
Xu C,
Huang J,
Ermann J, et al.
Incorporating natural language processing to improve classification of axial spondyloarthritis using electronic health records. Rheumatology 2020;59:1059-65.
OpenUrl

[93] Zhao SS,

[94] Hong C,

[95] Cai T,

[96] Xu C,

[97] Huang J,

[98] Ermann J, et al.

[99] 20.↵
O’Sullivan JW,
Banerjee A,
Heneghan C,
Pluddemann A.
Verification bias. BMJ Evid Based Med 2018;23:54-5.
OpenUrl Abstract/FREE Full Text

[100] O’Sullivan JW,

[101] Banerjee A,

[102] Heneghan C,

[103] Pluddemann A.

[104] 21.↵
Walsh JA,
Pei S,
Penmetsa GK,
Leng J,
Cannon GW,
Clegg DO, et al.
Cohort identification of axial spondyloarthritis in a large healthcare dataset: current and future methods. BMC Musculoskelet Disord 2018;19:317.
OpenUrl

[105] Walsh JA,

[106] Pei S,

[107] Penmetsa GK,

[108] Leng J,

[109] Cannon GW,

[110] Clegg DO, et al.

[111] 22.↵
Walsh JA,
Shao Y,
Leng J,
He T,
Teng CC,
Redd D, et al.
Identifying axial spondyloarthritis in electronic medical records of US veterans. Arthritis Care Res 2017;69:1414-20.
OpenUrl

[112] Walsh JA,

[113] Shao Y,

[114] Leng J,

[115] He T,

[116] Teng CC,

[117] Redd D, et al.

[118] 23.↵
Breiman, L.
(2001) Random forests. Machine Learning 2001; 45:5-32.
OpenUrl CrossRef

[119] Breiman, L.

[120] 24.↵
Touw WG,
Bayjanov JR,
Overmars L,
Backus L,
Boekhorst J,
Wels M, et al.
Data mining in the life sciences with Random Forest: a walk in the park or lost in the jungle? Brief Bioinform 2013; 14:315-26.
OpenUrl CrossRef PubMed

[121] Touw WG,

[122] Bayjanov JR,

[123] Overmars L,

[124] Backus L,

[125] Boekhorst J,

[126] Wels M, et al.

[127] 25.↵
Lin WJ,
Chen JJ.
Class-imbalanced classifiers for high-dimensional data. Brief Bioinform 2013;14:13-26.
OpenUrl CrossRef PubMed

[128] Lin WJ,

[129] Chen JJ.

[130] 26.↵
Rodríguez JD,
Pérez A,
Lozano JA.
Sensitivity analysis of kappa-fold cross validation in prediction error estimation. IEEE Trans Pattern Anal Mach Intell 2010;32:569-75.
OpenUrl PubMed

[131] Rodríguez JD,

[132] Pérez A,

[133] Lozano JA.

[134] 27.↵
Leng CJ,
South B,
Shuying S.
eHOST: The Extensible Human Oracle Suite of Tools. [Internet. Accessed December 15, 2020]. Available from: orbit.nlm.nih.gov/browse-repository/software/nlp-information-extraction/62-ehost-the-extensible-human-oracle-suite-of-tools

[135] Leng CJ,

[136] South B,

[137] Shuying S.

[138] 28.↵
de Groot JA,
Janssen KJM,
Zwinderman AH,
Bossuyt PMM,
Reitsma JB,
Karel Moons KGM.
Correcting for partial verification bias: a comparison of methods. Ann Epidemiol 2011;21:139-48.
OpenUrl CrossRef PubMed

[139] de Groot JA,

[140] Janssen KJM,

[141] Zwinderman AH,

[142] Bossuyt PMM,

[143] Reitsma JB,

[144] Karel Moons KGM.

[145] 29.↵
Peters CB,
Hansen JL,
Halwani A,
Cho ME,
Leng J,
Huynh T, et al.
Validation of algorithms used to identify red blood cell transfusion related admissions in veteran patients with end stage renal disease. EGEMS 2019;7:23.
OpenUrl

[146] Peters CB,

[147] Hansen JL,

[148] Halwani A,

[149] Cho ME,

[150] Leng J,

[151] Huynh T, et al.

[152] 30.↵
Sasaki Y.
The truth of the F-measure. Teach Tutor mater 2007;1:1-5.
OpenUrl

[153] Sasaki Y.

[154] 31.↵
Walsh JA,
Pei S,
Penmetsa GK,
Sauer BC,
Patil V,
Walker JH, et al.
Treatment patterns with disease-modifying antirheumatic drugs in U.S. veterans with newly diagnosed rheumatoid arthritis, psoriatic arthritis, or ankylosing spondylitis. J Manag Care Spec Pharm 2019;25:1218-28.
OpenUrl

[155] Walsh JA,

[156] Pei S,

[157] Penmetsa GK,

[158] Sauer BC,

[159] Patil V,

[160] Walker JH, et al.

[161] 32.↵
Wu LC,
Leong PY,
Yeo KJ,
Li TY,
Wang YH,
Chiou JY, et al.
Celecoxib and sulfasalazine had negative association with coronary artery diseases in patients with ankylosing spondylitis: a nation-wide, population-based case-control study. Medicine 2016;95:e4792.
OpenUrl

[162] Wu LC,

[163] Leong PY,

[164] Yeo KJ,

[165] Li TY,

[166] Wang YH,

[167] Chiou JY, et al.

[168] 33.↵
Singh JA,
Holmgren AR,
Krug H,
Noorbaloochi S.
Accuracy of the diagnoses of spondylarthritides in veterans affairs medical center databases. Arthritis Rheum 2007; 15;57:648-55.
OpenUrl

[169] Singh JA,

[170] Holmgren AR,

[171] Krug H,

[172] Noorbaloochi S.

[173] 34.
Quan H,
Sundararajan V,
Halfon P,
Fong A,
Burnand B,
Luthi JC, et al.
Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data. Med Care 2005;43:1130-9.
OpenUrl CrossRef PubMed

[174] Quan H,

[175] Sundararajan V,

[176] Halfon P,

[177] Fong A,

[178] Burnand B,

[179] Luthi JC, et al.

Main menu

User menu

Search

Identifying Patients With Axial Spondyloarthritis in Large Datasets: Expanding Possibilities for Observational Research

Abstract

MATERIALS AND METHODS

RESULTS

DISCUSSION

Footnotes

REFERENCES

ONLINE SUPPLEMENT

In this issue

Citation Manager Formats

Keywords

Related Articles

Cited By...

More in this TOC Section

Similar Articles

Keywords

Content

Resources

Subscribers

More

Main menu

User menu

Search

Identifying Patients With Axial Spondyloarthritis in Large Datasets: Expanding Possibilities for Observational Research

Abstract

MATERIALS AND METHODS

RESULTS

DISCUSSION

Footnotes

REFERENCES

ONLINE SUPPLEMENT

In this issue

Citation Manager Formats

Jump to section

Keywords

Related Articles

Cited By...

More in this TOC Section

Similar Articles

Keywords

Content

Resources

Subscribers

More