Identifying Rheumatoid Arthritis Cases within the Quebec Health Administrative Database

Zeinab F. Slim; Cristiano Soares de Moura; Sasha Bernatsky; Elham Rahme

doi:10.3899/jrheum.181121

Abstract

Objective. Our objective was to calculate rheumatoid arthritis (RA) point prevalence estimates in the CARTaGENE cohort, as well as to estimate the sensitivity and specificity of our ascertainment approach, using physician billing data. We investigated the effects of using varying observation windows in the Régie de l’assurance maladie du Québec (RAMQ) health services administrative databases, alone or in combination with self-reported diagnoses and drugs.

Methods. We studied subjects enrolled in the CARTaGENE cohort, which recruited 19,995 participants from 4 metropolitan regions in Québec from August 2009 to October 2010. A series of Bayesian latent class models were developed to assess the effects of 3 factors: the number of years of billing data, the addition of self-reported information on RA diagnoses and drugs, and the adjustment for misclassification error.

Results. The 3-year 2010 point prevalence estimate among cohort members aged 40–69 years, using physician billing plus self-report, adjusting for misclassification error in each source, was 0.9% [95% credible interval (CrI) 0.7–1.2] with RAMQ sensitivity of 84.0% (95% CrI 74.0–93.7) and a specificity of 99.8% (95% CrI 99.6–100.0). Our results show variations in the prevalence point estimates related to all 3 factors investigated.

Conclusion. Our study illustrates that multiple data sources identify more RA cases and thus a higher prevalence estimate. RA point prevalence estimates using billing data are lower if fewer years of data are used.

Key Indexing Terms:

Rheumatoid arthritis (RA) is a type of chronic autoimmune disease, and like most chronic diseases, it is caused by a constellation of potential factors, including environmental and genetic risk factors¹. Surveillance data can provide insights into the epidemiology of RA. Additionally, prevalence data derived from surveillance can assist in making future projections and studying geographic variations². Having unbiased prevalence estimates is essential to improving care and outcomes. In Canada, the provincial government health insurance is nearly universal and administrative databases such as those collected by the Régie de l’assurance maladie du Québec (RAMQ) have been an attractive resource for prevalence studies on RA³. Methods for estimating RA prevalence in these databases rely on physician billing and/or hospitalization International Classification of Diseases (ICD) codes⁴. Prevalence estimates of RA obtained from administrative health databases have varied depending on several factors such as case definitions⁵ and the size of the observation window available for analysis in the health administrative database^6,7. Any ascertainment method within health administrative databases may miss some true cases and misclassify others.

An additional source of data for RA surveillance is self-reported data collected from large survey databases^8,9,10,11. Ascertainment of RA based on the patient’s self-reported data should be done with caution because misclassification is a concern. Supplementing this ascertainment method with medication information such as disease-modifying antirheumatic drugs (DMARD) improved the accuracy of self-reported RA in some studies¹². DMARD are the cornerstone of RA treatment and according to national and international guidelines, all RA patients with active disease should be offered DMARD therapies. Of course, a small number of patients with RA cannot take these drugs (if their RA is in remission — a relatively rare event — or for other reasons), so there could be false-positive and false-negative RA cases using this method as well. What makes the situation more challenging in large population-based surveillance studies is the absence of a gold standard to validate self-report or health administrative data sources.

Few RA prevalence estimates are available in Quebec or even in Canada. One prior study estimated RA prevalence for Quebec, using only physician billing and hospitalization diagnostic codes for the period 1992–2008; this accounted for misclassification error in administrative data³. However, additional studies may be helpful to elucidate the effects of the observation window within health administrative databases, the use of self-reported information, and the adjustment for misclassification error in all ascertainment methods on RA prevalence estimates. This study’s specific objective was to calculate, within 11 different observation windows in physician billing data, 2010 RA point prevalence estimates (unadjusted and adjusted for misclassification error) among the CARTaGENE cohort of adults aged 40–69 years, as well as to estimate the sensitivity and specificity of our ascertainment approach, using administrative data (alone or combined with self-reported data)¹³.

MATERIALS AND METHODS

Study setting, sources of data, ascertainment of RA cases, and time frame

This study took place in the context of a large established cohort entitled CARTaGENE, which recruited 19,995 participants (aged 40–69 yrs) from August 2009 to October 2010 from 4 metropolitan regions in Québec (Montreal, Sherbrooke, Québec City, and Saguenay, constituting 55.7% of the Quebec population). Participants were randomly selected from the provincial health insurance FIPA files (fichier administratif des inscriptions des personnes assurées), which include the entire population because health insurance coverage in Quebec is universal. Individuals were excluded if they were not registered in the FIPA files (such as the military), resided outside the selected regions in 2009, lived in First Nations reserves or longterm healthcare facilities, or were in prison. Participants were invited to an interview and completed a self-administered sociodemographic and lifestyle questionnaire as well as an interviewer-administered health questionnaire. Participation rate was 25.6% and there were regional variations in the participation rates, with the Saguenay region having the highest participation rate (33.9%) and the Montreal northern suburbs having the lowest (21.8% for Laval and 21.2% for the North Shore). Data on demographic and socioeconomic factors, lifestyle habits, mental health, individual and family history of disease, medical care history such as visits to a doctor or a nurse, and current medications were collected⁸. Further details on the CARTaGENE cohort can be found elsewhere⁸. The CARTaGENE research cohort has been linked to RAMQ data using patients’ unique provincial health insurance numbers. The RAMQ medical service database has information on physician outpatient visits, including diagnoses coded according to the International Classification of Diseases, 9th revision (ICD-9) during the time interval of data collection.

Our study included all CARTaGENE participants who were interviewed between 2009 and 2010. Individuals with incomplete or missing information concerning RA diagnosis and current DMARD use were excluded. Therefore, our reported estimates may be considered as estimates of 2010 point prevalence for RA, in which the point represents the end of 2010 and the denominators are those individuals enrolled in CARTaGENE by the end of the data collection phase. Survey-based RA cases were defined using the self-reported information on RA diagnosis as well as current use of either conventional DMARD (hydroxychloroquine, sulfasalazine, methotrexate, leflunomide, azathioprine, cyclosporine, gold, and cyclophosphamide) and/or the biologic DMARD (infliximab, adalimumab, etanercept, abatacept, and rituximab). RAMQ-based RA cases were defined using physicians’ claims data according to an algorithm requiring 2 or more RA diagnoses by any physician at least 2 months apart but within a 2-year span, or at least 1 RA diagnosis by a rheumatologist.

RAMQ data were available for our study subjects from January 1, 1998, to December 31, 2010. Eleven successive nested observation windows that ranged from a minimum of 3 years (2008–2010) to a maximum of 13 years (1998–2010) were constructed by adding successively one earlier year to the years under observation (2008–2010; 2007–2010 … 1998–2010). Therefore, all time windows ended in December 2010 and were used to calculate the point prevalence of 2010.

Statistical methods

In our analyses, we considered both the self-reported and physician claims ascertainment methods to be imperfect. In such case (i.e., in the absence of a gold standard), the true RA status can be thought of as “missing.” By knowing the values of the sensitivity and specificity of the imperfect ascertainment method, a latent class analysis can be used to adjust the prevalence for misclassification errors. We used a Bayesian latent class analysis to summarize the existing information about each variable (sensitivity, specificity, and prevalence) in the form of prior distributions. Then, the prior information was updated by the data through Bayes’ theorem to result in posterior distributions of these variables^{14,15,16,17,18}.

More specifically, the number of subjects who are categorized as having RA according to each imperfect ascertainment method is a mix of true-positive and false-positive individuals. The Bayesian latent class model links the observed results of each method to the unobserved truth of RA status using the following formula: (total sample size)*[(prevalence of RA*sensitivity of the ascertainment method) + (1 − prevalence)(1 − specificity of the ascertainment method)]¹⁸.

Informative prior distributions were used over the sensitivity and specificity of RAMQ based on the subjective opinions of 8 experts in the field as well as on a published validation study of provincial administrative data, which used primary care records as reference standard¹⁹. We varied the prior distributions of the sensitivity and specificity of the physician claim ascertainment method ranging from 60% to 90% and 82% to 99%, respectively. Informative prior distributions over the prevalence ranging from 0% to 8% were chosen based on the literature. For the sensitivity and specificity of self-reported data, “uninformative” prior distributions [e.g., β (1,1)] were used. For all variables, a β prior distribution was used¹⁸.

In a Bayesian latent class model, the likelihood function relating the observed and latent data to the unknown variables for one ascertainment method (i.e., RAMQ) is as follows:

L (a,b,X,Y/π, Se, Sp) = [πSe]^X [π(1 − Se)]^Y [(1 − π)(1 − Sp)]^a–X [(1 − π)(Sp)]^b–Y, where “a” and “b” are the observed number of individuals with positive and negative results on the ascertainment method (here RA diagnoses in RAMQ), respectively; X and Y are the latent truly positive; π is the prevalence of RA; and Se and Sp are the sensitivity and specificity of the ascertainment method, respectively. In the case where RAMQ was combined with self-reported sources, the likelihood contributions of all possible combinations of observed and latent data are provided in Table 1. The likelihood is proportional to the product of each entry in the last column raised to the power of the corresponding entry in the first column of the table.

View this table:

Table 1.

Likelihood contribution of observed and latent data when combining Régie de l’assurance maladie du Québec (RAMQ) ascertainment method with self-reported RA diagnosis and DMARD use.

To address the potential issue that self-reported RA diagnosis and DMARD use may be dependent, even conditional, on the true disease status in the model combining the 3 methods, conditional correlation between the 2 CARTaGENE self-reported sources of information in RA subjects and in non-RA subjects were incorporated²⁰.

The unadjusted (naive) estimates of RA prevalence were estimated based on RAMQ billing codes for each time window in administrative data. These estimates were obtained by dividing the number of those diagnosed with RA by the total sample size. The unadjusted prevalence estimates were calculated using the Bayesian method for single proportions. Uninformative β prior distribution [e.g. β (1,1)], where all values are equally likely, was used over the unknown unadjusted prevalence variable. In this case, the posterior prevalence estimates (unadjusted for misclassification error) are expected to be numerically the same as those obtained using frequentist method¹⁸ (i.e., dividing the number of those diagnosed with RA using billing codes by the total sample size).

Posterior estimates for each variable were determined based on a sample from the posterior distribution using Gibbs sampling with the WinBUGS statistical freeware (version 1.4.3, MRC Biostatistics Unit). Each model was assessed after a burn-in of 5000 iterations and a further 30,000 iterations for use in inferences²¹. The mean and 2.5–97.5 percentile values (95% credible intervals; CrI) for each variable were extracted.

Approval for the study was obtained from McGill University Ethics Review Board (approval number: A04-M47-12B), CARTaGENE as well as Commission d’accès à l’information du Québec (approval number: 100 49 57). Additionally, participants signed a written informed consent to publish the material.

RESULTS

The baseline characteristics of the study cohort were evaluated, including age, sex, geographical region, education, and current working status. Just over half of the sample was female, and the overwhelming majority lived in Montreal. The full profile of the participants is presented in Table 2.

View this table:

Table 2.

Demographics of CARTaGENE participants who have complete self-reported information.

Using only self-reported RA diagnosis, without any adjustment for misclassification, the RA prevalence estimate was 2.9% (564 out of 19,704) with 95% CrI 2.6–3.1. The naive estimate from DMARD use was lower at 0.9% (182 out of 19,704) with 95% CrI 0.8–1.1. Adjusting for misclassification error decreased the point prevalence estimate to 1.3% (95% CrI 0.07–3.2) for self-RA diagnosis and 0.4% (95% CrI 0.02–1.1) for current DMARD use.

We found 197 RA cases using only 3 years of physician billing, unadjusted for misclassification error. When more years were used, the number of RA cases continued to increase, up to 321 when looking back 13 years.

The unadjusted 2010 RA prevalence point estimate based on 3 years of RAMQ data alone was 1.0% (197 RA cases out of 19,704) with 95% CrI 0.9–1.2. Using 5 years of data, the prevalence point estimate increased by 20%. When using 13 years of RAMQ data, there was a 60% increase in the unadjusted prevalence point estimate (1.6%, 95% CrI 1.5–1.8) compared to the estimate from using 3 years of data (Table 3).

View this table:

Table 3.

RA prevalence of the different combinations of ascertainment methods for the 11 observation periods, CARTaGENE cohort, Quebec, 2009–2010.

Adjusting for misclassification error using the Bayesian latent class model, RA prevalence point estimate was 0.4% (95% CrI 0.03–1.1) for the shortest observation window. Additionally, the adjusted prevalence was lower than the unadjusted prevalence estimates for all observation windows. The adjusted estimates across all time windows showed an increasing trend but remained lower than the RAMQ-based unadjusted estimate. The CrI around the adjusted point estimate using RAMQ alone were much wider than the CrI around the unadjusted estimates, which is expected because adjustment accounts for misclassification.

As for the combined RAMQ and self-reported information, the different combinations of the observed data are presented in Supplementary Table 1 (available from the authors on request). For all observation windows, the adjusted point estimates derived from combining RAMQ with self-reported data were lower than the unadjusted estimates and higher than the adjusted estimates using RAMQ alone. When combining administrative and self-reported data, adding more years of administrative data increased the adjusted point estimates (Table 3) in a similar fashion to when administrative data were used alone. The CrI were all overlapping. Figure 1 shows the increasing trends in the point estimates (unadjusted and adjusted, with administrative data alone and then adding self-reported data).

Figure 1.

RA point prevalence estimates by the duration of observation period within Régie de l’assurance maladie du Québec (RAMQ). RA: rheumatoid arthritis.

The results for the sensitivity estimates of case ascertainment across varying time windows (with administrative data alone and combining with self-reported data) are shown in Table 4. The sensitivity of case ascertainment using RAMQ data alone was unchanged (78%) for all observation windows. However, complementing the RAMQ billing codes case ascertainment method with self-reported data sources on RA diagnosis and current DMARD use increased the point estimate for sensitivity from 78.1% (95% CrI 58.3–92.6) to 84.0% (95% CrI 74.0–93.7) for the shortest time window. Our estimates of the sensitivity of RAMQ data versus the self-reported data remained relatively steady over time. The specificity of RAMQ ascertainment method alone as well as combining it with self-reported data was high (99%) and stable throughout all time windows.

View this table:

Table 4.

Sensitivity and specificity of Régie de l’assurance maladie du Québec (RAMQ) ascertainment method alone and when combining it with self-reported data for the 11 observation periods, CARTaGENE cohort, Quebec (2009–2010).

DISCUSSION

In this study, a series of Bayesian latent class models were developed to assess the effects of 3 factors (i.e., the length of observation window within administrative data, the inclusion of self-reported information on RA, and adjustment for misclassification error in administrative data) on RA prevalence estimates in the CARTaGENE sample. Our results show variations in the prevalence point estimates related to all 3 factors. There was negligible change in the sensitivity estimates for case ascertainment using administrative data with more years of observation, but a noticeable gain in sensitivity when additional information from self-reported information on RA diagnosis and current DMARD use were added to the model. The 3-year 2010 point prevalence estimate among adults aged 40–69 years using the 3 ascertainment methods and adjusting for misclassification error in each method was 0.9% (95% CrI 0.7–1.2).

Previous studies of the effect of increasing years of administrative data on rheumatic diseases prevalence estimates found trends similar to ours (i.e., higher prevalence estimates with more years of data)^6,7,22,23,24. However, ours is the only one that adjusted for the imperfect data sources. As evident from our study, the inclusion of self-reported RA data reduced the trend for incomplete ascertainment with few years of administrative data. RA is a dynamic chronic disease, characterized by unpredictable flares and remissions of disease activity²⁵. During periods of remission, patients may not seek medical treatment, at least for RA. So, extracting ICD codes for a short observation window in RAMQ may miss some cases, specifically those patients in remission or with mild disease activity who happen not to use health services in the years under observation. Since 1 diagnostic code is allowed per physician visit in Quebec, RA patients with comorbidities may escape detection based on ICD codes within short observation windows if the code reported by the physician is for comorbidity and not RA.

Ng, et al studied the effect of the number of years of administrative data observed on estimates of SLE prevalence and recommended the use of long time windows to avoid underascertainment⁶. However, using longer observation windows could lead to overestimation of RA prevalence if misclassification error is not accounted for. This highlights the importance of carefully thinking about both sensitivity and specificity. Moreover, using longer time windows within health administrative databases has some drawbacks when the interest is in more recent prevalence estimates because temporal changes such as diagnostic drift have occurred over time²⁶. For example, the American College of Rheumatology (ACR) criteria for RA diagnosis have changed 3 times in the last 50 years²⁷. The most recent are the 2010 ACR/European League Against Rheumatism classification criteria²⁸. These changes in diagnostic criteria could alter RA prevalence estimates when longer time windows are analyzed.

The sensitivity of case ascertainment using administrative data alone was about 78% and remained steady throughout all time windows in our study. Supplementing administrative data with patient self-reported RA diagnosis and current use of DMARD increased the point estimate for sensitivity to about 85% (although CrI overlapped). This finding may be important for investigators who may have access to only a few years of administrative data, if they have additional sources of information on RA status. The importance of using multiple data sources is corroborated by recommendations from other researchers working on chronic disease surveillance^26,29,30. In the absence of other data sources, lengthening the number of years of RAMQ data increases RA prevalence point estimates, but with overlapping CrI across all observation windows.

One potential limitation in our study is the use of current DMARD consumption as an ascertainment method. Prior DMARD use was not available in the data. If ever DMARD use was assessed instead, then a better identification of RA cases (i.e., increase in the sensitivity estimate) would have been likely with the 3 ascertainment methods. Current DMARD use identifies only those with active disease. Although the low sensitivity of this ascertainment method was accounted for in the prior distribution, it is possible that accounting for ever DMARD use would have improved the collection of RA cases and further reduced the misclassification error by identifying those who were in remission during the survey.

Additionally, our adjusted results using health administrative data alone were not that precise even with such a large sample size. The difficulty in getting accurate prior information on the sensitivity and specificity can affect the precision of the posterior intervals. However, the precision was improved with additional information on RA status from self-reported data.

In our study, we did not use hospitalization RA codes. In fact, the Canadian working group on rheumatic disease definitions for surveillance using administrative data has done analyses of billing data with or without hospitalization data, and their consensus (based on analyses from each province) was that hospitalization data does not increase sensitivity of RA ascertainment.

The strengths of our study were the use of a very large cohort of individuals with both self-reported and administrative data on RA. Both data sources were adjusted for misclassification error in the absence of gold standard, which reflects a real-life challenge because few RA ascertainment approaches are considered 100% accurate. To the authors’ knowledge, this is the first study to date to combine self-reported data and Canadian provincial health administrative data to estimate an adjusted RA prevalence.

Our study illustrates that when using administrative data, RA point prevalence estimates are lower if few years of data are observed, and that multiple data sources can help identify more RA cases.

Accepted for publication February 13, 2019.

REFERENCES

1.↵
1. Thacker SB,
2. Stroup DF,
3. Rothenberg RB
. Public health surveillance for chronic conditions: a scientific basis for decisions. Stat Med 1995;14:629–41.
OpenUrl CrossRef PubMed
2.↵
1. Gordis L
. The occurrence of disease: I. Disease surveillance and measures of morbidity. In: Epidemiology. Fifth ed. Philadelphia: Elsevier Saunders; 2014:49–52.
3.↵
1. Bernatsky S,
2. Dekis A,
3. Hudson M,
4. Pineau CA,
5. Boire G,
6. Fortin PR,
7. et al.
Rheumatoid arthritis prevalence in Quebec. BMC Res Notes 2014;7:937.
OpenUrl CrossRef PubMed
4.↵
1. Malone DC,
2. Billups SJ,
3. Valuck RJ,
4. Carter BL
. Development of a chronic disease indicator score using a Veterans Affairs Medical Center medication database. J Clin Epidemiol 1999;52:551–7.
OpenUrl CrossRef PubMed
5.↵
1. Widdifield J,
2. Labrecque J,
3. Lix L,
4. Paterson JM,
5. Bernatsky S,
6. Tu K,
7. et al.
Systematic review and critical appraisal of validation studies to identify rheumatic diseases in health administrative databases. Arthritis Care Res 2013;65:1490–503.
OpenUrl
6.↵
1. Ng R,
2. Bernatsky S,
3. Rahme E
. Observation period effects on estimation of systemic lupus erythematosus incidence and prevalence in Quebec. J Rheumatol 2013;40:1334–6.
OpenUrl Abstract/FREE Full Text
7.↵
1. Nightingale A,
2. Farmer R,
3. de Vries CS
. Systemic lupus erythematosus prevalence in the UK: methodological issues when using the general practice research database to estimate frequency of chronic relapsing-remitting disease. Pharmacoepidemiol Drug Saf 2007;16:144–51.
OpenUrl CrossRef PubMed
8.↵
1. Awadalla P,
2. Boileau C,
3. Payette Y,
4. Idaghdour Y,
5. Goulet JP,
6. Knoppers B,
7. et al;
8. CARTaGENE Project
. Cohort profile of the CARTaGENE study: Quebec’s population-based biobank for public health and personalized genomics. Int J Epidemiol 2013;42:1285–99.
OpenUrl CrossRef PubMed
9.↵
1. Chaaya M,
2. Slim ZN,
3. Habib RR,
4. Arayssi T,
5. Dana R,
6. Hamdan O,
7. et al.
High burden of rheumatic diseases in Lebanon: A COPCORD study. Int J Rheum Dis 2012;15:136–43.
OpenUrl CrossRef PubMed
10.↵
1. Gariepy G,
2. Rossignol M,
3. Lippman A
. Characteristics of subjects self-reporting arthritis in a population health survey: distinguishing between types of arthritis. Can J Public Health 2009;100:467–71.
OpenUrl PubMed
11.↵
1. Centers for Disease Control and Prevention (CDC)
. Prevalence of self-reported arthritis or chronic joint symptoms among adults— United States, 2001. MMWR Morb Mortal Wkly Rep 2002;51: 948–50.
OpenUrl PubMed
12.↵
1. Walitt BT,
2. Constantinescu F,
3. Katz JD,
4. Weinstein A,
5. Wang H,
6. Hernandez RK,
7. et al.
Validation of self-report of rheumatoid arthritis and systemic lupus erythematosus: The Women’s Health Initiative. J Rheumatol 2008;35:811–8.
OpenUrl Abstract/FREE Full Text
13.↵
1. Slim Z
. Estimating rheumatoid arthritis prevalence and care quality in a large sample from the Quebec population [dissertation]. Montreal: McGill University; 2018:111 pp.
14.↵
1. Rutjes A,
2. Reitsma J,
3. Coomarasamy A,
4. Khan K,
5. Bossuyt P
. Evaluation of diagnostic tests when there is no gold standard. A review of methods. Health Technol Assess 2007;11:iii, ix–51.
OpenUrl PubMed
15.↵
1. van Smeden M,
2. Naaktgeboren CA,
3. Reitsma JB,
4. Moons KG,
5. de Groot JA
. Latent class models in diagnostic studies when there is no reference standard—a systematic review. Am J Epidemiol 2014;179:423–31.
OpenUrl CrossRef PubMed
16.↵
1. Toft N,
2. Jørgensen E,
3. Højsgaard S
. Diagnosing diagnostic tests: evaluating the assumptions underlying the estimation of sensitivity and specificity in the absence of a gold standard. Prev Vet Med 2005;68:19–33.
OpenUrl CrossRef PubMed
17.↵
1. Enøe C,
2. Georgiadis MP,
3. Johnson WO
. Estimation of sensitivity and specificity of diagnostic tests and disease prevalence when the true disease state is unknown. Prev Vet Med 2000;45:61–81.
OpenUrl CrossRef PubMed
18.↵
1. Joseph L,
2. Gyorkos TW,
3. Coupal L
. Bayesian estimation of disease prevalence and the parameters of diagnostic tests in the absence of a gold standard. Am J Epidemiol 1995;141:263–72.
OpenUrl CrossRef PubMed
19.↵
1. Widdifield J,
2. Bombardier C,
3. Bernatsky S,
4. Paterson JM,
5. Green D,
6. Young J,
7. et al.
An administrative data validation study of the accuracy of algorithms for identifying rheumatoid arthritis: the influence of the reference standard on algorithm performance. BMC Musculoskelet Disord 2014;15:216.
OpenUrl CrossRef PubMed
20.↵
1. Dendukuri N,
2. Joseph L
. Bayesian approaches to modeling the conditional dependence between multiple diagnostic tests. Biometrics 2001;57:158–67.
OpenUrl CrossRef PubMed
21.↵
1. Weichenthal S,
2. Joseph L,
3. Bélisle P,
4. Dufresne A
. Bayesian estimation of the probability of asbestos exposure from lung fiber counts. Biometrics 2010;66:603–12.
OpenUrl
22.↵
1. Wiréhn A-BE,
2. Karlsson HM,
3. Carstensen JM
. Estimating disease prevalence using a population-based administrative healthcare database. Scand J Public Health 2007;35:424–31.
OpenUrl PubMed
23.↵
1. Powell KE,
2. Diseker RA,
3. Presley RJ,
4. Tolsma D,
5. Harris S,
6. Mertz KJ,
7. et al.
Administrative data as a tool for arthritis surveillance: estimating prevalence and utilization of services. J Public Health Manag Prac 2003;9:291–8.
OpenUrl PubMed
24.↵
1. Kopec JA,
2. Rahman MM,
3. Berthelot JM,
4. Le Petit C,
5. Aghajanian J,
6. Sayre EC,
7. et al.
Descriptive epidemiology of osteoarthritis in British Columbia, Canada. J Rheumatol 2007;34:386–93.
OpenUrl Abstract/FREE Full Text
25.↵
1. Kvien TK
. Epidemiology and burden of illness of rheumatoid arthritis. Pharmacoeconomics 2004;2 Suppl 1:1–12.
OpenUrl CrossRef
26.↵
1. Ward MM
. Estimating disease prevalence and incidence using administrative data: some assembly required. J Rheumatol 2013;40:1241–3.
OpenUrl FREE Full Text
27.↵
1. Arnett FC,
2. Edworthy SM,
3. Bloch DA,
4. McShane DJ,
5. Fries JF,
6. Cooper NS,
7. et al.
The American Rheumatism Association 1987 revised criteria for the classification of rheumatoid arthritis. Arthritis Rheum 1988;31:315–24.
OpenUrl CrossRef PubMed
28.↵
1. Aletaha D,
2. Neogi T,
3. Silman AJ,
4. Funovits J,
5. Felson DT,
6. Bingham CO 3rd,
7. et al.
2010 rheumatoid arthritis classification criteria: an American College of Rheumatology/European League Against Rheumatism collaborative initiative. Arthritis Rheum 2010;62:2569–81.
OpenUrl CrossRef PubMed
29.↵
1. Cricelli C,
2. Mazzaglia G,
3. Samani F,
4. Marchi M,
5. Sabatini A,
6. Nardi R,
7. et al.
Prevalence estimates for chronic diseases in Italy: Exploring the differences between self-report and primary care databases. J Public Health Med 2003;25:254–7.
OpenUrl CrossRef PubMed
30.↵
1. Bernatsky S,
2. Lix L,
3. Hanly J,
4. Hudson M,
5. Badley E,
6. Peschken C,
7. et al.
Surveillance of systemic autoimmune rheumatic diseases using administrative data. Rheumatol Int 2011;31:549–54.
OpenUrl CrossRef PubMed

In this issue

Download PDF

Bookmark this article

Keywords

BAYESIAN LATENT CLASS MODELS

PREVALENCE

QUEBEC

SELF-REPORT DATA

CANADIAN PROVINCIAL HEALTH ADMINISTRATIVE DATA

RHEUMATOID ARTHRITIS

Cited By...

More in this TOC Section

Show more Rheumatoid Arthritis

Keywords

[1] 1.↵
Thacker SB,
Stroup DF,
Rothenberg RB
. Public health surveillance for chronic conditions: a scientific basis for decisions. Stat Med 1995;14:629–41.
OpenUrl CrossRef PubMed

[2] Thacker SB,

[3] Stroup DF,

[4] Rothenberg RB

[5] 2.↵
Gordis L
. The occurrence of disease: I. Disease surveillance and measures of morbidity. In: Epidemiology. Fifth ed. Philadelphia: Elsevier Saunders; 2014:49–52.

[6] Gordis L

[7] 3.↵
Bernatsky S,
Dekis A,
Hudson M,
Pineau CA,
Boire G,
Fortin PR,
et al.
Rheumatoid arthritis prevalence in Quebec. BMC Res Notes 2014;7:937.
OpenUrl CrossRef PubMed

[8] Bernatsky S,

[9] Dekis A,

[10] Hudson M,

[11] Pineau CA,

[12] Boire G,

[13] Fortin PR,

[14] et al.

[15] 4.↵
Malone DC,
Billups SJ,
Valuck RJ,
Carter BL
. Development of a chronic disease indicator score using a Veterans Affairs Medical Center medication database. J Clin Epidemiol 1999;52:551–7.
OpenUrl CrossRef PubMed

[16] Malone DC,

[17] Billups SJ,

[18] Valuck RJ,

[19] Carter BL

[20] 5.↵
Widdifield J,
Labrecque J,
Lix L,
Paterson JM,
Bernatsky S,
Tu K,
et al.
Systematic review and critical appraisal of validation studies to identify rheumatic diseases in health administrative databases. Arthritis Care Res 2013;65:1490–503.
OpenUrl

[21] Widdifield J,

[22] Labrecque J,

[23] Lix L,

[24] Paterson JM,

[25] Bernatsky S,

[26] Tu K,

[27] et al.

[28] 6.↵
Ng R,
Bernatsky S,
Rahme E
. Observation period effects on estimation of systemic lupus erythematosus incidence and prevalence in Quebec. J Rheumatol 2013;40:1334–6.
OpenUrl Abstract/FREE Full Text

[29] Ng R,

[30] Bernatsky S,

[31] Rahme E

[32] 7.↵
Nightingale A,
Farmer R,
de Vries CS
. Systemic lupus erythematosus prevalence in the UK: methodological issues when using the general practice research database to estimate frequency of chronic relapsing-remitting disease. Pharmacoepidemiol Drug Saf 2007;16:144–51.
OpenUrl CrossRef PubMed

[33] Nightingale A,

[34] Farmer R,

[35] de Vries CS

[36] 8.↵
Awadalla P,
Boileau C,
Payette Y,
Idaghdour Y,
Goulet JP,
Knoppers B,
et al;
CARTaGENE Project
. Cohort profile of the CARTaGENE study: Quebec’s population-based biobank for public health and personalized genomics. Int J Epidemiol 2013;42:1285–99.
OpenUrl CrossRef PubMed

[37] Awadalla P,

[38] Boileau C,

[39] Payette Y,

[40] Idaghdour Y,

[41] Goulet JP,

[42] Knoppers B,

[43] et al;

[44] CARTaGENE Project

[45] 9.↵
Chaaya M,
Slim ZN,
Habib RR,
Arayssi T,
Dana R,
Hamdan O,
et al.
High burden of rheumatic diseases in Lebanon: A COPCORD study. Int J Rheum Dis 2012;15:136–43.
OpenUrl CrossRef PubMed

[46] Chaaya M,

[47] Slim ZN,

[48] Habib RR,

[49] Arayssi T,

[50] Dana R,

[51] Hamdan O,

[52] et al.

[53] 10.↵
Gariepy G,
Rossignol M,
Lippman A
. Characteristics of subjects self-reporting arthritis in a population health survey: distinguishing between types of arthritis. Can J Public Health 2009;100:467–71.
OpenUrl PubMed

[54] Gariepy G,

[55] Rossignol M,

[56] Lippman A

[57] 11.↵
Centers for Disease Control and Prevention (CDC)
. Prevalence of self-reported arthritis or chronic joint symptoms among adults— United States, 2001. MMWR Morb Mortal Wkly Rep 2002;51: 948–50.
OpenUrl PubMed

[58] Centers for Disease Control and Prevention (CDC)

[59] 12.↵
Walitt BT,
Constantinescu F,
Katz JD,
Weinstein A,
Wang H,
Hernandez RK,
et al.
Validation of self-report of rheumatoid arthritis and systemic lupus erythematosus: The Women’s Health Initiative. J Rheumatol 2008;35:811–8.
OpenUrl Abstract/FREE Full Text

[60] Walitt BT,

[61] Constantinescu F,

[62] Katz JD,

[63] Weinstein A,

[64] Wang H,

[65] Hernandez RK,

[66] et al.

[67] 13.↵
Slim Z
. Estimating rheumatoid arthritis prevalence and care quality in a large sample from the Quebec population [dissertation]. Montreal: McGill University; 2018:111 pp.

[68] Slim Z

[69] 14.↵
Rutjes A,
Reitsma J,
Coomarasamy A,
Khan K,
Bossuyt P
. Evaluation of diagnostic tests when there is no gold standard. A review of methods. Health Technol Assess 2007;11:iii, ix–51.
OpenUrl PubMed

[70] Rutjes A,

[71] Reitsma J,

[72] Coomarasamy A,

[73] Khan K,

[74] Bossuyt P

[75] 15.↵
van Smeden M,
Naaktgeboren CA,
Reitsma JB,
Moons KG,
de Groot JA
. Latent class models in diagnostic studies when there is no reference standard—a systematic review. Am J Epidemiol 2014;179:423–31.
OpenUrl CrossRef PubMed

[76] van Smeden M,

[77] Naaktgeboren CA,

[78] Reitsma JB,

[79] Moons KG,

[80] de Groot JA

[81] 16.↵
Toft N,
Jørgensen E,
Højsgaard S
. Diagnosing diagnostic tests: evaluating the assumptions underlying the estimation of sensitivity and specificity in the absence of a gold standard. Prev Vet Med 2005;68:19–33.
OpenUrl CrossRef PubMed

[82] Toft N,

[83] Jørgensen E,

[84] Højsgaard S

[85] 17.↵
Enøe C,
Georgiadis MP,
Johnson WO
. Estimation of sensitivity and specificity of diagnostic tests and disease prevalence when the true disease state is unknown. Prev Vet Med 2000;45:61–81.
OpenUrl CrossRef PubMed

[86] Enøe C,

[87] Georgiadis MP,

[88] Johnson WO

[89] 18.↵
Joseph L,
Gyorkos TW,
Coupal L
. Bayesian estimation of disease prevalence and the parameters of diagnostic tests in the absence of a gold standard. Am J Epidemiol 1995;141:263–72.
OpenUrl CrossRef PubMed

[90] Joseph L,

[91] Gyorkos TW,

[92] Coupal L

[93] 19.↵
Widdifield J,
Bombardier C,
Bernatsky S,
Paterson JM,
Green D,
Young J,
et al.
An administrative data validation study of the accuracy of algorithms for identifying rheumatoid arthritis: the influence of the reference standard on algorithm performance. BMC Musculoskelet Disord 2014;15:216.
OpenUrl CrossRef PubMed

[94] Widdifield J,

[95] Bombardier C,

[96] Bernatsky S,

[97] Paterson JM,

[98] Green D,

[99] Young J,

[100] et al.

[101] 20.↵
Dendukuri N,
Joseph L
. Bayesian approaches to modeling the conditional dependence between multiple diagnostic tests. Biometrics 2001;57:158–67.
OpenUrl CrossRef PubMed

[102] Dendukuri N,

[103] Joseph L

[104] 21.↵
Weichenthal S,
Joseph L,
Bélisle P,
Dufresne A
. Bayesian estimation of the probability of asbestos exposure from lung fiber counts. Biometrics 2010;66:603–12.
OpenUrl

[105] Weichenthal S,

[106] Joseph L,

[107] Bélisle P,

[108] Dufresne A

[109] 22.↵
Wiréhn A-BE,
Karlsson HM,
Carstensen JM
. Estimating disease prevalence using a population-based administrative healthcare database. Scand J Public Health 2007;35:424–31.
OpenUrl PubMed

[110] Wiréhn A-BE,

[111] Karlsson HM,

[112] Carstensen JM

[113] 23.↵
Powell KE,
Diseker RA,
Presley RJ,
Tolsma D,
Harris S,
Mertz KJ,
et al.
Administrative data as a tool for arthritis surveillance: estimating prevalence and utilization of services. J Public Health Manag Prac 2003;9:291–8.
OpenUrl PubMed

[114] Powell KE,

[115] Diseker RA,

[116] Presley RJ,

[117] Tolsma D,

[118] Harris S,

[119] Mertz KJ,

[120] et al.

[121] 24.↵
Kopec JA,
Rahman MM,
Berthelot JM,
Le Petit C,
Aghajanian J,
Sayre EC,
et al.
Descriptive epidemiology of osteoarthritis in British Columbia, Canada. J Rheumatol 2007;34:386–93.
OpenUrl Abstract/FREE Full Text

[122] Kopec JA,

[123] Rahman MM,

[124] Berthelot JM,

[125] Le Petit C,

[126] Aghajanian J,

[127] Sayre EC,

[128] et al.

[129] 25.↵
Kvien TK
. Epidemiology and burden of illness of rheumatoid arthritis. Pharmacoeconomics 2004;2 Suppl 1:1–12.
OpenUrl CrossRef

[130] Kvien TK

[131] 26.↵
Ward MM
. Estimating disease prevalence and incidence using administrative data: some assembly required. J Rheumatol 2013;40:1241–3.
OpenUrl FREE Full Text

[132] Ward MM

[133] 27.↵
Arnett FC,
Edworthy SM,
Bloch DA,
McShane DJ,
Fries JF,
Cooper NS,
et al.
The American Rheumatism Association 1987 revised criteria for the classification of rheumatoid arthritis. Arthritis Rheum 1988;31:315–24.
OpenUrl CrossRef PubMed

[134] Arnett FC,

[135] Edworthy SM,

[136] Bloch DA,

[137] McShane DJ,

[138] Fries JF,

[139] Cooper NS,

[140] et al.

[141] 28.↵
Aletaha D,
Neogi T,
Silman AJ,
Funovits J,
Felson DT,
Bingham CO 3rd,
et al.
2010 rheumatoid arthritis classification criteria: an American College of Rheumatology/European League Against Rheumatism collaborative initiative. Arthritis Rheum 2010;62:2569–81.
OpenUrl CrossRef PubMed

[142] Aletaha D,

[143] Neogi T,

[144] Silman AJ,

[145] Funovits J,

[146] Felson DT,

[147] Bingham CO 3rd,

[148] et al.

[149] 29.↵
Cricelli C,
Mazzaglia G,
Samani F,
Marchi M,
Sabatini A,
Nardi R,
et al.
Prevalence estimates for chronic diseases in Italy: Exploring the differences between self-report and primary care databases. J Public Health Med 2003;25:254–7.
OpenUrl CrossRef PubMed

[150] Cricelli C,

[151] Mazzaglia G,

[152] Samani F,

[153] Marchi M,

[154] Sabatini A,

[155] Nardi R,

[156] et al.

[157] 30.↵
Bernatsky S,
Lix L,
Hanly J,
Hudson M,
Badley E,
Peschken C,
et al.
Surveillance of systemic autoimmune rheumatic diseases using administrative data. Rheumatol Int 2011;31:549–54.
OpenUrl CrossRef PubMed

[158] Bernatsky S,

[159] Lix L,

[160] Hanly J,

[161] Hudson M,

[162] Badley E,

[163] Peschken C,

[164] et al.

Main menu

User menu

Search

Identifying Rheumatoid Arthritis Cases within the Quebec Health Administrative Database

Abstract

MATERIALS AND METHODS

Study setting, sources of data, ascertainment of RA cases, and time frame

Statistical methods

RESULTS

DISCUSSION

REFERENCES

In this issue

Citation Manager Formats

Keywords

Related Articles

Cited By...

More in this TOC Section

Similar Articles

Keywords

Content

Resources

Subscribers

More

Main menu

User menu

Search

Identifying Rheumatoid Arthritis Cases within the Quebec Health Administrative Database

Abstract

MATERIALS AND METHODS

Study setting, sources of data, ascertainment of RA cases, and time frame

Statistical methods

RESULTS

DISCUSSION

REFERENCES

In this issue

Citation Manager Formats

Jump to section

Keywords

Related Articles

Cited By...

More in this TOC Section

Similar Articles

Keywords

Content

Resources

Subscribers

More