Abstract
Objective. To identify empirical evidence of diagnostic delay in spondyloarthritis (SpA), determine whether sex-related differences persist, and conduct an analysis from that perspective of the possible causes, including the influence of quality research, in this group of inflammatory rheumatic diseases.
Methods. A systematic review was done of delay in diagnosis of SpA in MEDLINE and EMBASE and other sources. Study quality was determined in line with the Strengthening The Reporting of OBservational studies in Epidemiology (STROBE) statement. A metaanalysis of 13 papers reporting sex-disaggregated data was performed to evaluate sex-related differences in diagnostic delay. The global effect of diagnostic delay by sex was calculated using means difference (D) through a fixed effects model.
Results. The review included 23,883 patients (32.3% women) from 42 papers. No significant differences between the sexes were detected for symptoms at disease onset or during evolution. However, the mean for delay in diagnosis of SpA showed sex-related differences, being 8.8 years (7.4–10.1) for women and 6.5 (5.6–7.4) for men (p = 0.01). Only 40% of papers had high quality. A metaanalysis included 12,073 participants (31.2% women). The mean global effect was D = 0.6 years (0.31–0.89), indicating that men were diagnosed 0.6 year (7 months) before women.
Conclusion. Delay in diagnosis of SpA persists, and is longer in women than in men. There are no significant sex-related differences in symptoms that could explain sex-related differences in diagnostic delay. Methodological and possible publication bias could result in sex-biased medical practice.
Spondyloarthritis (SpA) comprises a heterogeneous group of interrelated inflammatory rheumatic diseases that can affect the synovium, enthesis of axial and peripheral joints, and some extraarticular sites. The SpA group includes reactive arthritis, enteropathic arthritis, juvenile SpA, psoriatic arthritis (PsA), and ankylosing spondylitis (AS), which was the first condition to be characterized and is considered the prototype disease. Initially, the different forms were looked upon as independent diseases, but similarities in their clinical characteristics, and later the association of these diseases with HLA-B27 after its description in 1973, proved that they are similar or closely related conditions now jointly recognized as SpA. In earlier studies, AS had been found to be far more common in men by a ratio of 10:1, but once all the different forms were considered in conjunction, this ratio narrowed to 2–3:11,2,3. The disease commences with spinal or lumbar pain, and enthesitis or peripheral arthritis, most often in the lower limbs. Radiological alterations are slow to appear and hence underrecognition or considerable diagnostic delay has been frequent. As a result of the discovery of the association with HLA-B27, the development of classification criteria for the different clinical forms, and probably, closer attention, delay has been reduced in recent years4.
Because of the clearly lower frequency of classical AS in women, the disease was considered a male disease, prompting underrecognition in women and resulting in a longer delay to diagnosis than in men, and this delay continues to be reported when all the different clinical forms are considered together5.
With recent therapeutic advances, early diagnosis and knowledge of sex differences have become indispensable. Several studies have shown that there is a systematic sex bias in healthcare, partially related to biased research hypotheses and/or methods due to insensitivity to sex6.
The most widely used definition of sex bias is the differential medical treatment of men and women, the effect of which may be positive, negative, or neutral7. However, for research purposes, it is useful to define sex bias as a systematic wrong approach regarding sex insensitivity that incorrectly considers women and men as similar in the natural history of disease6. Sex bias in SpA is an interesting area to study8 because of the longer diagnosis delay in women and the causes of that delay.
The aim of our present study was to identify empirical evidence of diagnostic delay in SpA, determine whether sex-related differences persist, and conduct an analysis from a sex perspective of the possible causes, including the influence of quality research, in this group of inflammatory rheumatic diseases.
MATERIALS AND METHODS
Data sources and literature search
A systematic literature search was conducted in MEDLINE and EMBASE through natural language. The key words were spondyloarthropathies, spondyloarthritis, “reiter arthritis”, “reactive arthritis”, “juvenile spondyloarthropathy”, “ankylosing spondylitis”, “enteropathic spondyloarthritis”, (“spondyloarthropathy AND inflammatory bowel disease”), (“spondyloarthritis AND inflammatory bowel disease”), and (“ankylosing spondylitis AND inflammatory bowel disease”). Each of these was individually combined with: (delay OR delayed) AND (sex OR gender OR women OR woman) NOT (psoriatic arthritis). Complementary papers potentially eligible were included, a search through snowballing was performed by cross reference, and 16 studies were identified1,3,8–21. Ultimately, 379 identified papers were consulted. The search included all material published before April 2016, regardless of date of publication.
Study selection
The inclusion criteria were original articles in which SpA was the main subject, which analyzed delay in diagnosis and included women. We excluded papers with PsA patients because, although this disease also shows diagnosis delay, it is less than that found in AS4, probably due to the presence of psoriasis in two-thirds of patients prior to arthritis. In addition, papers were excluded if they did not fulfill the study criteria or consisted of editorials, reviews, case studies, questionnaires, or diagnostic/therapeutic techniques. Non-English and non-Spanish papers were excluded.
We obtained 102 papers from MEDLINE and 261 from EMBASE and 16 from other sources. Of these, 226 were eliminated because they were duplicated, and 111 met the exclusion criteria described above. The other 42 were included in the review (Figure 1).
Data extraction and quality assessment
Two observers (JV and BBM) performed a content analysis of the 42 original articles finally selected, for which an information compilation protocol was established and applied, comprising the following variables:
Patients: total number of patients and number disaggregated by sex, type of SpA (axial, peripheral, or mixed), age at onset of disease, age at diagnosis, duration of disease, HLA-B27 positivity, family history of AS and social/sex-related causes explaining sex differences in delay in diagnosis (family/reproductive and/or productive roles, social factors and/or factors related with medical practice), clinical manifestations at baseline and later (neck and back pain, peripheral arthritis, enthesitis, and uveitis), and delay in diagnosis, which was considered as the time difference between the onset of symptoms and the date of diagnosis by a professional as established by the Assessment of Spondyloarthritis International Society (ASAS).
Study characteristics: design (cross-sectional, cohort, case control), type of cases (incident, prevalent), data source (hospital, patient group), type of data (medical history, questionnaire, register), classification criteria (Rome, New York, modified New York, European Spondyloarthropathy Study Group, and ASAS).
The quality of the studies was determined in line with the Strengthening The Reporting of OBservational studies in Epidemiology (STROBE) statement (www.strobe-statement.org), a checklist of items that should be included in reports of observational studies, related to title and abstract, introduction, methods, results, discussion, and other information. Papers were classified as follows: low quality (< 50%), medium (≥ 50% to < 70%), high (≥ 70% to < 90%), and excellent (≥ 90%).
Concordance between the 2 observers was computed for all the variables included in the review. A third observer (RCMT) evaluated cases of disagreement between observers in the classification of some variables.
Data analysis and synthesis
First, a descriptive analysis was carried out of the distribution of the previously mentioned variables. We calculated means and CI 95% to compare age at onset, age at diagnosis, disease duration, and diagnostic delay in women and men. Further, to compare sex-related differences as regards clinical manifestations at onset and during the course of the disease, we calculated means and CI 95% of the percentages of neck and back pain, peripheral arthritis, enthesitis, and uveitis.
Comparisons of the continuous parametric variables were carried out by Student t test and the nonparametric ones by Mann-Whitney U test. A value of p < 0.05 was considered significant. Statistical evaluations were performed by SPSS 15.0 package program.
A metaanalysis was performed to evaluate sex-related differences in diagnosis delay, in which women were the case group and men the control one. The main selection criteria were papers that reported information about median and SD of diagnosis delay by sex. Twenty-five papers did not offer diagnostic delay disaggregated by sex, 2 papers were excluded because they did not show SD to calculate the CI, and 1 more was excluded because of low quality. Of the remaining 14 papers, in 2 studies, the SD from the data (no. men and women, means for men and women, and statistical significance) were calculated assuming similar variances for men and women. One of the studies22 gave data for 4 subgroups that formed 4 different samples and these were included in the metaanalysis (southern China with juvenile AS, southern China with adult AS, northern China with juvenile AS, and northern China with adult AS).
Seventeen samples were included in the metaanalysis (Figure 1). Because delay in diagnosis is a continuous variable, the means difference (D) and its CI (95%) were used to calculate the global effect of diagnostic delay by sex. Heterogeneity between the studies was measured using the chi-squared statistics test; probability value of p < 0.05 was considered statistically significant.
The I2 statistic was used to quantify the degree of heterogeneity of the 17 samples (14 papers), considering a value over 50% high. Because a heterogeneity was obtained not only by chi-squared test (p < 0.00001) but also by I2 statistic = 95% by the randomized effects model, sensitivity analyses were performed to assess the robustness of the results of the metaanalysis. Specifically, the influence of effect sizes was assessed by deletion of each paper to check heterogeneous data that could affect the overall result. Only 1 paper (Bandinelli, et al23) was identified as the cause of the entire heterogeneity and it was excluded from the metaanalysis. That paper observed less diagnostic delay in women than in men, likely related to higher education, work levels, and peripheral involvement. As Higgins and Green write, if an obvious reason for the outlying result is apparent, the study might be removed with more confidence24. This practice is also carried out in a Cochrane review25.
Then, a second metaanalysis was developed with 16 samples (13 papers). Publication bias was determined by means of a funnel plot. The metaanalysis was performed using Review Manager software (Revman version 5.3; Cochrane Collaboration; http://tech.cochrane.org/revman).
Ethics approval was not required, in accordance with the policy of our institution.
RESULTS
Table 1 shows the main characteristics of the 42 studies on diagnostic delay in SpA included in this systematic review1,2,3,8–18,19,20,21,22,23,26–36,37–47,48. The review included 23,883 patients (67.6% men, 32.3% women). Prevalent cases accounted for all studies; 97.6% used a cross-sectional design and 1 was a cohort study48. Criteria used for patient selection are shown in Table 1.
The mean for delay in diagnosis was given in 3 papers that included women only, and a further 17 studies gave data disaggregated by sex, indicating statistically significant differences between women and men in the mean for delay in diagnosis, with a mean of 8.8 years (7.4–10.1) being obtained for women and 6.5 (5.6–7.4) for men (p = 0.01). Thirty papers gave overall data of delay in diagnosis (9 of them also gave data by sex), and the mean was 6.6 years (5.9–7.3).
There were no detected significant sex differences in mean age at onset of disease, age at diagnosis, and mean disease duration.
HLA-B27 positivity was reported in 26 papers and patients’ family history in 22 papers. In both cases, there was no detected statistical significance.
Although 9 papers mention the concepts of social events21,47, social support21,42, social system21, social situation27,30, effect and consequences11,21, social status15,21,23,28, social domain and function31, and quality of life21,31, they are not in relation to social/sex-related causes explaining sex differences in delay in diagnosis. Only Aggarwal in 2009 adequately discussed the complex range of sex factors involved in the diagnostic process of SpA according to sex8.
Table 2 with data of review and metaanalysis shows that no statistically significant differences were detected between the sexes regarding symptoms at onset of disease, nor were statistically significant differences detected in symptoms during evolution of the disease.
As determined according to the different sections of the STROBE Statement, 11 (26.1%) papers were of low quality, 14 (33.3%) were of medium quality, and 17 (40.4%) were high quality; these latter were published after 2005. The low quality of papers was primarily related to the methods section: 41 papers did not describe any attempts to address potential sources of bias, 33 papers did not explain how sample size was calculated, 30 papers did not describe analytical methods that took sampling strategy into account, 35 papers did not explain how missing data were addressed, 21 papers did not describe any methods used to examine subgroups and interactions, and 20 papers did not present key elements of study design early in the paper. In the title or the abstract, 27 papers did not indicate the study design with a commonly used term. Regarding the discussion section, 13 papers did not discuss study limitations in terms of sources of potential bias or inaccuracy (Table 3).
Figure 2 shows the findings of the first metaanalysis of the 17 samples or subgroups (13 papers + 4 samples22) concerning delay in diagnosis of SpA disaggregated by sex; this included 12,208 participants (3812 women and 8396 men8,17,18,19,21,22,23,28,31,34,36,37,41,42). The mean global effect was D = 0.77 years (−0.58, 2.12); this result was not statistically significant, indicating differences in effects across studies. Therefore, we can say that the average effect reported did not represent the studies adequately because of heterogeneity. Afterward, a sensitivity analysis was performed, and a study23 with a sample size of 91 men and 44 women identified as the cause of the entire heterogeneity was removed, because the diagnostic delay in men (9.909 ± 0.8453) was significantly higher than in women (6.336 ± 1.104; p = 0.0023). In the second metaanalysis, 16 samples (13 papers) with 12,073 participants (3768 women and 8305 men) were included and the mean global effect obtained was D = 0.6 years (0.31, 0.89), which indicates men were diagnosed 0.6 years (7 months and 6 days) before women, and this result was statistically significant. No heterogeneity was detected in included studies regarding delay in diagnosis of AS by sex: chi-squared test (p = 0.57), and I2 statistic = 0%, indicate that there was no variability between studies.
A funnel plot (Figure 3) suggests possible publication bias. The slight asymmetry at the lower left indicates that there is a lack of articles with small samples reporting estimations that women have less diagnosis delay than men. In this case, metaanalysis would estimate less diagnosis delay in men than in women.
DISCUSSION
We found that diagnostic delay in SpA was greater in women than in men. One possible explanation for this delay in women could be that they present different clinical symptoms that might delay the diagnosis, as mentioned in the literature; however, none of the studies reviewed in connection to diagnostic delay detected such sex-related differences in symptoms either at the beginning or during the course of SpA.
Delay in diagnosis of SpA has been described in women5,9,10 and in both sexes6,7,11–16,20,26,27,29,30,32,33,35, 38,39,40,44,45,46, and although Bandinelli23 observed less diagnostic delay in women than in men, in most of the papers where delay was compared between men and women, it was greater in women8,17,18,19,21,22,28,31,34,37,41,42,43,47,48. This implies that patients have had medically unsolved disorders for a long time, have continued searching for a diagnosis and treatment, or are being erroneously diagnosed and managed49. One possible explanation is that dorsal/lumbar pain is a frequent symptom occurring in various conditions, and is one of the most common symptoms attributable to unsolved disorders that challenge physicians’ skills to evaluate patients and give a diagnosis such as SpA as the cause of pain.
This situation is more frequent in women than in men, which is unexpected because the most common clinical manifestations of SpA in both sexes were dorsal and/or lumbar pain followed by peripheral arthritis, with similar prevalence at onset, so diagnostic delay should not have differed.
The longer diagnostic delay in women may be related to a lack of familiarity with the disease in women47,48, because the prototypic SpA, AS, is predominately a male disease and this has received wide attention. Although the disability rate is worse in women than in men at all ages17, women are identified as less disabled and with better spinal mobility than men42. These differences and the slower progression of radiological changes in women may influence earlier diagnosis in men, because the spine is more severely affected in them31. Consequently, the disease is not suspected in women, and a wrong diagnosis or “non-specific symptoms and signs” is given in healthcare records49.
According to the theory of knowledge, underdiagnosed SpA in women might be due to a biased viewpoint of the physician as observer. Some symptoms of SpA, such as low back pain, morning stiffness, and sleep disorders associated with pain are shared with fibromyalgia (FM)18 and have traditionally been assigned low status in the cultural hierarchy of medicine. Often, these symptoms in women are not regarded as having the same meaning as similar symptoms in men and they are not usually considered in the differential diagnosis of SpA, in spite of sometimes coexisting with enthesitis and peripheral arthritis8, sometimes misdiagnosed as FM36,42. Further, SpA in women may be confused and patients referred to gynecology, surgery, gastroenterology36, or psychiatry10.
The quality of the research on SpA has increased since 2005, and the use of Assessment of SpondyloArthritis International Society criteria has improved the operative definition of this family of diseases, a development that will facilitate comparison between the sexes. However, in the papers reviewed, sex-related differences in delay of diagnosis of SpA may be true or artificial owing to methodological reasons, such as the publication bias observed. Moreover, because the most frequent source of information was the hospital, selection bias is also a possibility, which would affect sex-related differences in diagnostic delay because women patients with SpA would not be recognized at the primary healthcare level and would not be referred to rheumatology clinics, where the research was conducted. In addition, there is a lack of information about how study samples were calculated and how missing data were addressed; only 1 paper described attempts to address potential sources of bias38. The low score for quality obtained for the papers was largely related to the methods section. Most of the study designs were descriptive and cross-sectional without strong inferential power. Only 1 paper analyzed a case control study of patients with AS, finding a longer delay in diagnosis in women48. Another methodological challenge is related to the lack of sex-stratified results in most studies of diagnostic delay in SpA; influences of sex were rarely discussed. Studies without sex-stratified findings are easier to conduct; even so, some reflection is required on the part of both authors and editors on the reasons for producing and publishing such a large percentage of these studies.
The results of the studies analyzed present some limitations. They provide only a partial picture of worldwide attempts to identify the causes of sex-related differences regarding diagnostic delay in SpA. Few of the articles presented an empirical analysis of the information, and only one-third of them reported a sex analysis or comparison of diagnostic delay in SpA. In the case of studies focusing exclusively on men, which was an exclusion criterion of this review, this proportion would clearly be even lower. The above reflects the near-absence of scientific studies on this issue; it also restricts our ability as researchers to draw meaningful conclusions about the nature of research on the topic.
Another limitation is that, although SpA has more measures of disease than the ones we compared, such as blood tests, radiographic studies, and functional measures, we chose for this review those most frequently described in the papers such as clinical and demographic characteristics. Regarding juvenile arthritis, we offer data together with adult onset; although there are some papers that include juvenile onset, only 6 of them6,22,30,39,40,44 offer diagnostic delay disaggregated between juvenile and adult onset of disease. About excluding patients with PsA, there is 1 study37 in this review in which some patients in the sample may have this disease.
From a perspective of sex differences, this review was limited to a number of factors related to family history and HLA-B27, age at onset of symptoms, and at diagnosis, and years of diagnostic delay. No analysis was conducted of information reported by the authors about the influence of social conditions on the delay. This aspect is crucial to understanding whether diagnostic delay is related to the sex context of the female patient (i.e., personal/family history related to productive/reproductive roles on the basis of the delay in the demand for healthcare), or due to unfamiliarity with SpA in women by physicians from primary care or other levels of healthcare. The influence of social conditions was not normally included although it was discussed in a few papers8,23,27,30,31,32,34. It is a classic sex bias in which the research approach de-contextualizes women’s health risks, blocking new knowledge about the chain of responsibility, in this case related to sex-related differences in diagnostic delay in SpA.
Serious and important consequences of sex bias have been described for women’s health in relation to clinical management50. It is possible that medical knowledge has been built up over the decades in ways that yield a disease classification more suited to men’s complaints and health problems, and signs or symptoms indicative of specific conditions. Patients classified as having nonspecific signs and symptoms may in fact have an identifiable disease, either at an early stage not identified by contemporary (male-based) diagnostic criteria or because they present an atypical (non-male) set of symptoms for the disease. No treatment, or incorrect treatment, at this point can lead to a worsening of the disease, particularly among women50. There is probably a lack of knowledge about sex-related aspects of SpA in the medical community, with consequent delays in diagnosis and suitable treatment. More information is required about probably unfounded sex differences in diagnostic delay, such as the number of previous erroneous diagnoses and treatments, and doctors visited, all of which are important factors to achieve the principle of equity in healthcare provision.
Acknowledgment
The authors thank Elisa Chilet-Rosell for valuable assistance in designing information compilation protocol and collecting data.
Footnotes
Supported by FEDER-Instituto de Salud Carlos III, Ministerio de Ciencia e Innovación, Spain (grant PI11/00865).
- Accepted for publication October 27, 2016.