Abstract
Objective. To assess gender differences in disease characteristics and treatment responses over time in a disease-modifying antirheumatic drug (DMARD)-naive seropositive early rheumatoid arthritis (RA) cohort.
Methods. Patients with polyarticular disease who were DMARD-naive and had seropositive early RA (< 14 months) were recruited by the Western Consortium of Practicing Rheumatologists. Each patient was examined at study entry, after 6 and 12 months, and yearly thereafter. Clinical and demographic data were collected. We investigated gender differences in baseline disease characteristics and treatment using chi-squared, Mann-Whitney U, and t tests. We used generalized estimating equations (GEE) models for repeated measures to examine whether the rate of change of specific disease outcomes during the first 2 years after DMARD initiation was significantly influenced by gender.
Results. At baseline, men (n = 67) and women (n = 225) had similar disease activity and radiographic damage; men, however, had significantly worse erosion, while women had worse joint space narrowing. Despite similar treatment, women had worse disease progression over the 2-year followup, as assessed by trends in Disease Activity Score 28/erythrocyte sedimentation rate (DAS28-ESR4), physician global scores, and tender joint counts. In the GEE model, gender was significantly associated with the rate of change of DAS28-ESR4 scores (p = 0.009), although not independently associated with disease activity. Self-reported measures (Health Assessment Questionnaire-Disability Index, patient global scores, fatigue, pain) were worse among women at baseline and throughout the study period. Men were more likely to achieve remission.
Conclusion. At baseline, men and women had similar disease activity and joint damage. Responses to treatment over time were better among men in this prebiologic era; women had worse progression despite similar treatment.
Gender differences in rheumatoid arthritis (RA) have been described at multiple levels. Men and women with RA have been found to differ in incidence of the disease1, extent of disability2, arthritis-related pain3, load of genetic risk factors4,5, and even mortality6. Yet the mechanisms by which these differences arise, including the possible involvement of sex hormones7,8, are as yet unclear. Recently, there has been a renewed interest in gender differences in RA within the rheumatology community, with a focus on disease outcomes and responses to treatment.
Studies of gender differences in disease outcomes and responses to treatment in RA have used a variety of study designs as well as patient populations that vary widely in disease duration and severity, ranging from prospective early RA cohorts9,10,11 to cross-sectional patient populations with longstanding disease3,5,12,13. A number of studies have reported worse RA outcomes among women in terms of symptoms, disease activity, and functional capacity, both in early14,15 and late RA3,12,16, although the early RA studies also reported no gender differences in severity in early stages of the disease. In contrast, other studies of longstanding RA have found men to be more likely to have severity markers, such as higher levels of anti-citrullinated peptide antibodies (ACPA) and the HLA-DRB1 shared epitope4,5, and to have worse outcomes13. It is still unclear whether RA occurs in more severe forms among women or men, and whether the disease progresses differently in each gender. Further, there is now emerging evidence that men may be more likely than women to achieve remission, in response to traditional as well as biologic disease-modifying antirheumatic drugs (DMARD)11,17,18,19. There is currently much debate about whether the observed differences are intrinsic to the disease. It has been suggested that the gender differences may be accounted for by differences in duration of the disease between men and women11,20, or may just reflect a gender bias in the reporting of symptoms12.
Given these conflicting findings and views regarding whether and how men and women with RA differ in their disease characteristics and responses to treatment, detailed examination of prospective early RA cohorts may be better suited to explore these issues, while studies of established RA of variable duration may be limited by their cross-sectional design. Thus, in order to determine whether gender differences, if they do exist, are present in the early stages of the disease or appear in later stages, we have investigated such differences in a prospective DMARD-naive early RA cohort (< 14 months’ duration) at baseline21; we also examined differences in disease progression and treatment outcomes between men and women in the longitudinal setting, in this fairly homogeneous cohort limited to patients with positive rheumatoid factor (RF) tests and active polyarticular arthritis.
MATERIALS AND METHODS
Patients with RA were recruited between January 1, 1993, and April 1, 2002, as a joint effort of the Western Consortium of Practicing Rheumatologists, through 29 recruitment centers in the western region of the United States and Mexico, including 4 university medical centers, and 25 community practices21. To enter the observational study, patients had to (1) satisfy the 1987 American College of Rheumatology (ACR) criteria for RA22; (2) be within 14 months of symptom onset; (3) have no prior treatment with a DMARD; (4) have positive titers for rheumatoid factor (RF) antibodies (≥ 40 IU/ml); and (5) have at least 6 swollen joints (of 66) and at least 9 tender joints (of 68).
Approval for the study was obtained from local ethics committees and all participating patients provided informed consent.
Data collection
Each patient was examined by their rheumatologist at the time of entry in the study (baseline), after 6 and 12 months, and yearly thereafter. The rheumatologists collected demographic and clinical data, including joint counts for 68 tender and 66 swollen joints, and physician global assessment at each of these timepoints, as well as data on utilization and outcomes for all DMARD, biologics, and combination therapies used while enrolled in the study. Counts for 28 tender (TJC28) and swollen (SJC28) joints were derived from the recorded 66/68 joint counts. Radiographs of hands/wrists and forefeet were obtained. Patients also completed self-report questionnaires, providing data on their Health Assessment Questionnaire-Disability Index (HAQ-DI), arthritis-related pain on a visual analog scale (VAS), fatigue VAS, and global health and depression scores (Center for Epidemiological Studies Depression Scale, CES-D). Blood samples drawn at baseline and at each followup were used to evaluate levels of RF, C-reactive protein (CRP), and erythrocyte sedimentation rate (ESR). ACPA titers were measured only at baseline from frozen samples of a subset of subjects at Specialty Laboratories (Valencia, CA, USA).
Assessment of disease activity
Disease Activity Scores (DAS28) using 28-joint counts were computed using (1) ESR:
Assessment of radiographs
Radiographs for hands and feet were evaluated for RA joint damage by calculating the erosion and joint space narrowing (JSN) scores using the method developed by Sharp, et al24,25. Total Sharp scores were calculated by adding erosion scores and JSN scores.
Improvement of RA symptoms and remission
Improvement of disease symptoms was determined based on the ACR criteria for 20% and 50% improvement from baseline26,27 at 6, 12, and 24 months. We did not use ACR70 criteria because few patients met these criteria. Patients were considered to be in point remission if their DAS28-ESR4 score was < 2.628.
Statistical analyses
In a cross-sectional analysis at baseline, differences in disease features and treatment prescribed between men and women were examined, using chi-squared tests to compare distributions of categorical variables; t tests were used for normally distributed continuous variables, and the Mann-Whitney U test for those non-normally distributed. For continuous variables, if normally distributed, means and SD are reported, while medians and 25th and 75th percentiles are reported if non-normally distributed.
In the longitudinal analyses, generalized estimating equations (GEE) models for repeated measures, using an independent correlation structure with robust estimation, were used to model differences in disease activity and disease severity measures as well as frequency of ACR improvement or DAS28 remission between men and women, from baseline to 24 months. For all outcomes examined, followup time and gender were included as the main explanatory variables in the GEE model, adjusting for age, baseline disease activity (DAS28-ESR4), and DMARD prescribed at the baseline visit. An interaction term was also included in the model to evaluate whether the rate of change of the outcome over the 2 years of followup was modified by gender. The GEE model with DAS28-ESR4 as the outcome variable was tested with and without adjusting for RF titers, fatigue VAS, and CES-D scores when evaluating the influence of gender and followup time on disease activity over time. All analyses were repeated in the subset of patients who were prescribed DMARD at the baseline visit, to assess responses to treatment among men and women in terms of the outcomes described. The ESR and CRP variables were analyzed using observed values as well as adjusted values obtained after gender-specific published formulae29,30 were applied to adjust for age.
In a sensitivity analysis, we assessed gender differences in loss to followup, and repeated the longitudinal analyses in the subset of patients with complete DAS28-ESR4 data at all timepoints.
All statistical analyses were performed using the Stata software package (Release 10, Stata, College Station, TX, USA).
RESULTS
Gender differences at baseline
A total of 292 patients, consisting of 225 women and 67 men with seropositive early RA, were included at baseline. The main gender differences are summarized in Table 1. As expected from the selection criteria, at study entry, all patients had recent-onset RA with a mean duration of 6.2 months (SD 3.5 mo), and men and women had equally active disease, as assessed by the DAS28-ESR4. Overall, the patients had moderate (men 13%, women 15%) or severe disease activity (men 87%, women 84%), with only 1 woman having mild disease activity. Physician global assessment scores were also similar between genders at baseline, as were the total Sharp scores. Compared to women, men had significantly higher erosion scores, swollen joint counts (SJC28), RF titers, and frequencies of nodules; they were also significantly more likely to have ever smoked. Men tended to have later mean age at onset, slightly longer duration of RA symptoms, and were more likely to have radiographic erosions, higher tender joint counts (TJC28), and higher CRP levels and ACPA titers than women, although these differences were not statistically significant. In contrast, women had significantly worse function (HAQ-DI) and significantly higher patient global scores, fatigue, and depression scores (CES-D). They had worse arthritis-related pain and joint space narrowing scores than men, although these were not statistically significant.
Treatment initiated at the baseline visit and first DMARD
Treatment regimens prescribed at the baseline visit are summarized in Table 1. A DMARD [methotrexate (MTX), sulfasalazine, hydroxychloroquine, or gold] was prescribed at the baseline visit for 76% (221 of 292) of patients, i.e., 72% of men and 77% of women (p = 0.38). In terms of first DMARD used for each patient, including those started after the baseline visit, MTX, as monotherapy or in combination with other DMARD, was the first DMARD in 172 patients [37 (55%) men and 135 (60%) women; p = 0.49], which is consistent with expected practice patterns in this prebiologic era.
Changes in disease outcomes over time
Disease activity
During the first 2 years after baseline, mean DAS28-ESR4 scores showed more improvement in men compared to women (Figure 1a). Similar trends were observed for DAS28-CRP4 scores (Figure 1b). In the GEE model, after adjusting for age, RF titers, fatigue VAS, depression score (CES-D), baseline DAS28-ESR4, and baseline DMARD, gender was found to significantly influence the rate of change of the DAS28-ESR4 (p = 0.009), but was not independently associated with this outcome (p = 0.18; Table 2). Significant predictors for DAS28-ESR4 over the 2-year followup included baseline DAS28-ESR4 (p < 0.0005), followup time (p < 0.0005), fatigue VAS (p < 0.0005), RF titer (p = 0.006), and depression score (p = 0.007). As shown in Figure 1c and 1e, tender joint counts and physician global assessment showed similar trajectories to the DAS28-ESR4, with more improvement among men over time. Swollen joint counts, on the other hand, were significantly higher among men at baseline, but were quite similar in both genders thereafter (Figure 1d). Gender was a significant predictor for swollen joint counts (p = 0.03), but not for tender joint counts and physician global scores. Gender did not seem to influence the rate of change of these outcomes (swollen and tender joint counts and physician global scores) over time.
Self-reported measures
Measures of disease activity, functional capacity, and quality of life obtained by patient self-report, i.e., patient global scores, HAQ-DI scores, fatigue VAS, pain VAS, and CES-D scores, were all higher among women at baseline (Table 1). Although they improved in both genders over time, they remained higher among women (Figure 1f, 1g, and 1h; data not shown for pain VAS and CES-D). Accordingly, female gender was significantly associated with these measures in the GEE models where these were the outcome variables. Only pain VAS was not significantly influenced by gender, although it remained higher among women over time. The mean difference, from the GEE models, in each of these measures between women and men was as follows: patient global VAS scores 9.3 (p = 0.001, 95% CI 3.6, 15.0), HAQ-DI 0.30 (p < 0.0005, 95% CI 0.14, 0.46), fatigue VAS 10.2 (p = 0.002, 95% CI 3.9, 16.5), and CES-D scores 4.6 (p = 0.003, 95% CI 1.6, 7.5). However, gender did not influence the rate of change of these self-reported measures during the 2-year followup.
Acute-phase reactants and RF
A different trend was observed for ESR, CRP, and RF. Although levels of these measures decreased significantly in men and women during the first 6 months after baseline, and continued to decrease among men, mean levels among women increased after 6 months (Figure 1j, 1k). However, gender was not significantly associated with these measures (ESR: p = 0.8; CRP: p = 0.8; RF: p = 0.1), or their rate of change. Similar results were obtained for ESR and CRP after gender-specific published formulae were applied to adjust for age.
Radiographic damage
As shown in Figure 2, radiographic damage, as assessed by total Sharp scores, increased significantly over time in both men and women, with mean scores being consistently higher among women. Interestingly, throughout the study period, men had worse erosion scores, and women had worse JSN scores. In the GEE models with JSN scores as outcome, female gender was a significant predictor for higher JSN scores (p = 0.005), but did not influence the rate of change of this outcome. The JSN scores were also significantly influenced by age (p < 0.0005). Interestingly, after stratifying by gender, age was found to be a significant predictor of JSN scores only among women (p < 0.0005), while among men, there was no association with age (p = 0.28).
ACR improvement and DAS28 remission
In the GEE models, gender did not significantly influence ACR20 or ACR50 improvement or DAS28 remission. Nonetheless, there was a nonsignificant trend for increased proportions of men satisfying the criteria for ACR20 improvement and DAS28 remission throughout the 24-month followup, as shown in Figure 3a and 3c. This trend was, however, not observed for ACR50 improvement (Figure 3b).
Responses to treatment
When the analyses were limited to the subset of 221 patients (48 men, 173 women) who were prescribed DMARD at the baseline visit, similar results were obtained as those described, with women showing worse progression and men showing better improvement in the different outcome measures over time (data not shown).
Sensitivity analyses
A total of 15 men and 62 women were lost to followup: 4 men and 19 women were lost to followup by 6 months, 3 men and 13 women by 12 months, 8 men and 30 women by 24 months. Among the patients who remained in the study, data items for random variables were missing for various timepoints in random patients. Repeating the analyses using a subset of 106 patients (26 men and 80 women) who had complete DAS28-ESR4 data from all visits also yielded results similar to those described above for the entire cohort (Figure 4).
DISCUSSION
Our study is the first to report the longitudinal analysis of a prospective DMARD-naive seropositive early RA cohort assessing how gender influences disease activity, functional disability, and radiographic outcomes, as well as subsequent treatment responses, over time, adjusting for within-patient correlation of data31. Previous studies examining gender differences in RA have mostly used ordinary linear regression and correlation methods to examine the influence of baseline characteristics on disease outcomes after a specified followup time, thus not taking into account within-subject correlation. Our results show that, in this DMARD-naive, seropositive early RA cohort with active polyarticular arthritis, men and women had equally active disease and radiographic joint damage at baseline, although women had higher scores for self-reported measures. Similar proportions of men and women were prescribed DMARD at baseline, or had MTX as their first DMARD during the course of the study. Over time, however, even among patients who initiated DMARD at baseline, disease progression was worse among women, while men showed better responses to treatment. Our results are consistent with the existing evidence that, in early stages of the disease, both men and women have similar disease activity, as shown in a cohort in The Netherlands9, the Swedish BARFOT cohort10,15, and a cohort in Greece32, all with RA duration < 12 months at study entry. In contrast, cross-sectional studies of longstanding RA have reported women having more active disease than men12,17. This difference is most likely due to different progression rates of RA in men and women. As demonstrated in a number of early RA cohorts9,14,15, after about 2 years, disease activity tends to be significantly worse among women compared to men, as we also observed. Thus, all currently available evidence points to similar disease activity in the early stages of the disease, followed by a worse disease course among women over time.
It is unclear why women have higher disease activity than men after the disease has progressed for over a year. As seen from the GEE results, in this early RA cohort, the rate of change of disease activity scores was significantly influenced by gender; interestingly, followup time was a significant predictor — a covariate not often taken into account. In examining components of the DAS28-ESR4, it appears that the main differences between men and women were in ESR, with minor differences in tender joint counts and patient global scores. However, even after adjusting the ESR by the standard correction factor for gender and age29, the reported gender differences in the DAS28-ESR4 at 2 years remained. Further, the trends in improvement of both the DAS28-CRP4 and DAS28-ESR4 scores were significantly better among men in our cohort, suggesting that gender differences in ESR do not explain the observed differences in disease activity, as previously suggested19. It has also been argued that observed differences in disease activity are not due to differences in the disease per se, but arise as a result of gender differences in reporting of disease activity measures12,18. Although the women in our cohort reported worse scores for pain, function, and global health compared to men, this pattern of reporting was consistent throughout the study, including at baseline when there was no gender difference in disease activity. Based on the equation to calculate the DAS28, patient global health scores contribute little to the overall DAS28, while pain VAS and HAQ-DI do not contribute at all, suggesting that differences in these self-reported measures do not account for the increasing difference in disease activity between men and women over time. Further, measures assessed by the physician, i.e., physician global assessment and tender joint counts, also followed the same trends over time as the disease activity scores, with the improvement rates among women starting to slow down after 6 months. Interestingly, an increase in the levels of inflammation markers, i.e., ESR and CRP, as well as in RF levels, was observed among women after 6 months, when disease activity scores started to diverge between men and women. It is possible that women are not as responsive to antiinflammatory medication as men, which might explain the short-lived amelioration in levels of inflammatory markers. This increase in inflammation markers could be related to the increase in tender joint counts and patient global scores, as well as physician global assessment, observed after 6 months, and eventually higher disease activity among women. Increased patient global scores and joint tenderness among women were also observed in the BARFOT cohort and attributed to increased pain perception in women15. It is unclear, however, why swollen joint counts were the same in men and women throughout the study duration if there was increased inflammation among women after 6 months.
Given the higher disease activity among women compared to men over time, it not surprising that men appeared more likely to satisfy the ACR criteria for 20% or 50% improvement, and to achieve DAS28 remission in our cohort. This is consistent with previous findings of increased remission among men, both in the early RA cohorts11,14,18,33 and in well established RA17,18. Although most studies have used the less-stringent DAS28 remission criteria, male gender has been significantly associated with remission regardless of the criteria used18. In addition to these observational studies, a metaanalysis of data from randomized controlled trials comparing MTX to placebo or other DMARD has also suggested that women were less likely to respond to treatment than men33.
Interestingly, in our dataset, men and women had similar radiographic joint damage, calculated as total Sharp scores, throughout the study, resulting from higher proportions of men having radiographic erosions and worse mean erosion scores, and women having worse JSN, as reported from other early RA cohorts9,11,15. Although the mean JSN scores increased in both men and women over 2 years, age was a significant predictor only among women. It is not clear what other factors are involved among men. It is also unclear why nodules were more prevalent among the men in our study. We and others have previously shown that in longstanding severe RA, men are more prone to nodules5,13, while other studies found no such gender differences34. Since the patients in our early RA cohort were DMARD-naive at study entry, increased nodules in men cannot be explained as a side effect of MTX use, but instead appears to be intrinsic to the disease process. Further, men had higher RF titers than women even though all patients, regardless of gender, had been selected to be seropositive for RF. We had previously reported such a gender bias in RF and ACPA in familial RA5. Higher RF titers may be explained in part by the significantly higher proportions of ever-smokers among men, since RF titers appear to be increased among smokers35. The ACPA titers among men in this early RA cohort, on the other hand, were only marginally higher than in women, suggesting that the difference we had previously observed5 could be a feature of familial RA as opposed to sporadic RA.
The study has some limitations. First, the early RA cohort used was clinic-based and was selected for severe RA. It is thus not representative of population-based early inflammatory polyarthritis cohorts. However, men and women were selected using the same criteria; we are not aware of any gender biases in patient selection that may have affected our results. Second, the sample size was small, especially for male patients. Even though the GEE models use all available data, thus increasing the power to detect small associations, the results should be interpreted with caution until replicated in larger early RA datasets. Third, there was some loss to followup and missing data items during the 24-month study period; however, we found no gender biases in the proportions of men and women lost to followup or in the disease activity of those who remained. However, we cannot rule out the possibility of selection bias in the outcome measures of those who were lost to followup. Fourth, we used the DAS28 < 2.6 criterion for remission, which reflects remission at 1 timepoint rather than sustained remission, and is thus less stringent. Since only a few patients satisfied the criteria for ACR remission, we felt that the DAS28 remission was appropriate, and it also allowed our results to be compared with previous reports, most of which have used DAS28 remission. Lastly, the GEE approach does not provide a measure of how well the model fits the data, and hence, the choice of a correlation structure is not always clear. We therefore compared different correlation structures and got similar results.
In summary, based on the results from our cohort, responses to treatment over time in early RA, as assessed by disease outcomes, ACR improvement, and DAS28 remission, appeared to be better among men in the prebiologic era. And although there were no gender differences at baseline in disease activity and radiographic damage, disease progression seemed to be worse among women.
Footnotes
-
D. Jawaheer is supported by a Career Development Award (K01AR053496) from the National Institute of Arthritis and Musculoskeletal and Skin Diseases. The study had previous support from NIH Multipurpose Arthritis and Musculoskeletal Disease Center Grant P60 AR 36834, and from Specialty Laboratories (Valencia, CA, USA).
- Accepted for publication August 12, 2010.
APPENDIX
List of study Collaborators: The Western Consortium of Practicing Rheumatologists: Robert Shapiro, Maria W. Greenwald, H. Walter Emori, Fredrica E. Smith, Craig W. Wiesenhutter, Charles Boniske, Max Lundberg, Anne MacGuire, Jeffry Carlin, Robert Ettlinger, Michael H. Weisman, Elizabeth Tindall, Karen Kolba, George Krick, Melvin Britton, Rudy Greene, Ghislaine Bernard Medina, Raymond T. Mirise, Daniel E. Furst, Kenneth B. Wiesner, Robert F. Willkens, Kenneth Wilske, Karen Basin, Robert Gerber, Gerald Schoepflin, Marcia J. Sparling, George Young, Philip J. Mease, Ina Oppliger, Douglas Roberts, J. Javier Orozco Alcala, John Seaman, Martin Berry, Ken J. Bulpitt, Grant Cannon, Gregory Gardner, Allen Sawitzke, Andrew Lun Wong, Daniel O. Clegg, Timothy Spiegel, Wayne Jack Wallis, Mark Wener, and Robert Fox.