Abstract
Objective. To evaluate the utility of the duration of morning stiffness (MS), as a patient-reported outcome (PRO), in assessing rheumatoid arthritis (RA) disease activity.
Methods. We acquired information on 5439 patients in QUEST-RA, an international database of patients with RA evaluated by a standard protocol. MS duration was assessed from time of waking to time of maximal improvement. Ability of MS duration to differentiate RA activity states, based on Disease Activity Score (DAS)28, was assessed by analysis of variance; and a receiver-operating characteristic (ROC) curve was plotted for discriminating clinically active (DAS28 > 3.2) from less active (DAS28 ≤ 3.2) RA. Mixed-effect analysis of covariance (ANCOVA) models were used to assess the utility of adding MS duration to Routine Assessment of Patient Index Data (RAPID)3, a PRO index based on physical function, pain, and general health (GH), in predicting the 3-variable DAS28 (DAS28v3).
Results. MS duration had moderate correlation (r = 0.41–0.48) with pain, Health Assessment Questionnaire, and GH; and weak correlation (r = 0.23–0.39) with joint counts and erythrocyte sedimentation rate. MS duration differed significantly among patients with different RA activity (p < 0.001). The area under the ROC curve of 0.74 (95% CI 0.72–0.75) showed moderate ability of MS duration to differentiate clinically active from less active RA. ANCOVA showed significant interactive effects between RAPID3 and the MS duration categories (p = 0.0005) in predicting DAS28v3. The effect of MS was found to be clinically important in patients with the low RAPID3 scores (< 6) in whom the presence of MS may indicate clinically active disease (DAS28v3 > 3.2).
Conclusion. MS duration has a moderate correlation with RA disease activity. Assessment of MS duration may be clinically helpful in patients with low RAPID3 scores.
- RHEUMATOID ARTHRITIS
- SEVERITY OF ILLNESS INDEX
- QUESTIONNAIRES
- HEALTH STATUS INDICATORS
- MORNING STIFFNESS
Stiffness in the joints after periods of rest is commonly experienced by patients with rheumatoid arthritis (RA) and is assessed clinically by morning stiffness (MS). Morning stiffness is listed in classification criteria1 and is a component of American College of Rheumatology (ACR) remission criteria for RA2. Recognition of MS as a common symptom of RA led to the suggestion that MS might be useful for differentiating RA from noninflammatory joint diseases3. However, MS was found to have poor discriminative ability in differentiating such conditions from RA1,4,5. Further, more than one-third of individuals older than 50 years without RA report MS of > 15 min6. Morning stiffness used to be included as a common outcome measure in RA clinical trials7. Although recent clinical trials of RA do not include MS as an outcome measure, it is still commonly used as an eligibility criterion for participation8.
In clinical practice, MS assessment is used as one of the indicators of RA disease activity. The 2002 ACR guidelines for the management of RA recommend assessment of MS duration as one of the variables for evaluation of the disease activity9. The Rheumatoid Arthritis Disease Activity Index (RADAI)10, one of the 6 composite indices recommended for RA activity assessment in the updated 2008 ACR treatment guidelines for RA11, has MS duration as one of its components. MS duration was found to be the second strongest predictive factor for change of disease-modifying antirheumatic drug therapy in routine clinical care of patients with RA in a tertiary care center12. However, the utility of MS duration as an indicator of inflammatory activity of RA has been questioned. In a study with 93 RA patients, there was no significant difference in MS duration between active and inactive disease as assessed by their treating physician4. In a cohort of 337 patients with early RA, MS duration was associated at higher level with patient-reported measures such as functional status, pain, and patient’s assessment of general health (GH) than joint counts and erythrocyte sedimentation rate (ESR), suggesting that MS may be an inadequate marker of inflammatory activity13. Removal of MS as a selection criterion for “active RA” in clinical trials has been advocated since it had little effect on classification of patients as having active or inactive disease14.
No single variable is considered sufficient to assess RA disease activity. Composite indices [Disease Activity Score (DAS), Simplified Disease Activity Index (SDAI), and Clinical Disease Activity Index (CDAI)] derived from tender joint count, swollen joint count, an acute-phase reactant (except CDAI), patient assessment of GH (in DAS) or patient global assessment of disease activity (PGA; in SDAI and CDAI), and evaluator’s global assessment of disease activity (EGA; in SDAI and CDAI), have been extensively validated for assessment of RA disease activity15–17. However, formal quantitative joint counts, an integral part of these indices, are frequently not performed in routine clinical practice18. Composite indices based solely on 3 patient-reported outcomes (PRO) in the ACR core data set (physical function, pain, and PGA or GH) have been validated and shown to effectively differentiate treatment response and be less susceptible to placebo response in randomized controlled trials of RA19,20. They have also been shown to predict institution of tumor necrosis factor inhibitor therapy and mortality in patients with RA21. There are no data about correlation of MS duration with a composite RA disease activity index based on quantitative joint counts and how well it differentiates disease activity states using such an index. MS duration is easily amenable to assessment as a PRO. However, there are no data whether addition of MS duration to a composite PRO index based on ACR core data set will improve RA disease activity assessment.
The purpose of our study was to clarify the utility of assessing MS in routine clinical practice. Our primary aim was to correlate the severity of MS (as assessed by duration) with activity of RA as assessed by a DAS28, and whether MS duration differentiates the disease activity categories (remission, low, moderate, and high) as classified by DAS28. The secondary aim was to assess utility of addition of MS duration to Routine Assessment of Patient Index Data 3 (RAPID3), a composite index based on PRO in the ACR core data set, in assessment of RA disease activity22.
MATERIALS AND METHODS
Patients
QUEST-RA is an international database of 100 non-selected consecutive RA outpatients in ≥ 3 rheumatology clinics in several countries23. The database was started in 2005 and by January 2008 had information on > 5800 patients from 24 countries24. The patients were assessed by a standard protocol to evaluate RA25.
Clinical information
Demographic information and clinical characteristics of RA were obtained from the database. Duration of RA was categorized as early (≤ 2 yrs) or late (> 2 yrs). MS duration (in min) was queried in the patient self-report questionnaire from the time of waking up to the time to maximal improvement in the stiffness that was experienced over the last week. For statistical analyses, MS duration was categorized as none, mild (1–30 min), moderate (31–60 min), and severe (> 60 min). Functional status was assessed by Health Assessment Questionnaire (HAQ; minimum–maximum: 0–3). The psychological HAQ (PSHAQ; minimum–maximum: 0–3) was used to evaluate psychological distress26. Pain, fatigue, GH, and evaluator global assessment were assessed on 0–10 cm visual analog scale (VAS). RAPID3 was calculated by adding HAQ score (after multiplying by 3.33 to convert scores to a 0–10 scale), pain and GH VAS scores. The scores of < 3, 3–6, 6.1–12, and > 12 have been proposed to represent “near remission” and low, moderate, and high RA activity, respectively22. Tender and swollen joint counts (28 joints) were assessed by the treating physician. ESR was obtained and DAS28 scores were calculated by the formula 0.56 × sqrt(tender28) + 0.28 × sqrt(swollen28) + 0.70 × ln(ESR) + 0.014 × GH15. Since DAS28 and RAPID3 have GH as a common variable, we used DAS28 scores based on 3 variables (DAS28v3), as calculated by the formula [0.56 × sqrt(tjc28) + 0.28 × sqrt(sjc28) + 0.70 × ln(ESR)] × 1.08 + 0.16, when assessing the utility of adding MS duration to RAPID315.
RA disease activity was classified according to DAS28 score as remission (< 2.6), low (2.6 to ≤ 3.2), moderate (3.2 < to ≤ 5.1), and high (> 5.1). The current paradigm of RA management advocates aggressive treatment to achieve low disease activity state27. We also classified RA as clinically active (DAS28 score > 3.2), indicating moderate or high disease activity, and clinically less active (DAS28 score ≤ 3.2), indicating low disease activity or remission, to assess the diagnostic accuracy of MS.
Statistical methods
Data were analyzed using SPSS version 11.0 (SPSS Inc., Chicago, IL, USA) and SAS version 9.2 (SAS Institute Inc., Cary, NC, USA). The variables with skewed distribution (MS duration, tender joint count, swollen joint count, and ESR) were square-root transformed. Relationships between MS duration and demographic and disease activity variables were analyzed by calculating Pearson’s correlation coefficient. Values of Pearson’s correlation coefficient values were interpreted as representing slight (< 0.2), low (0.2–0.4), moderate (0.4–0.7), high (0.7–0.9), and very high (> 0.9) correlation between variables28. Distribution of frequency of MS duration according to RA disease activity by DAS28 score was assessed. Differences in MS duration were compared with Student’s t-test when there were 2 groups. Analysis of variance (ANOVA) was used to study whether MS duration differed among RA disease activity states. Pairwise comparisons, among different RA activity states, were made using Tamhane’s T2 post-hoc test since there was unequal variance between different activity state groups. A receiver-operating characteristic (ROC) curve to assess diagnostic utility of MS duration to differentiate clinically active from less active RA was plotted. The area under the ROC curve indicates overall accuracy in discriminating 2 categorical states and varies from 0.5 to 1.0, where an area of 0.5–0.7 indicates low accuracy, 0.7–0.9 moderate accuracy, and > 0.9 high accuracy29. To further clarify clinical utility of MS duration assessment, positive likelihood ratios for assessing presence of active RA for each of its 4 categories were calculated.
Mixed-effect analysis of covariance (ANCOVA) models were used to assess the utility of adding MS duration to RAPID3 with DAS28v3 as dependent variable. To adjust for potential cultural effects, country of origin was included in these models as a random effect. MS duration was included as a categorical variable with the following groupings: 0 min, 1–30 min, 31–60 min, and > 60 min. The RAPID3 variable was included in the model as a continuous variable. The interaction between MS duration and RAPID3 was also evaluated. To further understand the role of individual PRO, a mixed-effect ANCOVA model was used to assess the strength of association of MS duration and PRO in the RAPID3 (GH, HAQ, and pain) with DAS28v3. All 2-way interactions involving MS duration were examined. Interactions not significant at the 0.10 level were removed, one at a time, in a backward elimination fashion. Finally, Pearson’s and Spearman’s partial correlation coefficients were used to describe the association between DAS28v3 and MS duration (continuous), GH, HAQ, and pain. The correlation between DAS28v3 and a predictor variable was adjusted for the remaining predictors.
RESULTS
The QUEST-RA database had 5848 patients from 24 countries at the time of analysis. This report includes 5439 (93%) patients on whom the information on MS duration was available. Patients whose MS information was available did not differ in DAS28 scores (4.24 vs 4.13; p = 0.2) and EGA rating (2.9 vs 2.8; p = 0.62) compared to those with missing MS information. However, patients with MS missing information had higher age (mean age 60 vs 56 yrs; p < 0.001), HAQ score (1.11 vs 1.01; p = 0.02), pain level (4.5 vs 4.1; p = 0.003), and GH score (4.3 vs 4.1; p = 0.03). Although statistically significant, we considered these differences to be small and not clinically relevant. Patient characteristics are shown in Table 1. The patients were mostly women and in the age group that is typical for RA. Eight hundred thirty patients (14.2%) had early RA. Table 2 shows the correlation of MS duration with different RA variables. There was moderate correlation of MS duration with pain, HAQ score, GH, RAPID3, and DAS28. The correlation with fatigue, PSHAQ, EGA, joint counts, and ESR was weak. There was no significant difference in MS duration between sexes (p = 0.47) and those with early or late RA (p = 0.25).
The distribution of MS duration among different RA activity states assessed by DAS28 scores is shown in Table 3, indicating that as RA activity increases, the percentage of patients with longer MS duration increases. MS duration differed significantly among patients with different RA disease activity states by ANOVA test [F(3,5226) = 273.8, p < 0.001]. As shown in Table 4, all the disease activity groups had significantly different MS duration by Tamhane’s T2 test. The difference was smallest between the remission and low disease activity states. The difference in median MS duration between clinically active and less active RA was 40 min. Figure 1 shows the ROC curve to assess accuracy of MS duration to differentiate active from inactive disease. The area under the curve (AUC) is 0.74 (95% confidence interval 0.72–0.75). There was no significant difference among patients with early and late RA (data not shown). Table 5 shows likelihood ratios of having active disease for different MS duration categories.
The results of a test for fixed effects in an ANCOVA to understand the role of addition of MS duration to RAPID3 in predicting DAS28v3 are shown in Table 6. MS duration was significantly associated with DAS28v3. A highly significant interactive effect between RAPID3 and MS duration categories was revealed. This implies that for each MS category there is a separate line that describes the relationship between RAPID3 and DAS28v3 as shown in Table 7. The significant interaction also implies that differences between MS duration categories with respect to mean DAS28v3 values depend on RAPID3 values. Estimates of DAS28v3 means for the MS duration categories are presented for several RAPID3 values in Table 8. These results are clinically important at the RAPID3 scores (< 6) that are considered to represent low disease activity. In patients with the low RAPID3 scores, presence of MS may indicate presence of clinically active disease (DAS28v3 > 3.2). At higher RAPID3 scores, patients are likely to have clinically active disease irrespective of the MS duration.
The results of test for fixed effects in an ANCOVA to better understand the contribution of MS duration and individual PRO in RAPID3 are shown in Table 9. HAQ was the strongest variable, followed by pain, MS duration, and GH for association with DAS28v3 scores. The pain-by-MS-duration interaction contributed significantly to the model (p = 0.01), while the HAQ-by-MS interaction was not quite significant (p = 0.052). The effect of MS duration is complex and difficult to quantify, as it depends on both pain and HAQ values. What is clear is that MS duration contributes significantly to the model. Table 10 shows results of partial correlation between DAS28v3 with MS duration and RAPID 3 variables. HAQ has the strongest association with DAS28v3 after adjustment for GH, pain, and MS duration. The partial correlations for MS duration and pain are essentially the same, while GH appears to contribute the least.
DISCUSSION
In this study, we showed that MS duration correlates better with physical function, pain, and GH than with quantitative joint counts and ESR. This is consistent with the findings of a previous report13. MS duration correlates with the degree of inflammatory activity in patients with RA and has moderate accuracy in distinguishing clinically active from inactive RA. MS duration made a significant independent contribution and showed an interactive effect with RAPID3 in predicting DAS28v3. As an individual PRO, MS duration contributed more than GH in predicting DAS28v3. Partial correlation statistics also showed MS duration to have stronger correlation with DAS28v3 than GH.
Morning stiffness is a complex symptom. Most patients describe MS either alone or as a combination of difficulty moving, pain, or abnormal sensation of tightness4,30. All our patients were assessed by a standard self-report questionnaire, ensuring consistency in information collected. The questionnaire assessed the MS duration from time of waking up to its maximal improvement. This manner of assessing MS duration has been shown to have the least daily intra-individual variability and to be the best indicator of average MS duration experienced by the patient31.
The pathophysiology of MS in RA has been linked to abnormalities in circadian rhythm of the hypothalamic-pitutary-adrenal (HPA) axis. Abnormally high levels of cytokines like interleukin 6 and tumor necrosis factor-α have been implicated in derangement of the HPA axis, resulting in earlier than normal peak in serum cortisol and inappropriately low or normal cortisol levels in relation to the degree of inflammation due to RA activity32. A modified-release form of prednisone that seeks to restore the normal circadian rhythm of endogenous cortisol has been shown to significantly improve MS duration without affecting other RA variables and disease activity33. This is in concordance with our finding that MS duration has a moderate correlation with RA disease activity.
There is dearth of data on the actual effect of MS on patients with RA, although a recent study showed that severe baseline MS (by VAS) in patients with early RA was predictive of premature retirement. However, when DAS28 and HAQ scores were entered into the analysis, MS lost its predictive power34. Our data do not address this aspect of MS. There is a need to identify which groups of patients are significantly affected and in what manner by MS before using it as a therapeutic target.
Quantitative joint counts are commonly not performed, and frequently the acute-phase reactants are unavailable during the patient’s assessment in routine clinical practice. This has driven the interest in development of composite indices based solely on PRO variables to improve the feasibility and efficiency of quantitative RA activity assessment in a standardized manner. Resources (time and personnel) are limited in clinical practice. Assessment of RA variables that provide maximum information about disease activity in a non-overlapping manner would facilitate most efficient use of clinical resources. As MS duration can easily be inquired in a standardized manner by a patient questionnaire, we were particularly interested in assessing its possible value as part of a composite PRO index. We did this indirectly by assessing whether addition of MS duration to RAPID3 improved its correlation with and prediction of DAS28v3 scores. We used DAS28v3 instead of DAS28 based on 4 variables to avoid confounding by GH as a common shared variable. The results showed that MS duration made significant important contribution in addition to RAPID3 in explaining DAS28v3 variance. However, a highly significant interaction between MS duration and RAPID3 implies that the effect of MS duration on DAS28v3 depends upon level of RAPID3. We estimated DAS28v3 at several RAPID3 levels to understand the clinical implications of these results. Our findings are clinically relevant for patients with low RAPID3 scores (< 6). In these patients, assessment of MS duration would be valuable and may lead to change in assessment of level of RA activity. For patients with higher RAPID3 scores, RA activity remains clinically unchanged irrespective of MS duration.
We recognize several limitations of our findings. First, we have used DAS28 as a surrogate for RA disease activity. Although widely used, DAS28 may not be a completely accurate measure of inflammatory activity of RA. A patient may have tenderness or swelling in several joints while having a DAS28 in the “remission” state35. DAS28 is also more sensitive to ESR, and clinically small changes in ESR may cause changes in DAS2836. Nevertheless, DAS28 has been extensively validated in both clinical trials and clinical practice, and is significantly correlated with disability and radiographic progression37.
Second, our data were collected from a very diverse range of patients in many countries. Although collected in a standardized manner, it is possible that socioeconomic and cultural factors may have affected the assessment, particularly of the PRO. On the other hand, our results derived from a diverse group of patients with RA are more generalizable. While not a perfect solution, we did use mixed-effect ANCOVA models with country of origin as a random effect to adjust for potential cultural effects.
Third, we assessed MS in the form of a time interval. The 2002 ACR guidelines for management of RA do recommend collecting MS information in the form of duration9. However, MS has also been evaluated by other means, such as assessing severity by VAS and a numerical rating scale30,38. Compared with MS duration, MS by VAS has been reported to have a normal distribution, better correlation with RA disease activity, and more responsiveness to treatment effect38. MS assessment by severity scale may need to be studied for its possible utility as a component of a PRO index.
Fourth, our data are based on cross-sectional assessment of patients with RA. We cannot comment upon the responsiveness of MS duration to effective treatment.
Finally, it is important to stress that we assessed the role of MS duration in patients with an established RA diagnosis. MS duration has been shown to be predictive of development of persistent arthritis, and erosive arthritis in patients presenting with early undifferentiated arthritis39. Our results do not preclude other possible applications of MS duration in inflammatory arthritis.
In conclusion, we found that duration of MS correlates better with other PRO than joint counts and ESR. As an individual variable, the duration of MS had a moderate ability to differentiate active from inactive RA. Assessment of duration of MS in patients with low RAPID3 scores may be clinically relevant.
Footnotes
- Accepted for publication June 24, 2009.