Responsiveness of Single versus Composite Measures of Pain in Knee Osteoarthritis

Abstract

Objective. In rheumatoid arthritis, composite outcomes constructed from a combination of outcome measures are widely used to enhance responsiveness (sensitivity to change) and comprehensively summarize response. Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) pain is the primary outcome measure in many osteoarthritis (OA) trials. Information from other outcomes, such as rescue medication use and other WOMAC subscales, could be added to create composite outcomes, but the sensitivity of such a composite has not been tested.

Methods. We used data from a completed trial of tanezumab for knee OA (NCT00733902). The WOMAC questionnaire and rescue medication use were measured at several timepoints, up to 16 weeks. Pain and rescue medication outcomes were standardized and combined into 3 composite outcomes through principal components analysis to produce 1 score (composite outcome) and their responsiveness was compared to WOMAC pain, the standard. We pooled all treatment doses of tanezumab into 1 treatment group, for simplicity, and compared this to the control group (placebo).

Results. The composite outcomes showed modestly, but not statistically significantly greater responsiveness when compared to WOMAC pain alone. Adding information on rescue medication to the composite improved responsiveness. While improvements in sensitivity were modest, the required sample sizes for trials using composites was 20–40% less than trials using WOMAC pain alone.

Conclusion. Combining information from related but distinct outcomes considered relevant to a particular treatment improved responsiveness, could reduce sample size requirements in OA trials, and might offer a way to better detect treatment efficacy in OA trials.

Key Indexing Terms:

OSTEOARTHRITIS
OUTCOMES
PAIN
SENSITIVITY TO CHANGE
RESPONSIVENESS

Clinical trialists have a tendency to measure many outcomes. Several of these outcomes (often deliberately) cover overlapping “domains,” attempting to ensure that the “signal” of a true change in an outcome following an intervention is identified. Pain is a good example; researchers will often use a variety of similar pain-related outcomes in interventional trials.

Pain is a complex, multidimensional measure^1,2, and generating just 1 scale or item that adequately records most, if not all, aspects of pain is challenging. Further, because pain is strongly related to functional limitation³, the most appropriate pain outcome might cover aspects of both pain and function. The optimal clinical trial pain outcome(s) should additionally be sensitive to change following an intervention, by which we mean the outcome’s ability to detect a change, often also termed an outcome’s responsiveness⁴, discriminating well between a true signal (treatment effect) and noise (random variation).

Composite outcomes are a way of combining (often related) indices or scores to form 1 overall outcome. This approach, which has been used in many disease areas including osteoarthritis (OA)⁵, rheumatoid arthritis^6,7,8, and asthma⁹, may improve the identification of a domain more completely because it takes account of more information than 1 outcome alone. Pain measurement appears particularly suited to this approach, given its complexity. Combining information from several different domains may improve a composite’s ability to detect a change when one truly occurs, and therefore responsiveness may also be improved.

There are several methods for combining outcomes into composites. Some of these facilitate domain coverage; others increase responsiveness. Ideally, the method used should improve both. The simplest method of combining 2 or more outcomes is through summing or averaging them⁵. This method assumes that the constituent outcomes have equal weighting in the composite, and that units from the constituent outcomes are comparable and exchangeable.

A second method of combining multiple outcomes is through the use of weights to assign the “importance” of constituent outcomes. The composite is produced by multiplying each constituent outcome by its weight, and then summing these scores. An example of this is the 28-joint count Disease Activity Score (DAS28)^6,7. Weights can be derived from a variety of sources, including statistical modeling (as with the DAS), but also from group consultation, for example through a Delphi exercise^{10,11,12,13,14}.

Another data-driven approach uses principal components analysis (PCA), a data-reduction method that inherently concentrates as much variance from constituent outcomes into as few factors as possible. This method may produce a composite outcome that more completely identifies the variance from an underlying multidimensional process, such as pain.

Theoretically, combining several outcomes purporting to measure aspects of pain and its consequences, such as function loss and rescue medication use, should increase domain coverage (because each outcome contributes some information about the pain signal) and therefore responsiveness. Because all of the contributing outcomes should measure that same latent factor (pain), the analysis model used should assume a priori a 1-factor solution, rather than generating multiple outcomes. In this way, we can combine all outcomes related to pain into 1 composite outcome, which it is hoped will show maximal responsiveness in pain.

This study sought to combine several pain outcomes using PCA, taken from a large completed clinical trial of a treatment that reduced pain, and compare the relative responsiveness of these composites to the uncombined WOMAC pain subscale score alone, to establish whether the inclusion of additional pain information improves responsiveness following administration of an intervention.

Assessment of responsiveness is optimal in certain trial designs. The ideal trial should contain a treatment arm with an intervention that is known to truly change the construct of interest (pain, for example); a control arm that is known to not truly change the construct of interest; and at least 2 (ideally more) timepoints in both arms, over which the change in each outcome is assessed. The trial we selected had these features. If the outcome of interest is not changed during the study, then it is not possible to assess responsiveness.

MATERIALS AND METHODS

The data used in this study were from a large completed clinical trial of tanezumab in participants with knee OA (NCT00733902). This trial was a 32-week 4-arm parallel-group phase III trial, comparing 3 doses of tanezumab (2.5 mg, 5 mg, or 10 mg) against placebo. Participants were observed at baseline and 2, 4, 8, 12, 16, 24, and 32 weeks; we used data from the 2-week visit to the 16-week visit, because data for rescue medication use were collected only at these visits. For simplicity, we pooled all tanezumab doses (2.5 mg, 5 mg, and 10 mg) into 1 treatment group and compared this to the placebo group. Further details regarding the trial’s design as well as data on unstandardized outcome scores have been published previously¹⁵. Our study is a reanalysis of completed clinical trial data, and is exempt from ethical review under the UK National Health Service Health Research Authority Guidelines.

Single outcomes definition

We used the following pain-related outcomes featured in NCT00733902: Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) pain, stiffness, and function subscales; and number of rescue medication pills taken per week.

Composite outcomes definition

Including information from at least 2 and up to 4 outcomes in each composite gives 11 possible combinations available from which composites could be generated. We generated a total of 3 composite outcomes that were considered the most meaningful and pragmatic of the 11 possible combinations. Composite 1 consisted of the WOMAC pain subscale plus rescue medication. Composite 2 consisted of all 3 WOMAC subscales (pain, stiffness, and function). Composite 3 consisted of all 3 WOMAC subscales, plus the rescue medication outcome. Composite outcomes were derived by including the selected combination of variables in the PCA, which assumed a 1-factor solution. We opted for PCA, given its propensity to maximize the amount of variance identified in the first (and in this case, only) derived component. We assumed that all included outcomes measured different aspects of 1 latent (multidimensional) pain variable, and forcing a 1-component solution therefore ensured that this variable was derived. Angst, et al found that unrestricted factor analysis of individual WOMAC items established new factors that drew from both the pain and function subscales and merged them¹⁶, supporting this idea. It also simplifies the analysis, because it creates only 1 composite outcome rather than allowing many composite factors to be generated in each PCA model. We constructed 3 PCA models, each generating 1 of 3 composite outcomes. Rotation of the factor solution (of any type: varimax, promax, or other) was not indicated in our approach, because a 1-factor solution has only 1 possible orientation.

We pooled data from all study visits in the analysis models (rather than using data from baseline only, for example), assuming that it was best to include the maximum available number of observations in the PCA models.

Analysis approach

All composite outcome measures were compared to the WOMAC pain, assuming this as the standard.

All of the single outcomes (WOMAC pain subscale score, WOMAC function subscale score, WOMAC stiffness subscale score, and number of rescue medication pills taken) were standardized prior to inclusion in the factor analysis models, and the composites (composites 1, 2, and 3 detailed above) were also standardized. Having all variables standardized (as Z scores) allows direct comparison of outcomes with different units.

We used a random-effects panel linear regression (using SAS’s PROC MIXED) to assess change in the standardized outcome score over time, with outcome type, study visit, and treatment group (either tanezumab or placebo) and all possible interactions as predictor variables. Constructing the data in long format, and using outcome type as a categorical dummy-coded variable, allow direct testing for differences in responsiveness between all outcomes in 1 statistical model (Appendix 1). SAS’s PROC MIXED command uses a likelihood-based approach, treating missing observations as missing at random.

We used linear combinations of coefficients from the regression model (using SAS PROC ESTIMATE) to produce the difference in standardized change between the WOMAC pain subscale and each composite outcome, at each study timepoint. This formally tests whether the outcomes differed from the WOMAC pain subscale regarding responsiveness at each of the 5 timepoints in the study.

Statistical analysis used SAS software version 9.3 (SAS Institute Inc.). A nominal alpha level of 0.05 was used for all CI.

RESULTS

Study sample demographics

At baseline, the placebo group (n = 172) comprised of 119 women (69.2%), with a mean age of 62.2 years, Kellgren-Lawrence grades 2, 3, and 4 of 39.5%, 47.7%, and 12.8%, respectively, mean WOMAC pain subscale score (0–10) of 7.1, and mean WOMAC function subscale score (0–10) of 6.6. The pooled tanezumab group (n = 518) at baseline had 301 women (58.1%), with a mean age of 61.4 years, Kellgren-Lawrence grades 2, 3, and 4 of 38.4%, 46.3%, and 14.5%, respectively, mean WOMAC pain subscale score (0–10) of 7.1, and mean WOMAC function subscale score (0–10) of 6.8.

Ten participants had missing observations for all outcomes at the timepoints of interest, giving a total sample size for this analysis of 680, compared with the original trial sample size of 690, with 509 in the pooled tanezumab group and 171 in the placebo group. Data for the 680 included patients could have been collected on 7 outcomes, at 5 timepoints, giving a total of 23,800 possible observations. Of these, 20,597 were actual observed data points, with 3203 observations missing (13.5%).

PCA results

The PCA process generated composites with component loadings shown in Table 1. WOMAC pain and stiffness subscales consistently had the greatest, and indeed equal, loading, closely followed by the WOMAC function subscale. When all 3 WOMAC subscale variables were included in the PCA model (in composite 3), the rescue medication’s loading dropped considerably.

View this table:

Table 1.

Pain outcome combinations used to create composites, with component loadings.

Composite outcome performance

All composites showed responsiveness greater than at least some of their constituent outcomes on their own, and this difference was consistent across many timepoints (Figure 1). Composite 1 showed consistently greater responsiveness than the WOMAC pain subscale alone. The remaining 2 composites displayed responsiveness greater than all other constituent outcomes, except the WOMAC stiffness subscale. None of the single or composite outcomes showed consistently statistically significantly better responsiveness than that observed in the WOMAC pain subscale at the chosen alpha level (Table 2).

Figure 1.

Sensitivity to change of single pain-related outcomes from the tanezumab trial. Values plotted are the control-treatment differences in standard score, at different study timepoints. More negative values indicate increased sensitivity to change. WOMAC: Western Ontario and McMaster Universities Osteoarthritis Index; P/S/F: WOMAC subscales pain, stiffness, and function.

View this table:

Table 2.

The difference in standard score between the control group and the treatment group (responsiveness), in the variables of interest and WOMAC pain subscale, at different timepoints in the study. Values are mean (95% CI) and p value.

We next examined the effect of the observed differences in responsiveness on sample size requirements for a hypothetical new trial featuring the same design (Table 3). For example, the WOMAC pain subscale between-groups standardized change at 4 weeks was a difference of −0.37. A hypothetical new trial of identical design observing this between-group difference for the WOMAC pain outcome would require 236 participants (118 per group) to achieve 80% power with a 2-sided 5% type I error rate. In contrast, using composite 1 (i.e., including information on rescue medication as well as the WOMAC pain subscale score) as the primary outcome, which had an observed difference at 4 weeks of −0.41, the same trial would need 190 participants (95 per group) to achieve 80% power with this difference — a saving of 46 participants. When the observed differences between treatments are smaller, the reduction in sample size was more extreme: the WOMAC pain difference at 16 weeks (−0.26) would require 476 participants for 80% power in a hypothetical new trial, compared to only 364 participants when using composite 1 (using the observed difference of −0.29), a saving of 112 participants.

View this table:

Table 3.

Observed responsiveness (standardized placebo/tanezumab difference) for featured pain outcomes at trial timepoints, and required sample sizes for a hypothetical new trial of the same design, using these observed effect sizes.

DISCUSSION

We found that composite outcomes generally had moderately greater responsiveness in a large OA trial than WOMAC pain — the usual standard outcome of these trials. That suggests that if 1 of these composite outcomes were used as the primary outcome in an OA trial, fewer subjects would be needed to demonstrate treatment efficacy.

The improvements in responsiveness did not meet the criteria for a statistically significant difference, but perhaps a more salient measure of their import was to determine what effect using these outcomes had on the sample size needed to be likely to show statistically significant effects of treatment versus placebo. We found that the reduction in sample size was substantial, ranging from roughly 20% to 40%. Thus, composites could substantially diminish the sample sizes needed in an OA trial whose main outcome is pain.

Eigenvalues from the 3 composite models all were much greater than the 1.0 cutoff typically used to select retained factors in a PCA model¹⁷, and a large proportion of the variance in the outcomes was identified by the first component in the PCA model, as anticipated (Table 1). The second factor listed in the model output (which was not extracted in this analysis) in all cases had an eigenvalue much less than 1, lending support to the idea that the selected correlated outcomes are well measured in 1 “pain” component.

Rescue medication use, while contributing to the “pain” component the least (Table 1), appeared to still improve responsiveness: composites including this outcome — composites 1 (WOMAC pain plus rescue medication use) and 3 (WOMAC pain, stiffness, and function, plus rescue medication) — showed slight improvements in responsiveness compared with composite 2, which excluded rescue medication.

Aside from the methods used to combine outcomes, the method chosen to assess responsiveness is also important^18,19. Several methods are commonly cited to quantify responsiveness: the standardized response mean²⁰, the effect size¹⁸ (either Glass’s Δ²¹ or Cohen d²², depending on the SD used), or Guyatt’s responsiveness index²³. All of these methods have 2 important limitations. First, all methods calculate responsiveness over 2 timepoints, and cannot easily be generalized to a study that has 3 or more timepoints. This prevents assessment of how responsiveness may fluctuate over time, and limits the definition of responsiveness only to the magnitude of change relative to its variance, rather than the speed of response. Second, these methods do not directly assess statistical inference; differences in responsiveness coefficients are assessed descriptively only. Methods have been proposed (modified jackknife procedure^5,24,25, bootstrapping²⁶) to address this issue, but other methods that directly perform statistical inference as part of the method generating the coefficient are desirable.

Our approach made use of Z scores (standard scores)²⁷. Converting each outcome’s absolute score to a Z score allows direct comparison of change in an outcome at different timepoints, thereby allowing direct assessment of change over time, and direct comparison between different outcomes. This methodology has been used previously to compare noncomposite outcomes²⁸.

The PCA approach assumes that an intervention will alter several related aspects of a common construct, therefore combining all the multidimensional aspects of pain to form 1 outcome should increase responsiveness. However, if 1 aspect of pain is changed alone, then the inclusion of other aspects of pain that do not change may decrease the sensitivity of the composite. Our finding that the WOMAC stiffness subscale was the most sensitive outcome may fit this explanation: It may be, at least in this trial, that the WOMAC stiffness subscale was the closest correlate to the actual latent pain factor altered by the treatment, hence the greatest responsiveness, and inclusion of other subscales or rescue medication eroded it. Our finding may be limited to tanezumab alone, because the agent’s anti-nerve growth factor may have a greater effect on the stiffness sensation than other pain subscales^15,29.

Freemantle, et al provide a comprehensive discussion on the use of composite outcomes in clinical trials³⁰, highlighting how composite outcomes can obfuscate changes in constituent outcomes. This is particularly problematic when outcomes are unrelated (for example, a composite that combines cardiovascular events and mortality), although they note the statistical advantages (increased power and sensitivity) that arise through the construction of composites^30,31. This discussion highlights how both the outcomes used in the composite, and the method by which they are combined, are important. Our present study combined the 3 WOMAC subscales (pain, stiffness, and function) into 1 composite outcome. We assumed that these 3 subscales were all aspects of the same construct (pain). The PCA (Table 1) produced extremely high factor loading in all 3 subscales, supporting this notion — at least in this trial. In contrast, if pain and function were discrete constructs, then the PCA should fail, with either pain or function alone loading on the latent factor. Both Angst, et al and Ryser, et al found close association between pain and function WOMAC subscales, partly supporting this finding^16,32. In addition, an item overlap analysis on the WOMAC pain and function subscales by Stratford and Kennedy³³ found significant item redundancy between the pain and function subscales, and a further factor analysis on the WOMAC items found clustering of items not by subscale, but by activity³⁴, suggesting that the WOMAC subscales are not distinct.

We surmised that responsiveness in the outcomes may differ over time, as well as in magnitude. In our study, all outcomes appeared to have responded at the same timepoint, and retained their relative positions consistently over time (none of the outcome’s trajectories crossed over each other over time; Figure 1).

There are limitations to this analysis. We observed only very few statistically significant differences between outcomes. The trial was designed to observe a difference in the primary outcome between treatment groups (a relatively large difference), and was not designed to compare treatment differences between outcomes (much smaller differences). Therefore, even the large sample size in the trial provides relatively low power to detect differences between outcomes. Ideally, in the future this analysis would be designed into the trial prior to commencement, with appropriate sample size and power. We also allowed many interaction effects, which increased model-to-data fit at the expense of statistical power. We have assumed in this analysis that the covariate structure of the pain outcomes, and the relationship between the outcomes and the latent (unobserved) pain outcome are consistent between studies, and therefore generalizable across other studies. This is a relatively strong assumption, requiring validation in other datasets to allow wider generalization to other trials with confidence.

While the aim of this approach was to include additional information on pain from rescue medication data, this outcome may not be optimal. Rescue medication is a challenging variable to collect data on accurately, and therefore the likelihood is that measurement error of this variable is high. This may provide an explanation for why the improvement in sensitivity of composites including rescue medication is small.

Even though the between-outcome differences were not statistically significant, even a small improvement in responsiveness can affect sample size calculations (Table 2). This produces gains in efficiency without collecting any novel data simply by reanalyzing the data, using a method that produces a more sensitive, and therefore efficient, outcome. We could have included further assessment of other composites made from different combinations of the 11 possible from the 4 single outcomes used, for example one using WOMAC pain plus WOMAC stiffness. We opted to create the 3 composites that would have the most pragmatic effect on outcome inclusion/exclusion when designing a trial. The alternative, generating all 11 possible combinations and comparing them head to head, would further reduce the statistical power to discern differences between composite outcomes.

The PCA approach to generating a composite outcome by its nature produces a unitless score. While the generated score may have increased responsiveness compared to one of the constituent outcomes, it is more difficult to ascertain the clinical importance of the observed effect, in comparison to another outcome with meaningful units and an agreed minimally clinical importance difference (MCID). A downside of PCA composites is the absence of known values of MCID, but this could be established if a specific composite were widely used.

The choice of primary and secondary outcomes in this trial limited the choice of outcomes available to combine into a composite. Ideally, we would have preferred to use a trial featuring a wider range of pain outcomes, particularly the more recent Knee Injury and Osteoarthritis Outcome Score³⁵ and Measure of Intermittent and Constant Osteoarthritis Pain³⁶ questionnaires; however, a dataset using these outcomes, among others, and featuring the other requirements was not available.

Our present findings are similar to our previous paper, which used data from 2 other completed clinical trials of non-drug interventions²⁸. In both of these trials, the WOMAC stiffness subscale also showed an increased but not statistically significant degree of responsiveness compared to the other 2 WOMAC subscales. Angst, et al, in contrast, found the WOMAC pain subscale to be the most sensitive outcome to change^5,24, however these studies did not examine rescue medication, and used only a 2-timepoint approach. Further, the 2 studies previously analyzed were both prospective cohort studies lacking a control group. Thus, optimizing the detection of treatment effect over placebo was not possible in the 2 Angst, et al analyses.

We investigated whether collapsing several measures of a multidimensional construct into 1 composite outcome through the use of the PCA could help improve responsiveness following an intervention. Adding rescue medication alongside other elements of the WOMAC showed improved responsiveness, greater than the constituent outcomes.

Acknowledgment

We acknowledge the contributions of Pfizer in allowing our team access to the completed trial datasets, and their help and support in using their trial analysis platform; we specifically thank Pamela Singletary, Daireen Garcia, Glenn Pixton, and Michael Smith. We also acknowledge the contributions of the Research in Osteoarthritis Manchester (ROAM) team to this project. The ROAM group is supported by the Manchester Academic Health Sciences Centre. The authors acknowledge the assistance given by Séamus Byers, Contracts, IT Services, and the use of the Computational Shared Facility at The University of Manchester.

APPENDIX 1. Analysis model terms

The full model used was: $\text{[math]}$ where y_itj = standardized score; X_itj1 = treatment group; X_itj2 = outcome type (the categorical data outlined above, which was coded in the form of dummy variables); X_itj3 = study visit (either 2, 4, 8, 12, or 16 weeks, coded as dummy variables) ; X_itj4 = treatment group × outcome interaction; X_itj5 = treatment group × study visit interaction; X_itj6 = outcome type × study visit interaction; X_itj7 = treatment group × outcome type × study visit interaction; μ = model intercept, u_i = subject-level random effect, and W_it = error. This model included a total of 4 types of interaction effects (3 two-way interactions, and one 3-way interaction), which allows the greatest number of degrees of freedom regarding modeling the different outcomes over time, and therefore makes no prior assumptions about treatment trajectories, at the cost of power to detect differences.

Footnotes

MJP, MJC, ML, and DTF receive salary support from the UK National Institute for Health Research (NIHR), as part of the Manchester Musculoskeletal NIHR Biomedical Research Unit Grant. LT is an employee of Pfizer Inc. and owns stock or stock options in it. Arthritis Research UK also continues to support the Centre for Epidemiology (grant number 20380). This report includes independent research supported by (or funded by) the NIHR Biomedical Research Unit Funding Scheme. The views expressed in this publication are those of the author(s) and not necessarily those of the NHS, the NIHR, or the UK Department of Health. Prof. Felson is supported by NIH grant AR4778.

Accepted for publication January 19, 2018.

REFERENCES

1.
1. Williams AC,
2. Craig KD
. Updating the definition of pain. Pain 2016;157:2420–3.
2.
1. Mease PJ,
2. Hanna S,
3. Frakes EP,
4. Altman RD
. Pain mechanisms in osteoarthritis: Understanding the role of central pain and current approaches to its treatment. J Rheumatol 2011;38:1546–51.
3.
1. Neogi T
. The epidemiology and impact of pain in osteoarthritis. Osteoarthr Cartil 2013;21:1145–53.
4.
1. Dworkin RH,
2. Turk DC,
3. Farrar JT,
4. Haythornthwaite JA,
5. Jensen MP,
6. Katz NP,
7. et al.
Core outcome measures for chronic pain clinical trials: IMMPACT recommendations. Pain 2005;113:9–19.
5.
1. Angst F,
2. Aeschlimann A,
3. Steiner W,
4. Stucki G
. Responsiveness of the WOMAC osteoarthritis index as compared with the SF-36 in patients with osteoarthritis of the legs undergoing a comprehensive rehabilitation intervention. Ann Rheum Dis 2001;60:834–40.
6.
1. van der Heijde DM,
2. van ’t Hof MA,
3. van Riel PL,
4. Theunisse LA,
5. Lubberts EW,
6. van Leeuwen MA,
7. et al.
Judging disease activity in clinical practice in rheumatoid arthritis: first step in the development of a disease activity score. Ann Rheum Dis 1990;49:916–20.
7.
1. van der Heijde DM,
2. van’t Hof MA,
3. van Riel PL,
4. van Leeuwen MA,
5. van Rijswijk MH,
6. van de Putte LB
. Validity of single variables and composite indices for measuring disease activity in rheumatoid arthritis. Ann Rheum Dis 1992;51:177–81.
8.
1. Ibrahim F,
2. Tom BD,
3. Scott DL,
4. Prevost AT
. A systematic review of randomised controlled trials in rheumatoid arthritis: the reporting and handling of missing data in composite outcomes. Trials 2016;17:272.
9.
1. Cloutier MM,
2. Schatz M,
3. Castro M,
4. Clark N,
5. Kelly HW,
6. Mangione-Smith R,
7. et al.
Asthma outcomes: composite scores of asthma control. J Allergy Clin Immunol 2012;129 (3 Suppl):S24–33.
10.
1. Gossec L,
2. Paternotte S,
3. Aanerud GJ,
4. Balanescu A,
5. Boumpas DT,
6. Carmona L,
7. et al.
Finalisation and validation of the rheumatoid arthritis impact of disease score, a patient-derived composite measure of impact of rheumatoid arthritis: a EULAR initiative. Ann Rheum Dis 2011;70:935–42.
11.
1. Dechartres A,
2. Albaladejo P,
3. Mantz J,
4. Samama CM,
5. Collet JP,
6. Steg PG,
7. et al.
Delphi-consensus weights for ischemic and bleeding events to be included in a composite outcome for RCTs in thrombosis prevention. PLoS One 2011;6:10–2.
12.
1. Rogozinska E,
2. D’Amico MI,
3. Khan KS,
4. Cecatti JG,
5. Teede H,
6. Yeo S,
7. et al.
Development of composite outcomes for individual patient data (IPD) meta-analysis on the effects of diet and lifestyle in pregnancy: A Delphi survey. BJOG An Int J Obstet Gynaecol 2016;123:190–8.
13.
1. Monchaud C,
2. Marin B,
3. Estenne M,
4. Preux PM,
5. Marquet P
. Consensus conference on a composite endpoint for clinical trials on immunosuppressive drugs in lung transplantation. Transplantation 2014;98:1331–8.
14.
1. Tong BC,
2. Huber JC,
3. Ascheim DD,
4. Puskas JD,
5. Ferguson TB Jr,
6. Blackstone EH,
7. et al.
Weighting composite endpoints in clinical trials: essential evidence for the heart team. Ann Thorac Surg 2012;94:1908–13.
15.
1. Brown MT,
2. Murphy FT,
3. Radin DM,
4. Davignon I,
5. Smith MD,
6. West CR
. Tanezumab reduces osteoarthritic knee pain: results of a randomized, double-blind, placebo-controlled phase III trial. J Pain 2012;13:790–8.
16.
1. Angst F,
2. Ewert T,
3. Lehmann S,
4. Aeschlimann A,
5. Stucki G
. The factor subdimensions of the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) help to specify hip and knee osteoarthritis. A prospective evaluation and validation study. J Rheumatol 2005;32:1324–30.
17.
1. Kaiser HF
. The application of electronic computers to factor analysis. [Internet. Accessed March 5, 2018.] Available from: www.garfield.library.upenn.edu/classics1986/A1986E107600001.pdf
18.
1. Stratford PW,
2. Riddle DL
. Assessing sensitivity to change: choosing the appropriate change coefficient. Health Qual Life Outcomes 2005;3:23.
19.
1. Norman GR,
2. Wyrwich KW,
3. Patrick DL
. The mathematical relationship among different forms of responsiveness coefficients. Qual Life Res 2007;16:815–22.
20.
1. Liang MH,
2. Fossel AH,
3. Larson MG
. Comparisons of five health status instruments for orthopedic evaluation. Med Care 1990;28:632–42.
21.
1. Hedges LV,
2. Olkin I
. Statistical methods for meta-analysis. Orlando: Academic Press Inc.; 1985.
22.
1. Cohen J
. Statistical power analysis for the behavioral sciences. Amsterdam: Elsevier; 1988:567.
23.
1. Guyatt G,
2. Walter S,
3. Norman G
. Measuring change over time: assessing the usefulness of evaluative instruments. J Chronic Dis 1987;40:171–8.
24.
1. Angst F,
2. Verra ML,
3. Lehmann S,
4. Aeschlimann A
. Responsiveness of five condition-specific and generic outcome assessment instruments for chronic pain. BMC Med Res Methodol 2008;8:26.
25.
1. Angst F,
2. Goldhahn J,
3. Drerup S,
4. Aeschlimann A,
5. Schwyzer HK,
6. Simmen BR
. Responsiveness of six outcome assessment instruments in total shoulder arthroplasty. Arthritis Rheum 2008;59:391–8.
26.
1. Spadoni GF,
2. Stratford PW,
3. Solomon PE,
4. Wishart LR
. The evaluation of change in pain intensity: a comparison of the P4 and single-item numeric pain rating scales. J Orthop Sport Phys Ther 2004;34:187–93.
27.
1. Kirkwood BB,
2. Sterne J
. Essential medical statistics. Malden, MA: Blackwell Science; 2003.
28.
1. Parkes MJ,
2. Callaghan MJ,
3. O’Neill TW,
4. Forsythe LM,
5. Lunt M,
6. Felson DT
. Sensitivity to change of patient-preference measures for pain in patients with knee osteoarthritis: data from two trials. Arthritis Care Res 2016;68:1224–31.
29.
1. Lane NE,
2. Schnitzer TJ,
3. Birbara CA,
4. Mokhtarani M,
5. Shelton DL,
6. Smith MD,
7. et al.
Tanezumab for the treatment of pain from osteoarthritis of the knee. N Engl J Med 2010;363:1521–31.
30.
1. Freemantle N,
2. Calvert M,
3. Wood J,
4. Eastaugh J,
5. Griffin C
. Composite outcomes in randomized trials: greater precision but with greater uncertainty? JAMA 2003;289:2554–9.
31.
1. Freemantle N,
2. Calvert MJ
. Interpreting composite outcomes in trials. Br Med J 2010;341:c3529.
32.
1. Ryser L,
2. Wright BD,
3. Aeschlimann A,
4. Mariacher-Gehler S,
5. Stucki G
. A new look at the Western Ontario and McMaster Universities Osteoarthritis Index using Rasch analysis. Arthritis Care Res [Internet] 1999;12:331–5.
33.
1. Stratford PW,
2. Kennedy DM
. Does parallel item content on WOMAC’s pain and function subscales limit its ability to detect change in functional status? BMC Musculoskelet Disord 2004;5:17.
34.
1. Kennedy D,
2. Stratford PW,
3. Pagura SMC,
4. Wessel J,
5. Gollish JD,
6. Woodhouse LJ
. Exploring the factorial validity and clinical interpretability of the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC). Physiother Can 2003;55:160–8.
35.
1. Roos EM,
2. Lohmander LS
. The Knee injury and Osteoarthritis Outcome Score (KOOS): from joint injury to osteoarthritis [review]. Health Qual Life Outcomes 2003;1:64.
36.
1. Hawker GA,
2. Davis AM,
3. French MR,
4. Cibere J,
5. Jordan JM,
6. March L,
7. et al.
Development and preliminary psychometric testing of a new OA pain measure - an OARSI/OMERACT initiative. Osteoarthr Cartil 2008;16:409–14.

Main menu

User menu

Search

Responsiveness of Single versus Composite Measures of Pain in Knee Osteoarthritis

Abstract

MATERIALS AND METHODS

Single outcomes definition

Composite outcomes definition

Analysis approach

RESULTS

Study sample demographics

PCA results

Composite outcome performance

DISCUSSION

Acknowledgment

APPENDIX 1. Analysis model terms

Footnotes

REFERENCES

Content

Resources

Subscribers

More