Original Article
Simulated computerized adaptive test for patients with shoulder impairments was efficient and produced valid measures of function

https://doi.org/10.1016/j.jclinepi.2005.08.006Get rights and content

Abstract

Background and Objective

To test unidimensionality and local independence of a set of shoulder functional status (SFS) items, develop a computerized adaptive test (CAT) of the items using a rating scale item response theory model (RSM), and compare discriminant validity of measures generated using all items (θIRT) and measures generated using the simulated CAT (θCAT).

Study Design and Setting

We performed a secondary analysis of data collected prospectively during rehabilitation of 400 patients with shoulder impairments who completed 60 SFS items.

Results

Factor analytic techniques supported that the 42 SFS items formed a unidimensional scale and were locally independent. Except for five items, which were deleted, the RSM fit the data well. The remaining 37 SFS items were used to generate the CAT. On average, 6 items were needed to estimate precise measures of function using the SFS CAT, compared with all 37 SFS items. The θIRT and θCAT measures were highly correlated (r = .96) and resulted in similar classifications of patients.

Conclusion

The simulated SFS CAT was efficient and produced precise, clinically relevant measures of functional status with good discriminating ability.

Introduction

A recent study by Hart et al. [1] showed how computerized adaptive testing (CAT) can transform the process of estimating latent traits, using simulated CATs to estimate measures of functional abilities for patients with lower extremity impairments by analyzing a patient's performance on a set of items. The CATs precisely estimated measures of lower extremity functional status, with good discriminating ability. Latent traits and abilities (or, as defined in the Hart et al. [1] study, measures of lower extremity functional status) cannot be directly observed but can be estimated by analyzing a person's performance on a set of items [2]. CATs have recently emerged in the medical [3], [4] and rehabilitation [1], [5], [6] fields, and development of CAT measures of function in rehabilitation has been recommended [7], [8], [9].

The advantages of CATs compared to a computer-administered or paper-and-pencil outcomes instruments have been enumerated [10], [11]. From a clinical perspective, however, the most important advantages of CATs are that they provide an efficient alternative to traditional paper-and-pencil or computer-administered tests, allow outcomes data to be collected during the clinical encounter with reduced patient burden, and produce precise estimates of patient ability [1], [6].

Here we describe the construction of a CAT using responses to the original items that were administered in the development of the Flexilevel Scale of Shoulder Function (FLEX-SF) [12], a flexilevel [13], [14] outcomes instrument for patients with shoulder impairments receiving rehabilitation. No studies have assessed the developmental FLEX-SF items for unidimensionality, local independence, or generated a CAT application from these items. Our overall objective was to develop a clinically relevant CAT specific to patients with shoulder impairments using the original developmental FLEX-SF items. Specific objectives were to (i) test unidimensionality and local independence of the developmental FLEX-SF items, (ii) develop a CAT using these items, and (iii) compare how well simulated CAT measures discriminated among patients compared to measures based on the all the FLEX-SF items.

Section snippets

Materials and methods

We conducted a secondary analysis of data collected prospectively from patients with shoulder impairments receiving rehabilitation [12]. For the purposes of this study, we operationally defined shoulder functional status (SFS) as the patient's perception of his or her ability to perform the functional tasks specified in the developmental pool of FLEX-SF items. Functional status is of interest, because many people seek rehabilitation to improve functional deficits caused by shoulder impairments

Unidimensionality and Local Independence

The fit indices for the fit of a one-factor model produced mixed results (Table 2). Using all 60 items, the CFI was acceptable (i.e., CFI > .90), but the RMSEA was not (i.e., RMSEA > .08). While the percentage of item variance accounted for was high and the magnitude of coefficients was strong, the number of absolute residual correlations > 0.10 was higher than desired. A detailed examination of the item pairs with high residual correlations suggested that some item pairs were not locally

Discussion

Results supported unidimensionality and local independence of the 37-item SFS scale and showed that SFS items can be used in a CAT process to generate measures of functional status that should be relevant to clinicians interested in quantifying the functional abilities of their patients. The simulated CAT generated measures of SFS that were precise and discriminated known groups of patients in clinically similar and logical ways. Because θCAT measures were estimated using a mean of 6 SFS items,

Acknowledgments

The authors thank Paul W. Stratford, for assistance with 95% confidence intervals for relative validity estimates. P.K.C.'s work was supported by a K08 AG 022232 grant from the National Institute of Aging. C.R.T.'s work was conducted during a post-doctoral fellowship supported by the Health Services Research and Development Service, Office of Research and Development, Department of Veterans Affairs.

References (57)

  • J.E. Ware et al.

    Practical implications of item response theory and computerized adaptive testing: a brief summary of ongoing studies of widely used headache impact scales

    Med Care

    (2000)
  • J.E. Ware et al.

    Applications of computerized adaptive testing (CAT) to the assessment of headache impact

    Qual Life Res

    (2003)
  • S. Haley et al.

    Extending the frontier of rehabilitation outcome measurement and research

    J Outcome Meas

    (2000)
  • C.A. Velozo et al.

    The use of Rasch analysis to produce scale-free measurement of functional activity

    Am J Occup Ther

    (1999)
  • B.G. Dodd et al.

    Computerized adaptive testing with polytomous items

    Appl Psychol Meas

    (1995)
  • K.F. Cook et al.

    Development and psychometric evaluation of the Flexilevel Scale of Shoulder Function

    Med Care

    (2003)
  • F.M. Lord

    The self-scoring flexilevel test

    J Educ Meas

    (1971)
  • F.M. Lord

    Applications of item response theory to practical testing problems

    (1980)
  • World Health Organization

    International classification of functioning, disability and health

    (2001)
  • R.D. Hays et al.

    Item response theory and health outcomes measurement in the 21st century

    Med Care

    (2000)
  • P.W. Holland

    When are item response models consistent with observed data?

    Psychometrika

    (1981)
  • P.W. Holland et al.

    Conditional association and unidimensionality in monotone latent variable models

    Ann Stat

    (1986)
  • P.R. Rosenbaum

    Testing the conditional independence and monotonicity assumptions of item response theory

    Psychometrika

    (1984)
  • P.F. Lazarsfeld et al.

    Latent structure analysis

    (1968)
  • H. Wainer et al.

    Item response theory, item calibration, and proficiency estimation

  • R.K. Hambleton et al.

    Fundamentals of item response theory

    (1991)
  • W.F. Stout

    A new item response theory modeling approach with applications to unidimensionality assessment and ability estimation

    Psychometrika

    (1990)
  • Cited by (66)

    • The Authors Respond

      2017, Archives of Physical Medicine and Rehabilitation
    View all citing articles on Scopus
    View full text