Interrater agreement and interrater reliability: key concepts, approaches, and applications

Natasa Gisev; J Simon Bell; Timothy F Chen

doi:10.1016/j.sapharm.2012.04.004

Interrater agreement and interrater reliability: key concepts, approaches, and applications

Res Social Adm Pharm. 2013 May-Jun;9(3):330-8. doi: 10.1016/j.sapharm.2012.04.004. Epub 2012 Jun 12.

Authors

Natasa Gisev¹, J Simon Bell, Timothy F Chen

Affiliation

¹ Faculty of Pharmacy, Pharmacy and Bank Building (A15), The University of Sydney, New South Wales 2006, Australia. natasa.gisev@sydney.edu.au

PMID: 22695215
DOI: 10.1016/j.sapharm.2012.04.004

Abstract

Evaluations of interrater agreement and interrater reliability can be applied to a number of different contexts and are frequently encountered in social and administrative pharmacy research. The objectives of this study were to highlight key differences between interrater agreement and interrater reliability; describe the key concepts and approaches to evaluating interrater agreement and interrater reliability; and provide examples of their applications to research in the field of social and administrative pharmacy. This is a descriptive review of interrater agreement and interrater reliability indices. It outlines the practical applications and interpretation of these indices in social and administrative pharmacy research. Interrater agreement indices assess the extent to which the responses of 2 or more independent raters are concordant. Interrater reliability indices assess the extent to which raters consistently distinguish between different responses. A number of indices exist, and some common examples include Kappa, the Kendall coefficient of concordance, Bland-Altman plots, and the intraclass correlation coefficient. Guidance on the selection of an appropriate index is provided. In conclusion, selection of an appropriate index to evaluate interrater agreement or interrater reliability is dependent on a number of factors including the context in which the study is being undertaken, the type of variable under consideration, and the number of raters making assessments.

Publication types

Review

MeSH terms

Data Interpretation, Statistical
Humans
Observer Variation
Pharmacy Administration
Reproducibility of Results
Research Design / statistics & numerical data*