Skip to main content
Log in

A two-stage logistic regression model for analyzing inter-rater agreement

  • Articles
  • Published:
Psychometrika Aims and scope Submit manuscript

Abstract

Studies of agreement commonly occur in psychiatric research. For example, researchers are often interested in the agreement among radiologists in their review of brain scans of elderly patients with dementia or in the agreement among multiple informant reports of psychopathology in children. In this paper, we consider the agreement between two raters when rating a dichotomous outcome (e.g., presence or absence of psychopathology). In particular, we consider logistic regression models that allow agreement to depend on both rater- and subject-level covariates. Logistic regression has been proposed as a simple method for identifying covariates that are predictive of agreement (Coughlin et al., 1992). However, this approach is problematic since it does not take account of agreement due to chance alone. As a result, a spurious association between the probability (or odds) of agreement and a covariate could arise due entirely to chance agreement. That is, if the prevalence of the dichotomous outcome varies among subgroups of the population, then covariates that identify the subgroups may appear to be predictive of agreement. In this paper we propose a modification to the standard logistic regression model in order to take proper account of chance agreement. An attractive feature of the proposed method is that it can be easily implemented using existing statistical software for logistic regression. The proposed method is motivated by data from the Connecticut Child Study (Zahner et al., 1992) on the agreement among parent and teacher reports of psychopathology in children. In this study, parents and teachers provide dichotomous assessments of a child's psychopathology and it is of interest to examine whether agreement among the parent and teacher reports is related to the age and gender of the child and to the time elapsed between parent and teacher assessments of the child.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Barlow, W. (1996). Measurement of interrater agreement with adjustment for covariates.Biometrics, 52, 695–702.

    Google Scholar 

  • Cohen, J. (1960). A coefficient of agreement for nominal scales.Educational and Psychological Measurement, 20, 37–46.

    Google Scholar 

  • Coughlin, S.S., Pickle, L.W., Goodman, M.T., & Wilkens, L.R. (1992). The logistic modeling of interobserver agreement.Journal of Clinical Epidemiology, 45, 1237–1241.

    Google Scholar 

  • Fleiss, J.L. (1971). Measuring nominal scale agreement among many raters.Psychological Bulletin, 76, 378–382.

    Google Scholar 

  • Klar, N., Lipsitz S.R., & Ibrahim, J. (2000). An estimating equations approach for modelling kappa.Biometrical Journal, 42, 45–58.

    Google Scholar 

  • Liang, K.Y., & Zeger, S.L. (1986). Longitudinal data analysis using generalized linear models.Biometrika, 73, 13–22.

    Google Scholar 

  • Prentice, R.L. (1988). Correlated binary regression with covariates specific to each binary observation.Biometrics, 44, 1033–1048.

    Google Scholar 

  • Quenouille, M.H., (1956). Notes on bias in estimation.Biometrika, 43, 353–60.

    Google Scholar 

  • Tukey, J.N. (1958). Bias and confidence in not quite large samples.Annals of Mathematical Statistics, 29, 614.

    Google Scholar 

  • Welsh, A.H. (1996).Aspects of statistical inference. New York, NY: Wiley.

    Google Scholar 

  • White, H. (1982). Maximum likelihood estimation of misspecified models.Econometrica, 50, 1–25.

    Google Scholar 

  • Zahner, G.E., & Daskalakis, C. (1998). Modeling sources of informant variance in parent and teacher ratings of child psychopathology.International Journal of Methods in Psychiatric Research, 7, 3–16.

    Google Scholar 

  • Zahner, G.E., Pawelkiewicz, W., DeFrancesco, J.J., & Adnopoz, J. (1992). Children's mental health service needs and utilization patterns in an urban community: An epidemiological assessment.Journal of the American Academy of Child and Adolescent Psychiatry 31, 951–960.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Garrett M. Fitzmaurice.

Additional information

The authors thank the Associate Editor and the referees for their helpful comments and suggestions. We also thank Gwen Zahner for use of data from the Connecticut Child Study, which was conducted under contract to the Connecticut Department of Children and Youth Services. This research was supported by grants HL 69800, AHRQ 10871, HL52329, HL61769, GM 29745, MH 54693 and MH 17119 from the National Institutes of Health.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lipsitz, S.R., Parzen, M., Fitzmaurice, G.M. et al. A two-stage logistic regression model for analyzing inter-rater agreement. Psychometrika 68, 289–298 (2003). https://doi.org/10.1007/BF02294802

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02294802

Key words

Navigation