A two-stage logistic regression model for analyzing inter-rater agreement

Lipsitz, Stuart R.; Parzen, Michael; Fitzmaurice, Garrett M.; Klar, Neil

doi:10.1007/BF02294802

A two-stage logistic regression model for analyzing inter-rater agreement

Articles
Published: June 2003

Volume 68, pages 289–298, (2003)
Cite this article

Psychometrika Aims and scope Submit manuscript

Stuart R. Lipsitz¹,
Michael Parzen²,
Garrett M. Fitzmaurice³ &
…
Neil Klar⁴

387 Accesses
18 Citations
Explore all metrics

Abstract

Studies of agreement commonly occur in psychiatric research. For example, researchers are often interested in the agreement among radiologists in their review of brain scans of elderly patients with dementia or in the agreement among multiple informant reports of psychopathology in children. In this paper, we consider the agreement between two raters when rating a dichotomous outcome (e.g., presence or absence of psychopathology). In particular, we consider logistic regression models that allow agreement to depend on both rater- and subject-level covariates. Logistic regression has been proposed as a simple method for identifying covariates that are predictive of agreement (Coughlin et al., 1992). However, this approach is problematic since it does not take account of agreement due to chance alone. As a result, a spurious association between the probability (or odds) of agreement and a covariate could arise due entirely to chance agreement. That is, if the prevalence of the dichotomous outcome varies among subgroups of the population, then covariates that identify the subgroups may appear to be predictive of agreement. In this paper we propose a modification to the standard logistic regression model in order to take proper account of chance agreement. An attractive feature of the proposed method is that it can be easily implemented using existing statistical software for logistic regression. The proposed method is motivated by data from the Connecticut Child Study (Zahner et al., 1992) on the agreement among parent and teacher reports of psychopathology in children. In this study, parents and teachers provide dichotomous assessments of a child's psychopathology and it is of interest to examine whether agreement among the parent and teacher reports is related to the age and gender of the child and to the time elapsed between parent and teacher assessments of the child.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Literature reviews as independent studies: guidelines for academic practice

Article Open access 14 October 2022

Sascha Kraus, Matthias Breier, … João J. Ferreira

The Trustworthiness of Content Analysis

Qualitative Content Analysis: Theoretical Background and Procedures

References

Barlow, W. (1996). Measurement of interrater agreement with adjustment for covariates.Biometrics, 52, 695–702.
Google Scholar
Cohen, J. (1960). A coefficient of agreement for nominal scales.Educational and Psychological Measurement, 20, 37–46.
Google Scholar
Coughlin, S.S., Pickle, L.W., Goodman, M.T., & Wilkens, L.R. (1992). The logistic modeling of interobserver agreement.Journal of Clinical Epidemiology, 45, 1237–1241.
Google Scholar
Fleiss, J.L. (1971). Measuring nominal scale agreement among many raters.Psychological Bulletin, 76, 378–382.
Google Scholar
Klar, N., Lipsitz S.R., & Ibrahim, J. (2000). An estimating equations approach for modelling kappa.Biometrical Journal, 42, 45–58.
Google Scholar
Liang, K.Y., & Zeger, S.L. (1986). Longitudinal data analysis using generalized linear models.Biometrika, 73, 13–22.
Google Scholar
Prentice, R.L. (1988). Correlated binary regression with covariates specific to each binary observation.Biometrics, 44, 1033–1048.
Google Scholar
Quenouille, M.H., (1956). Notes on bias in estimation.Biometrika, 43, 353–60.
Google Scholar
Tukey, J.N. (1958). Bias and confidence in not quite large samples.Annals of Mathematical Statistics, 29, 614.
Google Scholar
Welsh, A.H. (1996).Aspects of statistical inference. New York, NY: Wiley.
Google Scholar
White, H. (1982). Maximum likelihood estimation of misspecified models.Econometrica, 50, 1–25.
Google Scholar
Zahner, G.E., & Daskalakis, C. (1998). Modeling sources of informant variance in parent and teacher ratings of child psychopathology.International Journal of Methods in Psychiatric Research, 7, 3–16.
Google Scholar
Zahner, G.E., Pawelkiewicz, W., DeFrancesco, J.J., & Adnopoz, J. (1992). Children's mental health service needs and utilization patterns in an urban community: An epidemiological assessment.Journal of the American Academy of Child and Adolescent Psychiatry 31, 951–960.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Biometry and Epidemiology, Medical University of South Carolina, Charleston
Stuart R. Lipsitz
Graduate School of Business, University of Chicago, USA
Michael Parzen
Department of Biostatistics, Harvard School of Public Health, USA
Garrett M. Fitzmaurice
Division of Preventive Oncology, Cancer Care Ontario, Canada
Neil Klar

Authors

Stuart R. Lipsitz
View author publications
You can also search for this author in PubMed Google Scholar
Michael Parzen
View author publications
You can also search for this author in PubMed Google Scholar
Garrett M. Fitzmaurice
View author publications
You can also search for this author in PubMed Google Scholar
Neil Klar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Garrett M. Fitzmaurice.

Additional information

The authors thank the Associate Editor and the referees for their helpful comments and suggestions. We also thank Gwen Zahner for use of data from the Connecticut Child Study, which was conducted under contract to the Connecticut Department of Children and Youth Services. This research was supported by grants HL 69800, AHRQ 10871, HL52329, HL61769, GM 29745, MH 54693 and MH 17119 from the National Institutes of Health.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lipsitz, S.R., Parzen, M., Fitzmaurice, G.M. et al. A two-stage logistic regression model for analyzing inter-rater agreement. Psychometrika 68, 289–298 (2003). https://doi.org/10.1007/BF02294802

Download citation

Received: 28 September 2000
Revised: 29 January 2002
Issue Date: June 2003
DOI: https://doi.org/10.1007/BF02294802

Key words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A two-stage logistic regression model for analyzing inter-rater agreement

Abstract

Access this article

Similar content being viewed by others

Literature reviews as independent studies: guidelines for academic practice

The Trustworthiness of Content Analysis

Qualitative Content Analysis: Theoretical Background and Procedures

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Key words

Navigation

A two-stage logistic regression model for analyzing inter-rater agreement

Abstract

Access this article

Similar content being viewed by others

Literature reviews as independent studies: guidelines for academic practice

The Trustworthiness of Content Analysis

Qualitative Content Analysis: Theoretical Background and Procedures

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Key words

Search

Navigation