Original Article
Nondifferential disease misclassification may bias incidence risk ratios away from the null

https://doi.org/10.1016/j.jclinepi.2005.07.013Get rights and content

Abstract

Background and Objective

When estimating incidence risk ratios in follow-up studies, subjects testing positive for the disease at baseline are excluded. Although the effect of disease misclassification on estimated incidence risk ratios has otherwise been extensively explored, the effect of disease misclassification at baseline has not previously been analyzed.

Study Design and Setting

The design was theoretical calculations assuming dichotomous disease and a follow-up study with a baseline and a follow-up examination, analyzed using cumulative incidence. Calculations consider nondifferential misclassification of disease mainly at baseline, but no misclassification of exposure.

Results

Nondifferential misclassification of disease at baseline can lead to bias either away or toward null in estimated cumulative incidence risk ratios. This bias is mainly a function of sensitivity at baseline, because imperfect sensitivity leads to failure to exclude all diseased subjects from the follow-up. Imperfect specificity at baseline has less effect. Bias is increased with high true prevalence of disease and low true incidence. Bias is also increased with large differences in true risk ratios at baseline and at follow-up, because observed incidence risk ratios in the presence of misclassification reflect both the true association at baseline and at follow-up.

Conclusion

Nondifferential disease misclassification at baseline examination of a follow-up study can lead to over- or underestimation of the cumulative incidence risk ratios. The bias can be substantial for disease with low incidence and high prevalence, such as asthma or myocardial infarction. The results underscore the need to select a highly sensitive test for disease at baseline to exclude all diseased subjects from the follow-up.

Introduction

An analysis of incidence is the standard method to analyze the association between baseline exposure and risk of disease in a follow-up study. Cumulative incidence among both exposed and nonexposed is calculated as number of new cases of disease among the population at risk (i.e., for chronic diseases, subjects free of disease at the beginning of follow-up). Excluding subjects with prevalent disease from the follow-up is meant to ensure that the study can assess causality truly prospectively—that is, measured exposure precedes onset of disease. When calculating incidence, it is commonly assumed that there is no error in excluding prevalent cases of disease from the follow-up.

In a follow-up study, nondifferential misclassification of disease during follow-up leads in general to bias toward null in the estimated relative risks [1]. This bias is mainly a function of specificity [1], especially for rare diseases, and in the presence of perfect specificity there is no bias [2].

Effect of misclassification of disease at baseline has rarely been considered. Recently it was suggested that nondifferential misclassification of binary disease can lead to bias either away or toward null in odds ratios for transition probabilities in a follow-up study [3]. In a follow-up study, it was shown that measurement error in continuous outcomes can lead to bias either away or toward null when the analysis is adjusted for the baseline level of the outcome [4], [5]. It has also recently been shown that measurement error can severely bias estimates of incidence of asthma (unpublished data, 2003). To our knowledge, however, the effect of disease misclassification at baseline on the estimated cumulative incidence risk ratios has not previously been analyzed.

Our objective here was to explore the magnitude and determinants of bias in the observed cumulative incidence risk ratios in a follow-up study of a dichotomous disease measured with error at baseline.

Section snippets

Effect of misclassification of disease on estimated incidence

We first illustrate the effect of misclassification of disease on the estimated cumulative incidence (Fig. 1). For added clarity, all possible 16 combinations of true and observed changes in disease status in an analysis of cumulative incidence are also shown in Table 1. Note that we consider only cumulative incidence, assume that misclassification of disease is nondifferential, and do not consider misclassification of exposure. We also assume that risk of remission of disease is independent of

Discussion

Our present results show that nondifferential misclassification of disease at baseline, especially imperfect sensitivity, can lead to bias away or toward null in the observed cumulative incidence risk ratios. This is in contrast to nondifferential misclassification of disease during follow-up, which in general leads to bias toward null [1]. The bias described can be substantial for disease with low incidence and high prevalence, such as asthma or myocardial infarction. The results underscore

Acknowledgments

We thank Pia Verkasalo, MD, for helpful comments on the draft manuscript.

References (23)

  • E.S. Ford et al.

    Prevalence of nonfatal coronary heart disease among American adults

    Am Heart J

    (2000)
  • K. Kuulasmaa et al.

    Estimation of contribution of changes in classic risk factors to trends in coronary-event rates across the WHO MONICA Project populations

    Lancet

    (2000)
  • K.T. Copeland et al.

    Bias due to misclassification in the estimation of relative risk

    Am J Epidemiol

    (1977)
  • C. Poole

    Exception to the rule about nondifferential misclassification

    Am J Epidemiol

    (1985)
  • R. Wolfe et al.

    Transitions in an imperfectly observed binary variable: depressive symptomatology in adolescents

    Stat Med

    (2003)
  • L.E. Chambless et al.

    Methods for assessing difference between groups in change when initial measurements is subject to intra-individual variation

    Stat Med

    (1993)
  • N.D. Yanez et al.

    The effects of measurement error in response variables and tests of association of explanatory variables in change models

    Stat Med

    (1998)
  • W. Buse et al.

    Childhood versus adulthood onset asthma

    Am J Respir Crit Care Med

    (1995)
  • D. Jarvis et al.

    European Community Respiratory Health Survey II Steering Committee. The European Community Respiratory Health Survey II

    Eur Respir J

    (2002)
  • M.A. Jenkins et al.

    Validation of questionnaire and bronchial hyperresponsiveness against respiratory physician assessment in the diagnosis of asthma

    Int J Epidemiol

    (1996)
  • C.M. Albert et al.

    Triggering of sudden death from cardiac causes by vigorous exertion

    N Engl J Med

    (2000)
  • Cited by (25)

    • Randomized clinical trials and observational studies in the assessment of drug safety

      2018, Revue d'Epidemiologie et de Sante Publique
      Citation Excerpt :

      These different biases can distort the interpretation of OS and result in misleading findings [61–66]. Note also that non-differential misclassification does not always result in bias towards the null, as it is sometimes assumed [67,68]. An important observation is that indication and selection bias would theoretically have limited impact when evaluating the risk for unintended or unanticipated effects like unexpected ADRs because the assignment to the treatment is not expected to be affected by these factors [19,69,70].

    • Diagnosing intramammary infection: Controlling misclassification bias in longitudinal udder health studies

      2018, Preventive Veterinary Medicine
      Citation Excerpt :

      The effect of misclassification of disease at baseline to calculate incidence has less frequently been considered. In longitudinal studies, the nondifferential misclassification of disease at baseline, especially imperfect sensitivity, can lead to over- or under-estimation of the observed cumulative incidence risk ratios (Pekkanen et al., 2006). This bias can be significant for disease with a low true incidence, a high true prevalence, a substantial disease duration (i.e. as long as the interval between first and second test), and a poor test Se.

    • The fluctuation of nocturia in men with lower urinary tract symptoms allocated to placebo during a 12-month randomized, controlled trial

      2014, Journal of Urology
      Citation Excerpt :

      Using these questionnaires individuals may report their estimated average nocturia frequency only in whole numbers (0 to 5 or greater). Because nocturia frequency varies from night to night and self-reports are an estimated average during a given period, there is imprecision and variability in any given nocturia report, which may bias incidence estimates in either direction.10 Our results suggest that multiple questionnaire based assessments likely yield more accurate estimates of the true incidence in the setting of a prevalent condition such as nocturia, of which the long-term clinically relevant incidence may be small compared to short-term fluctuation.

    • Accuracy of national mortality codes in identifying adjudicated cardiovascular deaths

      2011, Australian and New Zealand Journal of Public Health
    • Administrative database research infrequently used validated diagnostic or procedural codes

      2011, Journal of Clinical Epidemiology
      Citation Excerpt :

      When outcome variables are subject to nondifferential misclassification, association measures are typically biased toward the null [23]. When nondifferential effects variables used to define cohorts (by excluding prevalent disease at baseline), Pekkanen et al. [24]. demonstrated that association measures could be biased away from the null (with the degree of bias varying by disease incidence and prevalence).

    • Exposure to substances in the workplace and new-onset asthma: an international prospective population-based study (ECRHS-II)

      2007, The Lancet
      Citation Excerpt :

      We excluded people who reported a history of asthma or respiratory symptoms (wheezing or whistling without a cold; ever having had asthma; having been woken by an attack of shortness of breath in the past 12 months) in ECRHS. These exclusions were intended to avoid confounding by baseline health status that could be associated with future risk of disease and risk factors.19 Asthma was defined by the participant either having an asthma attack or use of asthma medication in the 12 months before interview.

    View all citing articles on Scopus
    View full text