Abstract
Simple statistical models are used to illustrate two important issues that arise in the analysis of grouped data. The first is the consequence of grouping continuous data and analyzing the resulting contingency table. Specifically, an expression is derived for the loss of power incurred when an odds ratio is used to assess risk measured by a continuous variable. The second is the consequence of employing correlation and regression coefficients to analyze summary variables derived from grouped data (ecologic data). An expression is given that shows the magnitude of the bias (the ecologic fallacy) resulting from the analysis of a specific type of grouped data.
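The two issues can be illustrated numerically. The following is a minimal simulation sketch in Python and does not reproduce the expressions derived in the article. It uses correlation coefficients in place of the odds ratio, and every name and parameter value in it (pearson, dichotomization_demo, ecologic_demo, the slope, the group sizes, the seeds) is an assumption chosen for illustration only.

```python
import random
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def dichotomization_demo(n=20000, slope=0.5, seed=1):
    """Issue 1 (illustrative): group a continuous exposure into two classes.

    Returns the correlation of the outcome with the continuous exposure
    and with the same exposure split into 0/1 at zero.
    """
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in range(n)]
    y = [slope * xi + rng.gauss(0, 1) for xi in x]  # assumed linear model
    x_split = [1.0 if xi > 0 else 0.0 for xi in x]
    return pearson(x, y), pearson(x_split, y)


def ecologic_demo(n_groups=20, n_per_group=200, seed=2):
    """Issue 2 (illustrative): correlate group means instead of individuals.

    Within each group x and y are generated independently; they share
    only a group-level shift. Returns the individual-level correlation
    and the correlation between the group means.
    """
    rng = random.Random(seed)
    xs, ys, group_x, group_y = [], [], [], []
    for g in range(n_groups):
        level = g / (n_groups - 1)  # assumed group-level shift from 0 to 1
        gx = [level + rng.gauss(0, 1) for _ in range(n_per_group)]
        gy = [level + rng.gauss(0, 1) for _ in range(n_per_group)]
        xs.extend(gx)
        ys.extend(gy)
        group_x.append(mean(gx))
        group_y.append(mean(gy))
    return pearson(xs, ys), pearson(group_x, group_y)


if __name__ == "__main__":
    r_cont, r_split = dichotomization_demo()
    print(f"continuous r = {r_cont:.3f}, dichotomized r = {r_split:.3f}")
    r_ind, r_grp = ecologic_demo()
    print(f"individual r = {r_ind:.3f}, group-mean r = {r_grp:.3f}")
```

Under these assumptions the dichotomized exposure shows a weaker association with the outcome than the continuous one, which is the kind of information loss that reduces power. The group-mean correlation comes out far larger than the individual-level correlation, which is the pattern usually described as the ecologic fallacy.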
Cite this article
Selvin, S. Two issues concerning the analysis of grouped data. Eur J Epidemiol 3, 284–287 (1987). https://doi.org/10.1007/BF00149737