Smoothing in occupational cohort studies: an illustration based on penalised splines

E A Eisen; I Agalliu; S W Thurston; B A Coull; H Checkoway

doi:10.1136/oem.2004.013136

Article Text

Original article

Smoothing in occupational cohort studies: an illustration based on penalised splines

E A Eisen1,
I Agalliu2,
S W Thurston3,
B A Coull4,
H Checkoway5

¹Occupational Health Program, Harvard School of Public Health, Boston; Department of Work Environment, School of Health and Environment, University of Massachusetts Lowell, Lowell, MA, USA
²Department of Work Environment, University of Massachusetts, Lowell, Lowell, MA, USA
³Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, NY, USA
⁴Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA
⁵Department of Occupational and Environmental Health Sciences, University of Washington School of Public Health, Seattle, WA, USA

Correspondence to:  Prof. E A Eisen  Occupational Health Program, Harvard School of Public Health, 665 Huntington Avenue, Boston, MA 02115, USA; eeisenhsph.harvard.edu

Abstract

Aims: To illustrate the contribution of smoothing methods to modelling exposure-response data, Cox models with penalised splines were used to reanalyse lung cancer risk in a cohort of workers exposed to silica in California’s diatomaceous earth industry. To encourage application of this approach, computer code is provided.

Methods: Relying on graphic plots of hazard ratios as smooth functions of exposure, the sensitivity of the curve to amount of smoothing, length of the exposure lag, and the influence of the highest exposures was evaluated. Trimming and data transformations were used to down-weight influential observations.

Results: The estimated hazard ratio increased steeply with cumulative silica exposure before flattening and then declining over the sparser regions of exposure. The curve was sensitive to changes in degrees of freedom, but insensitive to the number or location of knots. As the length of lag increased, so did the maximum hazard ratio, but the shape was similar. Deleting the two highest exposed subjects eliminated the top half of the range and allowed the hazard ratio to continue to rise. The shape of the splines suggested a parametric model with log hazard as a linear function of log transformed exposure would fit well.

Conclusions: This flexible statistical approach reduces the dependence on a priori assumptions, while pointing to a suitable parametric model if one exists. In the absence of an appropriate parametric form, however, splines can provide exposure-response information useful for aetiological research and public health intervention.

P-splines, penalised splines
HR, hazard ratio
RR, relative risk
GAM, generalised additive model
df, degrees of freedom
AIC, Akaike’s Information Criterion

Cox regression
exposure-response models
regression diagnostics

https://doi.org/10.1136/oem.2004.013136

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

View Full Text

Footnotes

Supported by Grant CA81345-03 from National Cancer Institute

Linked Articles

Work in brief
Work in brief

Dana Loomis
Occupational and Environmental Medicine 2004; 61 797-797 Published Online First: 17 Sep 2004.

Log in using your username and password

Main menu

Log in using your username and password

You are here

Abstract

Statistics from Altmetric.com

Request Permissions

Footnotes

Linked Articles

Read the full text or download the PDF:

Log in using your username and password

Read the full text or download the PDF:

Log in using your username and password