Article Text

Risk of prematurity, low birthweight and pre-eclampsia in relation to working hours and physical activities: a systematic review
  1. Matteo Bonzini1,
  2. David Coggon2,
  3. Keith T Palmer2
  1. 1Department of Occupational Health, University of Milan, Foundation IRCCS Ospedale Maggiore, Milan, Italy
  2. 2MRC Epidemiology Resource Centre, University of Southampton, Southampton General Hospital, Southampton, UK
  1. Correspondence to:
 Dr K T Palmer
 MRC Epidemiology Resource Centre, Southampton General Hospital, Tremona Road, Southampton SO16 6YD, UK;ktp{at}


Background: Occupational activities are suspected of having an adverse impact on outcomes of pregnancy.

Aim: To assess the evidence relating three major adverse outcomes (preterm delivery, low birthweight (LBW) and pre-eclampsia/gestational hypertension) to five common occupational exposures (prolonged working hours, shift work, lifting, standing and heavy physical workload).

Methods: A systematic search of Medline and Embase (1966–December 2005) using combinations of keywords and medical subject heading terms was conducted. For each relevant paper, standard details were abstracted that were then used to summarise the design features of studies, to rate their methodological quality (completeness of reporting and potential for important bias or confounding) and to provide estimates of effect. For studies with similar definitions of exposure and outcome, pooled estimates of relative risk (RR) in meta-analysis were calculated.

Results: 53 reports were identified—35 on preterm delivery, 34 on birth weight and 9 on pre-eclampsia or gestational hypertension. These included 21 cohort investigations. For pre-term delivery, extensive evidence relating to each of the exposures of interest was found. Findings were generally consistent and tended to rule out a more than moderate effect size (RR >1.4). The larger and most complete studies were less positive, and pooled estimates of risk pointed to only modest or null effects. For small-for-gestational age, the position was similar, but the evidence base was more limited. For pre-eclampsia and gestational hypertension, it was too small to allow firm conclusions.

Conclusions: The balance of evidence is not sufficiently compelling to justify mandatory restrictions on any of the activities considered in this review. However, given some uncertainties in the evidence base and the apparent absence of important beneficial effects, it may be prudent to advise against long working hours, prolonged standing and heavy physical work, particularly late in pregnancy. Our review identifies several priorities for future investigation.

  • LBW, low birthweight
  • SGA, small for gestational age

Statistics from

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

In Europe, as in most parts of the world, women of reproductive age now comprise a substantial proportion of the total workforce. Current European Union legislation (directive no. 85, 1992) requires employers to assess health and safety risks to pregnant workers, and where possible to minimise them.1 A number of reproductive hazards associated with work are well established—for example, from ionising radiation and lead—and strategies have been developed to manage the associated risks. For others, the scientific evidence is less certain.

Important among these is the possibility that physical activities at work might impact adversely on outcomes of pregnancy such as preterm delivery and LBW.2 In theory, long working hours, prolonged standing, heavy lifting or unusual workload may pose a number of threats to the pregnant worker. For example, the high demand for uterine and placental blood flow in the third trimester could limit reserve capacity for vigorous exercise, the gravid uterus could limit venous return and cardiac output, especially in those who stand, and raised norepinephrine levels could increase uterine contractility and thereby raise the risk of preterm labour. On the other hand, marked physiological adaptations to the demands of pregnancy tend to preserve constant fetal oxygen consumption. Practical management of the pregnant worker is made more difficult because the activities of concern (especially physical exercise), although suspected of being hazardous, could also be beneficial. Thus, a precautionary approach to the uncertainty may not be ideal.

To help clarify the way forward, we conducted a systematic review of the epidemiological evidence relating three major adverse outcomes of pregnancy (preterm delivery, LBW and pre-eclampsia/gestational hypertension) to five common occupational exposures (prolonged working hours, shift work, lifting, standing and heavy physical workload). We aimed to establish what was a reasonable practical approach for employers, given the current balance of evidence, and to identify priorities for further research that could improve the formulation of policy in the future.


Search strategy

We conducted a systematic search of the Medline and Embase electronic bibliographic databases for the period 1966 to December 2005. Medical subject heading terms and keywords were chosen to represent each of the outcomes and exposures of interest and then combined. The following medical subject heading terms were used: pregnancy, reproductive health, pre-eclampsia, infant-premature, labour-premature, birth weight, gestational age, small for gestational age, fetal growth retardation, labour complications and pregnancy complications (as outcomes); and lifting, work schedule tolerance, exercise, fatigue, work, workload, employment and occupational exposure (as exposures). We also used several simple search terms to supplement our inquiry—namely, occupational activity, standing, manual lifting, heavy lifting and shift work (as exposures). We limited our search to papers with an abstract written in English. Abstracts were examined, duplicates and irrelevant references were eliminated, and paper copies of all primary reports and reviews were obtained. We checked the references of retrieved papers for other relevant material. Papers finally included were those which, for one or more outcome–exposure combinations, compared an exposed with a less heavily exposed or an unexposed reference group and which provided estimates of effect (or the data from which these could be calculated).

These procedures and the later steps below were replicated independently by MB and KTP, and differences were resolved by consensus.

Data abstraction

For each paper that was deemed relevant, we abstracted a standard set of information, including details of the study populations, setting, timing of investigation, study design, exposure contrasts, strategies for assessment of exposure(s) and outcome(s), response rates, confounders considered and estimates of effect. For reports that provided frequencies but no estimate of relative risk (RR), we calculated odds ratios (ORs) with exact confidence intervals (95% CI) using STATA V.8 software. Similarly, for papers that presented birth weight as a continuous measure, with group means and standard deviations, we calculated the mean difference between exposure groups with a 95% CI. Where several subanalyses were presented, we focused on the exposure contrasts that were most comparable across studies. Our assessment of papers was unblinded to authorship.

Quality assessment

We rated each paper for completeness of reporting, and each exposure–outcome permutation in terms of its potential for significant confounding or “inflationary” bias.

The completeness of reporting was graded on a nine-point scale according to the number of the following items that were clearly defined: (1) study design; (2) sampling frame and procedures; (3) inclusion/exclusion criteria; (4) main characteristics of the study population (age, ethnicity and social class); (5) study numbers and response rates; (6) method(s) of exposure assessment; (7) method(s) of outcome assessment; (8) method(s) of analysis; and (9) measures of association with 95% CI and numbers in the analysis. Studies in which ⩾3 of these items were missing/unclear were classified as poor in quality of information. The scoring scheme was based on elements proposed by Ariens et al3 and van der Windt et al4 but modified for purpose.

For each health outcome examined, we assessed known risk factors to identify those with important potential to confound associations with occupational activities and patterns of work. This assessment was based on the relative risk (RR) associated with the risk factor, its prevalence in the general population and the likelihood that it might vary importantly according to occupational exposures. Risk factors carrying only small RRs, with only a low prevalence, or with little potential to vary in relation to occupational exposures would be less likely to have a major confounding effect. Those that might reflect the effects of occupational exposure or that might lie on the causal pathway between exposure and health outcome were also discounted. We classified reported associations as being more susceptible to confounding if important potential confounders had not been addressed.

By inflationary bias, we implied bias that could cause important overestimation of RRs. We considered that this was most likely when exposures were self-reported retrospectively (especially if they were of a type that is more difficult to recall), and were related to outcomes that were themselves self-reported or were clearly adverse. Thus, for retrospective studies with self-reported exposures, we assigned one point for each of the following features that was present: (1) self-reported outcome; (2) outcome of pre-eclampsia, gestational hypertension or very low birthweight; and (3) exposure related to physical workload (standing, lifting or activity score). Exposure–outcome pairings were scored 0–3, and scores ⩾2 were considered to indicate important potential for inflationary bias.

By these criteria, we counted exposure–outcome combinations to be of lower methodological quality if they had significant potential for confounding or bias, or were derived from studies with incomplete reporting or low effective response rates (<50%).


For studies with sufficiently similar definitions of exposure and outcome, we calculated a pooled estimate of RR, using a random effects (DerSimonian–Laird) model, and weighting log RRs or log ORs by the inverse of their variances. This method made the simplifying assumption that all measures of RR (ORs, incidence density ratios, hazards ratios and so on) were equivalent. Most comparisons were summarised as ORs, but where other measures were used, the difference is likely to have been small since outcomes were uncommon and point estimates of RR were fairly close to unity. We performed the meta-analysis using the Sharpe and Sterne STATA macro ( We also conducted a sensitivity analysis to check the impact of excluding papers of lower quality, and explored possible publication bias using funnel plots.


Our search identified a total of 49 studies (53 reports)—34 (35 reports) on preterm delivery, 33 (34 reports) on birth weight and 8 (9 reports) on pre-eclampsia or gestational hypertension. Some investigations covered multiple exposures and/or outcomes. Table 1 lists the studies and their main design features. Tables 2–5 summarise quantitative relationships for specific exposure–outcome combinations.

Table 1

 Features of the studies included in the review

Table 2

 Weekly working hours, shift work, standing and risk of preterm delivery

Quality of evidence

Completeness of reporting

In general, the completeness of reporting was satisfactory across studies. As judged by our 9-point criteria, the median score was 7.5 (range 4.5–9). However, for 10 (20%) investigations, the score was ⩽6,6,12,15,17,27,42–44,50,57 the most common omission being a failure to provide a breakdown of the numbers of women included in analyses and the confidence limits associated with risk estimates.

Sample size and response rates

Sample sizes varied from small (<50) to extremely large (>35 000), but many risk estimates were based on findings from >1000 pregnancies. Response rates at baseline (cross-sectional studies) or follow-up (cohort studies) often exceeded 80% (32/49 studies) or even 90% (23/49 studies). However, in six studies, the usable response was <65%17,24,39,46,51,57 and in eight studies,18,25,27,29,43–45,55 it was unclear from the description provided.

Assessment of exposures

In 21 cohort investigations, exposure was ascertained prospectively during pregnancy, whereas for 28 studies (8 case–control, 19 cross-sectional and 1 retrospective cohort), information about exposure was elicited after the relevant health outcomes had occurred. The data on exposure were collected mostly through self-report (by mail, telephone or interview), but in a minority of studies18,19,31,36,39,51 job title was used as a surrogate index of exposure. Most studies of working hours and shift work employed similar definitions of exposure, but for other exposures, non-comparability restricted the opportunity for meta-analysis—for example, definitions of lifting varied with regard to load, frequency and duration, and some reports failed to define a “heavy load” at all. Similarly, the components of indices that were used to score physical workload differed between studies. Issues of measurement error were seldom considered by authors, even in prospective studies with the opportunity to validate reports by direct observation. However, a few studies12,25,40 used personal diaries to assist self-reporting of exposures. Around half of the studies did not report the timing of exposures during pregnancy.

Assessment of health outcomes

With few exceptions,18,20,24 health outcomes were established from hospital records, registers or birth certificates.

Bias and confounding

Various strategies were used to control for possible confounding, including matching, restriction, stratification and regression modelling, but confounding was ignored altogether in a few investigations.7,22,42 In addition, the associations in which we were interested were not always the prime focus of a paper, and sometimes the data that we were able to abstract, therefore, came only from crude preliminary tables. Tables 2–5 list the exposure–outcome pairings that carried higher potential for inflationary bias or confounding according to our prespecified criteria.

Preterm delivery

Case definition

Most, but not all,12,33,51,57 of the reports that we identified adopted the World Health Organization definition for preterm delivery: “the birth of a living fetus before 37 completed weeks of gestation”.

Potential confounding factors

Various maternal characteristics have been associated with an increased risk of preterm delivery, including a history of preterm delivery or fetal loss, multiple gestation, diabetes, pre-eclampsia, bacterial vaginosis, extremes of maternal age, weight, stature, ethnicity, educational level, socioeconomic status, tobacco and alcohol intake, substance misuse, parity, complications of pregnancy and maternal disease.58–61 However, few of these factors are common and carry a high RR, whereas some were universally controlled (eg, multiple gestation) and some (eg, obstetric events in earlier pregnancies) could have arisen from previous exposures at work. Smoking and social class (or proxies of social class—eg, educational attainment or income) are believed to carry moderate RRs (1.5–2.0) and are prevalent exposures whose frequency could vary systematically by occupational activity. We deemed risk estimates that failed to take account of both these variables to have a higher potential for confounding (17 of 91 exposure–outcome combinations, as listed in tables 2 and 3).

Table 3

 Lifting, physical activity and risk of preterm delivery

Scope for meta-analysis

Formal meta-analysis was judged feasible for associations of preterm delivery with longer working hours, shift work and standing. For lifting and physical workload, definitions of exposure were considered too heterogeneous to be combined.

Working hours

The relationship of working hours with preterm delivery was considered in 16 investigations8,10,11,13,17,20,24,27,29,36,38,43–46,52 (including four cohort studies), the high exposure group generally being women who had worked for at least 40 h per week in the first two trimesters of pregnancy. Only moderate associations were reported (table 2). Estimates of RR exceeded 1.5 in four studies,10,24,27,52 and in two of these the increase was statistically significant. However, these positive findings were limited in one case24 by an unusually low response rate, and in the other27 by incomplete reporting and a higher potential for confounding. No risk estimate was as high as 2, and in the four largest studies (>2000 pregnancies analysed),11,30,43,45 which included the 16-country EUROpean Project of Occupational risks and Pregnancy outcome case–control Study,45 RRs lay in the range 0.59–1.34.

A pooled RR of 1.31 (95% CI 1.16 to 1.47; test of heterogeneity, Q = 4.33, p = 0.74) was derived from eight studies that compared work for at least 40 h per week with shorter hours, and for the subset of five studies judged to have higher methodological quality, the corresponding risk estimate was 1.20 (0.98 to 1.47; Q = 1.32, p = 0.86).

Shift work

We found 14 studies that considered the association of preterm delivery with shift work (usually defined as either shift or night work),8,13,15,24,27,29,32,34,38,43,45,49,55,56 including 5 cohort investigations (table 2). In seven studies, the point estimate of RR was close to unity, but in six studies8,24,27,38,49,55 (including four of higher quality8,38,49,55), the RR was >1.5. Two studies found significantly increased risks of ⩾2.0.8,55

The positive studies tended to be small relative to others of similar design, and the four largest investigations13,29,45,56 found little association between shift work and preterm delivery. In particular, a prospective study, based on the Danish National Birth Cohort and with record linkage to 3 national birth outcome registers, compared >32 000 women working in the daytime with 3197 women working in rotating shifts during the first 2 trimesters, and reported an adjusted OR close to unity.56 Risk estimates from subanalyses for other shift patterns (fixed evening, fixed night, rotating shifts without night) were in the range 0.7–1.1.

The pooled estimate of risk across 13 studies with sufficient data was 1.20 (95% CI 1.01 to 1.42; Q = 31.30, p = 0.002) and for the eight studies that met our criteria for higher quality, it was 1.26 (95% CI 0.98 to 1.63; Q = 21.83, p = 0.003).


Standing and preterm delivery were considered in 20 papers,7,10,11,13,15,17,21,23,24,26,27,29,32,33,38,43–45,51,52 including 10 based on cohort studies (table 2). “High” exposure was defined (or could be interpreted) as standing for at least 3 h/day in 12 of the studies. Risk estimates exceeded 1.5 in seven studies,23,24,26,27,33,44,51 whereas in most of the remainder, they exceeded unity but were less clearly positive (OR 1.2–1.3). In the five largest studies (>4000 pregnancies), ORs lay in the range 1.07–1.56.10,21,23,29,45

The pooled estimate of risk across the 12 studies that compared standing for at least 3 h with lower exposures was 1.28 (95% CI 1.11 to 1.47; Q = 16.50, p = 0.12), and that for the subset of 6 studies of higher methodological quality was 1.26 (95% CI 0.96 to 1.66; Q = 13.19, p = 0.02).


The relationship between occupational lifting and preterm delivery was examined in 12 studies,5,7,10,13,26,29,32,38,43–45,52 including 6 of cohort design (table 3). For the most part, inquiry focused on exposures in the first trimester, but studies differed substantially in their definition of exposure. In none did the point estimate of risk exceed 1.5, although in three studies32,43,44 (one of higher quality32) it was >1.3.

Physical workload

The link between physical workload and preterm delivery was investigated in 21 studies.11,13,15,17,18,21–25,27,29,33,35,36,39,40,42–44,49,52 (table 3). Exposure was defined in various ways. Six studies17,24,27,29,33,49 used an occupational fatigue score, first proposed by Mamelle et al,27 and comprising several occupational factors—standing >1 h/day, work on an industrial machine, carrying loads of >10 kg, mental stress, and chemical or physical exposures in the work environment. Another common approach was based on a score of physical workload calculated after grouping certain exposures (eg, strenuous conditions and self-reported physical exertion), or according to the estimated daily energy expenditure.11–13,15,18,23,25,33,35,36,39,40,42,43 In practice, however, exact definitions tended to vary, hindering comparison.

RR estimates close to unity (0.8–1.3) were found in about half of the studies, but for six studies they exceeded 1.5 (five statistically significant18,27,39,42,43), and in three studies they were at least doubled.18,42,43 Findings differed materially by study design, however. In 7 of the 8 prospective cohort studies, RRs were <1.3, whereas in 5 of the 13 retrospective studies, they were >1.7. The two highest risk estimates (both from cross-sectional surveys of lower quality42,43) were derived from a crude secondary analysis of the data. Four of the five largest studies reported only modestly increased risks (OR <1.3).11,21,23,29

Birth weight

Case definition

The 32 investigations of LBW that we identified used 3 main approaches to characterise the outcome. Some treated birth weight as a continuous measure, some defined cases as having a birth weight less than a stated threshold (most often 2500 g), and some took as cases babies that were small for gestational age (SGA) according to a cut-point on the expected distribution (usually the 10th centile). Many papers presented results for more than one of these outcome measures, and where birth weight was adjusted for gestational age, risk estimates tended, if anything, to be lower. This suggested that associations with unadjusted birth weight in part reflected effects on gestation. Therefore, since we carried out a separate review of occupational risk factors for preterm delivery, we present findings for SGA. Supplementary tables S1 and S2 summarising results for other measures of birth weight are available online (

A total of 14 studies provided information on occupational risks of SGA5,8,11,13,14,20,23,25,26,34,35,46,52,56 (table 4).

Table 4

 Occupational activity and risk of being small-for-gestational age at delivery

Potential confounders

In developed countries, four risk factors are suspected to account for most of cases of intrauterine growth retardation: cigarette smoking during pregnancy, small maternal stature, suboptimal nutrition and low maternal weight gain during pregnancy. Other less significant risk factors include race, alcohol consumption, prematurity or LBW in the mother, parity, maternal illnesses and maternal age.62–64 Among the major risk factors, low maternal weight gain in pregnancy could lie on the causal pathway between occupational exposures and SGA, whereas socioeconomic status represents a useful proxy for poorer nutrition. We thus classified risk estimates as having a higher potential for confounding if they failed to take account of both smoking and at least one of the factors—namely, socioeconomic status, maternal height or pre-pregnancy weight (13 of the 29 exposure–outcome combinations listed in table 4).

Scope for meta-analysis

A pooled estimate of risk was calculated for the association of SGA with shift work, but data on other exposures were too sparse or too heterogeneous to warrant meta-analysis.

Working hours

Weekly working hours and SGA were considered in seven studies,8,11,13,20,38,46,52 all but two of which were of higher quality. High exposure mostly entailed working for ⩾40 h/week in the first or second trimesters of pregnancy. The largest study found an OR of 1.59 (95% CI 1.14 to 2.22),11 and in the one cohort study, the OR was 2.1 (95% CI 0.6 to 7).52 The other five investigations all gave risk estimates close to unity.

Shift work

Our search retrieved six studies that reported on shift work and SGA,8,13,14,34,38,56 all but one of which were considered to be of higher quality. In one, the estimated RR was 1.5 (95% CI 1.0 to 2.4),34 but otherwise, risk estimates tended to be close to unity. The pooled estimate of risk from the six studies was 1.07 (95% CI 0.96 to 1.19; Q = 3.30, p = 0.51).


Lifting was considered in five studies of SGA5,13,26,38,52 (four of higher quality). None of the studies showed either a significant positive or a significant negative association.


Standing and SGA were analysed in eight studies,11,13,14,23,26,35,38,52 including four of cohort design. Four of the studies13,14,26,38 were classified as being of higher quality.

The highest risk estimate (OR 2.0, 95% CI 0.7 to 5.4, for those standing ⩾5 vs ⩽4 h/day in the second trimester) came from a cohort study in Thailand.52 Otherwise, risk estimates were <1.5.

Physical workload

SGA and physically demanding work were considered in seven investigations,11,13,14,23,25,35,52 including three of cohort design. As with preterm delivery, there was considerable variation in definitions of exposure. In half the studies, the timing of exposure was unstated, whereas the remainder focused mainly on the first two trimesters.

One cross-sectional study found a risk ratio of 2.4 (95% CI 1.3 to 4.6),35 and a large cohort study indicated an OR of 1.32 (95% CI 1.12 to 1.56)23 for manual work as compared with office work. However, only one other study11 gave a risk estimate >1. Only two studies13,14 were classified as being of higher quality and both these reported an OR just below unity.

Gestational hypertension and pre-eclampsia

Case definition

Pregnancy-induced hypertension is commonly subclassified as

  1. gestational hypertension (increase in blood pressure in a woman who previously had normal blood pressure, which occurs after the 20th week of gestation and resolves after delivery); and

  2. pre-eclampsia (gestational hypertension with proteinuria and oedema).

For pre-eclampsia, we found some variation in terms of the level of blood pressure and the degree of proteinuria required for diagnosis: two studies relied on routine diagnosis by the clinicians caring for subjects19,53; two defined pre-eclampsia as diastolic blood pressure ⩾90 mm Hg and proteinuria ⩾1 g/l on at least two separate occasions after 20 weeks of gestation and not before22,28; one used a different cut-point (which included systolic hypertension)41; and one study (two reports) was based on a rise in mean arterial blood pressure between the first and last antenatal visits.34,35 Similarly, criteria for gestational hypertension varied importantly between studies.

Potential confounders

Epidemiological research on hypertension in pregnancy has tended to focus mainly on pre-eclampsia, but gestational hypertension may be part of the same spectrum of disease, with a similar pathogenesis and causation.

The pathogenesis of pre-eclampsia is thought to involve superficial placentation, immune maladaptation, reduced concentrations of angiogenic growth factors and increased placental debris in the maternal circulation, provoking a maternal inflammatory response that is modulated by pre-existing cardiovascular or metabolic fitness.65 Reported risk factors include a history of chronic hypertension, previous pre-eclampsia, primiparity, obesity, diabetes, multiple gestation, extremes of age, ethnicity, family history, low calcium intake, use of barrier contraception, artificial donor insemination and donated oocyte gestation.22,65,66 Smokers have been found to be at a lower risk of pre-eclampsia in some, but not all, studies.66,67 The only factors we considered to be both common and to carry a substantial RR were overweight68 and primiparity.69 We classified risk estimates as having a higher potential for confounding if they failed to take account of both of these variables (10 of 21 exposure–outcome pairings in table 5).

Table 5

 Occupational activity and the risks of pre-eclampsia and pregnancy-induced hypertension

Scope for meta-analysis

Because of potentially important differences in definitions of outcome from one study to another, we did not attempt meta-analysis for any occupational association with gestational hypertension or pre-eclampsia.

Associations with occupational activities

Our review identified nine unique investigations (two cohort studies) on gestational hypertension, pre-eclampsia and occupational activity—two on working hours,22,28 two on working in shifts,34,53 two on lifting,19,53 four on standing,19,35,41,53 and five on physical workload19,22,35,42,48 (table 5). Positive associations were found with lifting of heavy loads in one study53 and with high physical activity scores in two studies.42,48 All three of the positive studies were retrospective, and we rated all as having higher potential for inflationary bias. In addition, one42 was incomplete in its reporting and was assessed as having a greater potential for confounding.

Funnel plots

To check for possible publication bias, we constructed funnel plots for those associations with sufficient data to be included in meta-analyses. These plots, which are available from the authors on request, suggest a degree of publication bias in relation to shift work and preterm delivery (smaller studies increasingly more positive), but showed no clear pattern in relation to the other associations.


The health outcomes that we examined in this review are clinically important. Preterm delivery is a major determinant of perinatal mortality, and of neonatal and infant morbidity.70 LBW is also related to infant morbidity and mortality (eg, in England and Wales during 1999, nearly two-thirds of infant deaths were in babies of LBW71), as well as to predicting adverse outcomes in childhood and later life, such as poorer growth and development, and higher risks of neurological and cognitive deficit, high blood pressure, non-insulin-dependent diabetes, coronary heart disease, stroke and obstructive lung disease.72 Pregnancy-induced hypertension was associated with 16% of pregnancy-related maternal deaths in the US during 1991–7.73

The extent of epidemiological evidence that we identified on occupational risks varied. For some associations (eg, the relationship of preterm delivery to shift work and lifting) a substantial body of research had been published, whereas for others (eg, SGA with lifting) relatively few reports were retrieved. Although our literature search was restricted to English language publications, and did not extend beyond those in peer-reviewed journals, we think it unlikely that many important papers will have been overlooked. There is, however, a possibility of publication bias, with more complete reporting of positive than non-positive findings, especially from smaller studies, and the funnel plots that we constructed would tend to support an element of such bias.

A further limitation of the available evidence relates to the definition and ascertainment of exposures. Many of the occupational activities that have been studied are complex constructs, and cannot readily be characterised by a simple, unidimensional metric. For example, occupational lifting could be classified according to the frequency of lifting tasks in a working day, the duration of such tasks, the heaviness of the weights lifted and perhaps also the postures in which lifting is carried out. Similarly, shift work could encompass varying patterns of rotation, with or without work at night. In the absence of clear pointers to the aspects of occupational activities that are most relevant to the health outcomes under investigation, it is not obvious how exposures should best be categorised, and if the classification is suboptimal, risks may be underestimated.

One aspect of exposure that could be important for any of the occupational activities examined is its timing during pregnancy. Thus, the same activity might carry different risks if it occurred late in pregnancy as compared with only a few weeks after conception. In some reports, it was unclear exactly when during pregnancy the reported exposures occurred, but in most, the exposures analysed were in the first or second trimester. Only six papers12,16,37,38,40,46 presented risk estimates separately for exposures in different trimesters, and these did not point to major differences. However, data on exposures late in pregnancy were generally sparse.

Ascertainment of exposures was usually by direct questioning, or by inference from job title. The accuracy of self-reported exposures is likely to differ according to their nature. For example, hours of work and night work should be relatively easy to recall, whereas frequency of lifting may be more difficult to remember accurately. Where exposures were assessed from memory after delivery (ie, in case–control and cross-sectional studies), there was a potential for differential errors of recall, with exposures perceived as hazardous being over-reported by cases or under-reported by controls. This would apply particularly if the adverse nature of the health outcome was obvious to participants (eg, pre-eclampsia) and the exposure of interest was difficult to remember reliably, and would have tended to inflate risk estimates. Thus, when reporting results, we highlighted those studies that we considered most susceptible to such bias.

Even where differential misclassification of exposure was unlikely, however, there remained the possibility of non-differential errors in exposure assessment, especially for exposures that were harder to characterise or were inferred indirectly from job title. The effects of such non-differential misclassification would normally be to bias risk estimates towards the null.

Other possible sources of error include misclassification of health outcomes, response bias and uncontrolled confounding effects, but these are probably less important. With few exceptions,18,20,24 outcomes were determined from hospital records, registers or birth certificates, in a way that seems unlikely to have been influenced by a knowledge of exposures. Response rates frequently exceeded 80% (32 of 49 studies), and even when they were lower, there was usually no reason to expect that responders would have differed markedly from non-responders in the associations under examination. Of the known risk factors for the pregnancy outcomes studied, few are common and carry a RR >2, and even if they were quite strongly associated with occupational activities, their confounding effect would be relatively small.

Current balance of evidence

Given the above limitations, table 6 summarises our assessment of the overall strength of evidence linking occupational activities with different categories of outcomes of pregnancy.

Table 6

 Summary of the main findings

For preterm delivery, we found extensive evidence relating to each of the exposures that we considered. Findings were generally consistent and tended to rule out a more than moderate effect size (RR >1.4). Strongly positive studies were uncommon, the larger and most complete studies tended to be less positive, and pooled estimates of risk, where feasible, pointed to only modest or null effects.

For SGA, the position is not dissimilar. Findings were generally consistent and rarely indicated more than moderate effect sizes. However, the evidence base was more limited. For example, we found only one cohort study on working hours and only one on shift work.

For pre-eclampsia and gestational hypertension, although positive findings were few in number, the evidence base is too limited to allow firm conclusions.

For none of the exposures examined was there any indication of important beneficial effects on outcomes of pregnancy.

Comparison with earlier reviews

Although there have been numerous narrative reviews on work and pregnancy outcome,70,74–79 few systematic reviews have been published, and even fewer that contain an element of meta-analysis. The investigation closest to our own, by Mozurkewich et al,2 considered several of the same papers, but differed in certain respects. Firstly, the authors excluded many more papers than we did. Thus, our review, which also had the benefit of at least five further years of research reports, covered almost twice the number of investigations. Secondly, Mozurkewich et al pooled in meta-analysis several studies that we believe have fundamentally dissimilar definitions of exposure. Thirdly, our review covers some exposure–outcome combinations that were not reported by Mozurkewich et al. Nonetheless, where comparisons can be made, the conclusions of our two reviews are similar in most respects. Mozurkewich et al estimated that physically demanding work (including lifting) was significantly associated with preterm delivery (OR 1.22), SGA (OR 1.37) and hypertension or pre-eclampsia (OR 1.67), and that the risks of preterm delivery were also higher in those with prolonged standing (OR 1.26) and shift and night work (OR 1.24).2 Among these, only the estimates for pre-eclampsia from our reading seem to be without much evidential support.

Main messages

  • This review considered the relationship of three adverse outcomes of pregnancy (preterm delivery, LBW and pre-eclampsia/gestational hypertension) to five occupational exposures (long working hours, shift work, lifting, standing and heavy physical workload).

  • For preterm delivery, there was extensive evidence related to each exposure, and this tended to rule out more than moderate effects (RR >1.4)

  • We found less evidence for the other outcomes, especially for pre-eclampsia.

  • We found no evidence of benefit from these occupational exposures.

Policy implications

  • On balance, the evidence does not support mandatory restriction of any of the activities considered.

  • It may still be prudent to limit extremes of exposure, especially in late pregnancy.

Implications for occupational health practice

Given the scientific evidence that is currently available, how should the occupational activities of pregnant mothers be managed?

We do not think that the current balance of evidence is sufficiently compelling to justify mandatory restrictions on any of the activities considered in this review, if a woman wishes to continue them through pregnancy. At the same time, however, given the uncertainties in the evidence base and the apparent absence of any important beneficial effects, it would seem prudent to advise against long working hours (>40 h per week), prolonged standing and heavy physical work, particularly late in pregnancy. Shift work does not seem to carry an important increase in risk for the health outcomes examined, and in this respect, there seems no reason to recommend its discontinuation during pregnancy. However, societal customs and expectations may dictate that a change from shift work should be permitted if requested.

Priorities for future research

Although we have identified uncertainties in relation to all the occupational activities and health outcomes examined, the potential to reduce this uncertainty by further research varies. We suggest that the highest priorities for future investigation should be the relationship of SGA to working hours and lifting, and further assessment of occupational risk factors for pre-eclampsia and gestational hypertension. In particular, there is a need for well-designed cohort studies in which relevant exposures are assessed prospectively at different stages of pregnancy and subsequent health outcomes are systematically ascertained.


Supplementary materials


  • Published Online First 9 November 2006

  • Competing interests: None declared.