Article Text

Download PDFPDF

Meta-analysis of benzene exposure and non-Hodgkin lymphoma: biases could mask an important association
  1. C Steinmaus1,2,
  2. A H Smith1,
  3. R M Jones1,
  4. M T Smith1
  1. 1
    School of Public Health, University of California, Berkeley, California, USA
  2. 2
    Office of Environmental Health Hazard Assessment, California Environmental Protection Agency, Oakland, California, USA
  1. Craig Steinmaus, 140 Warren Hall University of California Berkeley, California 94720-7360, USA; craigs{at}


Objectives: Benzene is a widely recognised cause of leukaemia but its association with non-Hodgkin’s lymphoma (NHL) is less well established. The goal of this project is to review the current published literature on this association.

Methods: We performed a meta-analysis of cohort and case-control studies of benzene exposure and NHL and a meta-analysis of NHL and refinery work, a potential source of benzene exposure.

Results: In 22 studies of benzene exposure, the summary relative risk for NHL was 1.22 (95 CI 1.02 to 1.47; one-sided p value  0.01). When studies that likely included unexposed subjects in the “exposed” group were excluded, the summary relative risk increased to 1.49 (95 CI 1.12 to 1.97, n  13), and when studies based solely on self-reported work history were excluded, the relative risk rose to 2.12 (95 CI 1.11 to 4.02, n  6). In refinery workers, the summary relative risk for NHL in all 21 studies was 1.21 (95 CI 1.00 to 1.46; p  0.02). When adjusted for the healthy worker effect, this relative risk estimate increased to 1.42 (95 CI 1.19 to 1.69).

Conclusions: The finding of elevated relative risks in studies of both benzene exposure and refinery work provides further evidence that benzene exposure causes NHL. In addition, the finding of increased relative risks after removing studies that included unexposed or lesser exposed workers in “exposed” cohorts, and increased relative risk estimates after adjusting for the healthy worker effect, suggest that effects of benzene on NHL might be missed in occupational studies if these biases are not accounted for.

Statistics from

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Annual production of benzene in the USA is over 2 billion gallons and millions of people are exposed to benzene either occupationally through a variety of different industrial processes or environmentally from cigarette smoke, gasoline or automobile emissions. Benzene is a widely recognised cause of leukaemia, particularly acute non-lymphocytic leukaemia (ANLL), but its association with non-Hodgkin’s lymphoma (NHL) is less well established.

Previous epidemiological studies on the association between benzene and NHL have produced mixed results. For example, in a cohort study of 74 828 benzene-exposed workers from multiple industries in China, Hayes et al reported a relative risk of 4.7 (95 CI 1.2 to 18.1) for NHL.1 In contrast, Sorahan et al reported a relative risk of 1.00 (95 CI 0.64 to 1.49) for NHL in a cohort study of benzene-exposed workers in England and Wales.2 The reasons for these discrepancies are not entirely clear but could be related to differences in study populations, exposure levels and study designs. It could also be related to low statistical power or the presence of particular biases which may have limited the ability of some studies to identify real effects. In particular, biases resulting from the healthy worker effect, from inaccurate classification of exposure, and from inadequate study power can bias relative risk estimates towards the null and lead to true associations being missed.

In a previous publication, we assessed some of the evidence relating to benzene and NHL in a systematic review in which problems of bias due to the health worker effect were noted.3 In this paper we present a formal meta-analysis of studies of NHL and occupational exposure to benzene in work settings other than refineries, and a formal meta-analysis of NHL and refinery work, a setting that has historically been associated with benzene exposure.4 These were done separately since refinery work can be associated with many chemical exposures other than benzene. Two previous meta-analyses done on these topics did not find increased NHL risks with benzene exposure or refinery work.5 6 However, our meta-analysis differs from these prior reports in that we objectively evaluate the impact of the healthy worker effect and exposure misclassification (in particular, the inclusion of unexposed or lesser exposed workers in “exposed” cohorts) and incorporate the results of these evaluations directly into our overall summary relative risk estimates. As we show in this report, the incorporation of these factors can have important implications in the evaluation of a causal link between benzene and NHL.


Databases including Medline were searched for all epidemiological studies on NHL and benzene exposure or refinery work. Searches included combinations of the keywords: NHL, lymphosarcoma, reticulosarcoma, cancer, benzene, solvents, refinery, and petroleum industry. The bibliographies of all relevant articles as well as relevant review articles were also searched. The results of these searches are reviewed in Smith et al.3

The meta-analysis of benzene exposure and NHL includes case-control and cohort studies that provided relative risk estimates specifically for benzene exposure. Only data published in scientific journals were used. Studies which reported relative risks only by job type (eg, shoe maker) or industry (eg, rubber manufacturing) without specific identification of those exposed to benzene were not included in the benzene–NHL meta-analysis. Studies that reported relative risks for many different solvent exposures combined and studies reporting only relative risk estimates for NHL combined with other cancer types were also not included. Many studies, especially older ones, presented results only for lymphosarcoma and reticulosarcoma combined. These were included in the analysis since these cancers compromise a major sub-classification of NHL. A few studies presented separate results for both NHL and lymphosarcoma and reticulosarcoma. For these studies, the results for NHL were used.

In most studies, relative risks were given for several different metrics of benzene exposure including cumulative exposure, average exposure intensity, peak exposure, exposure duration, and time since first exposure. When this occurred we chose a single relative risk selected in the following order: average exposure intensity, cumulative exposure, and exposure duration. Table 1 shows the selection criteria used for this meta-analysis. Several studies also reported relative risks for different levels of exposure (ie, high, medium, low). Simple cause and effect associations are best initially evaluated in groups with high exposure since relative risks are likely to be further away from 1.0 when exposures are high than when they are low. Higher relative risks are less likely to be subject to type II bias (ie, inadequate study power) since all else being equal, study power is greater when relative risks are higher. Higher relative risks are also less likely to be due to confounding or some other undetected bias.7 For these reasons, and because our focus was on evaluating causal inference rather than exact dose–response relationships, we selected the relative risk for the highest exposure category from each study.

Table 1 Order of selection of relative risk estimates from the individual studies when more than one relative risk is given

The highest categories of exposure were defined differently from study to study. For example, in Hayes et al the highest exposure category included only those workers with estimated average exposures greater than 25 ppm,1 while in Rinsky et al the highest exposure group was defined as those with exposures greater than 1 ppm-day.8 If a true association exists, combining relative risks from studies of highly exposed workers with those of studies including lesser exposed workers can drive summary relative risk estimates towards 1.0. As such, we performed a separate subgroup analysis that only included studies providing relative risks for workers that were most likely to be highly exposed. A single definition of high exposure could not be identified since studies used different exposure metrics. Because of this, we performed one analysis (labelled “high exposure”) in which we excluded all studies comparing a non-exposed group to a group with “any benzene exposure” and excluding the study by Rinsky et al (2002) which had a very low cut-off point for defining exposed workers (1 ppm-day). In an additional subgroup analysis (labelled “high exposure, no self reported data”) we included only those “high exposure” studies where some industrial hygiene measurement data were used to assess exposure. In this analysis, we excluded studies where benzene exposure was based solely on workers’ self-reports.

In some instances, the highest exposure category for the selected exposure metric had no cases. In these instances, the next highest exposure category or highest exposure category for the next most relevant exposure metric was selected. This was done to remove studies with low study power. This criterion only involved two studies and had only small impacts on our results.9 10 For example, removing these two studies or using the relative risks of zero for their selected exposure category changed the benzene–NHL summary relative risk estimate from 1.22 to 1.21 and 1.19, respectively.

We selected studies that reported either incidence rate ratios, odds ratios, or standardised mortality ratios (SMRs). Incidence rate ratios and odds ratios adjusted for age and gender were used. Several case-control studies and a few cohort studies also reported relative risks adjusted for other variables such as education, family history of cancer, ethnicity, income and smoking. Adjusted relative risks were used when available. In our meta-analysis of benzene and NHL, many studies reported data on NHL incidence and a few studies reported data on both incidence and mortality. Incidence relative risks were selected over mortality relative risks if both were reported. In the meta-analysis of refinery work and NHL, the Gun et al study was the only study that reported relative risks for both NHL incidence and mortality.11 Few refinery studies reported data on NHL incidence. Because of this, we used the Gun et al results for mortality in the refinery–NHL meta-analysis.

Some studies reported results for different latency periods (the time from first exposure to cancer diagnosis). Increases in ANLL have been associated more strongly with more recent benzene exposures compared to more distant exposures but this does not appear to be the case for NHL.1 Since many environmental agents can take decades to lead to detectable cancers, we chose the result for the longest latency period. These were generally about 10–20 years and never more than 40 years. For many specific benzene-exposed cohorts, publication of initial results was followed by one or more updates, usually extending the period of follow-up. In these instances the most recent publication giving the selected exposure metric was used in our analysis.

Occupational studies reporting SMRs can be biased by the healthy worker effect.12 In order to objectively evaluate the impact of this bias, we performed separate analyses in which SMRs were adjusted for the healthy worker effect using the methods initially presented by Miettinen and Wang to analyse proportionate mortality studies as case-control studies.13 This method involves computation of the mortality odds ratio for the cause of death of interest (NHL) and all other causes of death comparing the benzene-exposed group to the benzene-unexposed group. Justification and further details for this method are provided in Smith et al.3 For those studies that did not provide sufficient data to do these calculations, the unadjusted relative risks were used.

Summary relative risk estimates were calculated using both the fixed effects inverse variance weighting method14 and the random effects method.15 Heterogeneity among studies was assessed using the general variance-based method as described by Petitti.16 Some authors have suggested that because the random effects model incorporates between-study heterogeneity it is more conservative than the fixed effects model.16 However, a potential problem with the random effects model is that, unlike the fixed effects model, study weighting is not directly proportional to study precision. As a consequence, the random effects model gives relatively greater weight to smaller, less precise studies compared to larger, more precise studies than the fixed effects model. As shown by Poole and Greenland, this can actually lead to summary results that are less conservative than those produced using the fixed effects model.17 To avoid the problems introduced by the random effects model, we used the method first presented by Shore et al18 and used in several subsequent meta-analyses.1922 In this method the summary relative risk estimate is calculated by directly weighting individual studies by their precision as in the fixed effects model. Between-study heterogeneity is not used to calculate the relative risk estimate, but is incorporated into calculations of its variance (ie, its 95 confidence interval).

Publication bias was assessed using funnel plots and Begg’s and Egger’s tests.23 24 The funnel plot is a graphical presentation of each study’s effect size versus an estimate of its precision and can be asymmetrical if smaller studies with null or unexpected results are not published. In Egger’s test, asymmetry in the funnel plot can be formally tested by performing a simple linear regression of the effect size divided by its standard error (SE) on the inverse of the SE.24 In Begg’s test, Kendall’s rank order test is used to assess the correlation between the studies’ effect sizes and their SEs.23

All p values for the meta-analysis results are one-sided since we had a clear one-directional a priori hypothesis: that benzene or refinery work was associated with an increased (not decreased) NHL risk.

Although the focus of our study was on benzene exposure, we performed a supplementary meta-analysis of refinery work since this industry has historically been associated with exposures to benzene. Our meta-analysis of refinery work and NHL was done using similar methods to those described above, except in this analysis only cohort studies were used and studies of refinery workers were used regardless of whether or not they provided specific information on benzene exposure. In these studies, exposure was commonly classified as the number of years worked at the refinery. The relative risk from each study associated with the greatest number of years worked was selected for the meta-analysis. When data on this metric weren’t provided, relative risks for metrics in the following order were selected: greatest number of years since hire, highly exposed job category (ie, hourly worker versus salary worker) and any refinery work (Table 1).


Studies of benzene exposure and NHL risk

Tables 2 and 3 provide details of the case-control and cohort studies used in the meta-analysis of benzene exposure and NHL risk, respectively. A list of studies not used and reasons why they were not used can be found at In total, the meta-analysis of benzene exposure and NHL included 16 case-control and six cohort studies. The overall summary relative risk for all 22 studies combined was 1.22 (95 CI 1.02 to 1.47; p  0.01) (Table 4). In an analysis by study design, the summary relative risks for cohort studies (RR  1.21, 95 CI 0.83 to 1.77; p  0.15) were similar to that for the case-control studies (RR  1.23; 95 CI 0.99 to 1.52; p  0.03). In the 13 studies that provided results specifically for highly exposed workers the summary relative risk was 1.49 (95 CI 1.12 to 1.97) (Figure 1). In the six of these studies that did not rely on self-reported exposure information for exposure assessment, the summary estimate was 2.12 (95 CI 1.11 to 4.02). When three of the four cohort studies in this analysis were adjusted for the healthy worker effect, the relative risk rose slightly to 2.26 (95 CI 1.29 to 3.97) (An internal comparison group was used in the cohort study by Hayes et al so the healthy worker effect adjustment was not done.1)

Figure 1 Forest plot of high exposure studies in the meta-analysis of benzene and NHL.
Table 2 Case-control studies of benzene exposure and non-Hodgkin’s lymphoma (NHL)
Table 3 Cohort studies of benzene exposure and non-Hodgkin’s lymphoma (NHL)
Table 4 Summary results of the meta-analysis of benzene exposure and non-Hodgkins lymphoma (NHL) and meta-analysis of refinery work and NHL

The cohort investigated in Collins et al25 made up one of the seven plants included in the Wong study.26 However, the Wong study had a small number of cases (probably because the follow-up period was relatively short) and therefore received only a small fraction of the total weight in any analysis. Removing this study had little effect on our results.

Studies of refinery work and NHL risk

Table 5 shows details of the studies included in the meta-analysis of refinery work and NHL. Lists of studies and relative risks not used for this analysis can be found at The summary relative risk for all 21 studies in this analysis was 1.21 (95 CI 1.00 to 1.46; p  0.02). When adjusted for the healthy worker effect, this rose to 1.42 (95 CI 1.19 to 1.69; p<0.001). The Tsai et al study27 had a much higher relative risk than any other study in this analysis. Removing this study from this analysis caused a slight decrease in the summary estimate (RR  1.37; 95 CI 1.18 to 1.58). Removing the Sorahan et al study,28 which had a much larger sample size and was weighted more heavily than any other study, also had little impact on this analysis (RR  1.44; 95 CI 1.15 to 1.80). When the analysis of refinery studies with the healthy worker effect adjustment was confined to only those 14 studies that assessed a higher exposure category, the summary relative risk increased to 1.51 (95 CI 1.07 to 2.14) (This analysis included all studies in Table 5 that were not marked “Total cohort” for the exposure category.) The summary relative risk calculated using the random effects model was lower than this (RR  1.37; 95 CI 0.94 to 2.01) because of the greater weight given to the smaller, less precise studies by this model.

Table 5 Cohort studies of refinery work and non-Hodgkin’s lymphoma (NHL)

No evidence of publication bias was evident in the funnel plots, Begg’s test, or Egger’s test in either the benzene or refinery analyses. For example, in the benzene–NHL meta-analysis involving the 13 high exposure studies (not adjusted for the healthy worker effect), Begg’s test Kendall’s score was -2 (p  0.90), Egger’s test p value for bias was 0.81, and the funnel plot showed no evidence of asymmetry (Figure 2).

Figure 2 Funnel plot of benzene and NHL high exposure studies (n  13) (not adjusted for the healthy worker effect).


The elevated summary relative risk estimate we identified in studies that specifically examined high benzene exposure provides new evidence that benzene causes NHL. The increase in summary relative risks when analyses were confined to results in highly exposed workers and reports not based primarily on self-reported exposure provides an example of how exposure misclassification and the inclusion of low or unexposed workers in “exposed” cohorts can bias or dilute relative risk estimates towards the null. This highlights the importance of evaluating and incorporating information on exposure level when reviewing epidemiological literature on the health effects of benzene.

We also found evidence that increased risks of NHL are associated with work in petroleum refineries. Refinery work can be associated with exposure to many different chemical agents so this analysis by itself does not provide direct evidence that any single agent is responsible for the observed increase. However, benzene exposures have traditionally been seen in this industry and no other chemical commonly found at high levels in refinery work has been conclusively linked with NHL. Thus, while these results do not directly implicate benzene as a cause of NHL, they do support the findings of our analyses that looked directly at benzene exposure.

Our findings differ from a previous meta-analysis of benzene exposure and NHL which did not find evidence of an association.5 There were several reasons for this including our use of a few studies that were published only after the previous meta-analysis was published and our exclusion of studies where exposure was based solely on death certificate information. The primary reason however was our selection of relative risks for highly exposed groups instead of those for all exposure groups combined. In general, this resulted in us using higher relative risks for several studies than were used in the previous meta-analysis. For example, for the Bloemen et al and Collins et al studies we used the relative risks of 2.15 for >10 years of exposure group and 1.80 for >40 days of exposure to >100 ppm group, respectively. In the previous meta-analysis, relative risks for all exposed groups combined were used for both of these studies and these were markedly lower than the ones we used (1.06 and 0.89, respectively).

Our analysis of refinery studies provided some interesting results with regards to the healthy worker effect. In the analysis of all refinery studies combined, the relative risk rose from 1.21 (95 CI 1.00 to 1.46; p  0.02) to 1.42 (95 CI 1.19 to 1.69; p<0.001) after adjustment for this bias. These results provide a good example of the potential importance of evaluating and adjusting for the healthy worker effect in occupational mortality studies.

Despite our analyses of the healthy worker effect and the dilution of exposed cohorts with lesser exposed workers, several other important biases could have affected our results. For example, our results may not represent an effect of benzene but could be due to some chemical or other exposure that is highly correlated with benzene exposure. Several studies have reported a high correlation between benzene exposure and certain other chemicals such as toluene and xylene.29 30 However, several biological observations suggest that benzene is responsible for the elevated relative risks we found. First, benzene has been shown to have the ability to produce chromosomal and genetic changes important to NHL induction. Second, benzene is a known human carcinogen and a known bone marrow toxin.4 None of the other agents closely correlated with high benzene exposure have shown similar effects. Third, benzene has been linked to lymphomas in several animal studies including the 1986 National Toxicology Program carcinogenicity bioassay of benzene.3134

Many other agents have been linked to NHL in some studies, but for several reasons are not likely to be important confounding variables in our analysis. Confounding variables must be associated with both exposure and disease. For many of the factors linked to NHL (eg, some pesticides, certain dietary and lifestyle variables), positive associations have not been consistent from study to study and a true causal link may not exist.35 Other factors (eg, family history of NHL, autoimmune disorders, viral infections) are probably too rare to substantially impact our analyses. Still other factors may be associated with NHL, but are not strongly associated with benzene exposure or refinery work and therefore would not act as important confounders. As a whole, while we cannot completely exclude the possibility that some other agent is causing the effects we identified, most evidence suggests the major causative agent is benzene.

Other potential biases could result from the inclusion of case-control studies in our analysis of benzene and NHL. Factors such as recall bias or bias in the selection of controls can affect case-control studies. However, in our analyses based on study design, the summary relative risks were similar when we assessed case-control and cohort studies separately. If substantial bias affected case-control studies more than cohort studies we would expect these estimates to be markedly different from each other. Also, in our analysis of high exposure studies excluding those based primarily of self-reported data (the analysis where we found our highest relative risks of 2.12 and 2.26), the two case-control studies in this analysis were nested in cohort studies. As such, many of the potential problems that can occur with un-nested studies would not likely have occurred in these studies.

Another issue that could impact the studies included in our meta-analysis is the varying definitions of NHL that were used from study to study. As explained above, we included studies of both NHL and studies of lymphosarcoma/reticulosarcoma. Various ICD or other coding classifications were used including the International Classification of Diseases (ICD), the Rappaport classification, the Revised European-American Lymphoma classification, and the National Cancer Institutes Working Formulation system. Most studies used ICD7–9 codes 200 and 202. The use of different classifications has added some complexity to the diagnosis of NHL. Wong and Raabe provide a thorough review of this issue.6 Importantly, the inclusion of NHL subtypes unrelated to benzene would bias relative risk estimates towards the null, not towards the positive associations we identified. Other errors in diagnosing NHL may have also occurred; however, the diagnosis of cancer was done independently of benzene exposure. As such, any misclassification of disease would have likely been non-differential and therefore biased relative risk estimates towards the null, not towards the positive effects we identified. Similar effects would likely be seen with misclassification of benzene exposure or the use of exposure metrics that are not truly associated with NHL risk. In all studies benzene exposure was assessed independently of disease status; thus, bias from errors in misclassification would most likely be towards the null, not towards the positive associations we identified.

Another potential bias that can impact meta-analyses is publication bias, the tendency of journals or researchers to publish statistically significant effects.36 In our analysis, no obvious asymmetry was identified in the funnel plots,37 and no evidence of publication bias was seen in Egger’s or Begg’s tests. Further evidence that publication did not have a substantial impact on our analysis is the large number of results that were not statistically significant. Only four of the 43 studies (9) we included in our analyses reported statistically significant positive effects (Tables 2, 3, 5). This small percentage suggests that for this particular topic, there was not a strong tendency to only publish statistically significant positive results. It should be noted that none of the methods we used to assess publication bias are completely reliable and publication bias may still be occurring despite their findings. However, as a whole, the results of these tests combined provide evidence that publication bias was not responsible for the positive associations we report.

Some heterogeneity was seen in some of the meta-analyses we performed, particularly in those of refinery studies. As discussed above, some of this may have been due to including very low exposed workers in exposed groups in some studies but not in others. Some of this may have also been due to the differences across studies in assessing exposure and classifying NHL discussed above. Other sources of heterogeneity include other differences in study methodology or differences across study populations.

In conclusion, the results of these analyses suggest that benzene causes NHL and failure to exclude unexposed workers from “exposed cohorts” and failure to incorporate information on the healthy worker effect could lead to important effects being missed. While our meta-analysis of refinery work does not directly implicate benzene as a cause of NHL, the benzene exposures that can occur in this industry, combined with the lack of any other obvious or likely causative agent supports the hypothesis that benzene is the agent responsible for the effects we identified. Other human, animal, and laboratory data linking benzene to NHL and immunotoxicity provide further support and biological plausibility to our findings. Misclassification of both exposure and NHL could have biased the individual studies and our meta-analysis as a whole. However, these biases were most likely non-differential and thus not likely to cause the positive effects we identified. Publication bias may also impact meta-analyses such as this one, but the funnel plots and statistical tests we performed, and the very small number of statistically significant results used in these analyses, all combine to suggest that this bias was also not responsible for the positive associations reported here.

Main messages

  • Benzene may cause non-Hodgkin’s lymphoma.

  • Refinery work may be associated with an increased risk of non-Hodgkin’s lymphoma.

  • Biases from the healthy worker effect and exposure misclassification may have masked this effect in some studies.

Policy implication

  • Risk assessments for environmental and occupational standard setting for benzene should incorporate its possible effects on non-Hodgkin’s lymphoma incidence.


Northern California Center for Occupational and Environmental Health and NIH grants P42ES04705, R01ES06721, and R01CA104862 (MT Smith) and National Institute for Occupational Safety and Health Training Grant T42 OH008429 (RM Jones).

The views expressed are those of the authors and do not necessarily represent those of the Office of Environmental Health Hazard Assessment, the California Environmental Protection Agency, or the State of California.


  1. 1.
  2. 2.
  3. 3.
  4. 4.
  5. 5.
  6. 6.
  7. 7.
  8. 8.
  9. 9.
  10. 10.
  11. 11.
  12. 12.
  13. 13.
  14. 14.
  15. 15.
  16. 16.
  17. 17.
  18. 18.
  19. 19.
  20. 20.
  21. 21.
  22. 22.
  23. 23.
  24. 24.
  25. 25.
  26. 26.
  27. 27.
  28. 28.
  29. 29.
  30. 30.
  31. 31.
  32. 33.
  33. 33.
  34. 34.
  35. 35.
  36. 36.
  37. 37.
  38. 38.
  39. 39.
  40. 40.
  41. 41.
  42. 42.
  43. 43.
  44. 44.
  45. 45.
  46. 46.
  47. 47.
  48. 48.
  49. 49.
  50. 50.
  51. 51.
  52. 52.
  53. 53.
  54. 54.
  55. 55.
  56. 56.
  57. 57.
  58. 58.
  59. 59.
  60. 60.
  61. 61.
  62. 62.
  63. 63.
  64. 64.
  65. 65.
  66. 66.


  • Funding: MTS has received consulting and expert testimony fees from law firms representing both plaintiffs and defendants in cases involving exposure to benzene.

  • Competing interests: None declared.