Article Text


Is it feasible to construct a community profile of exposure to industrial air pollution?


OBJECTIVE An epidemiological investigation to assess the validity of residential proximity to industry as a measure of community exposure.

METHODS 19 Housing estates in Teesside (population 1991: 77 330) in north east England were grouped into zones: A=near; B=intermediate; C=further from industry. With residential proximity of socioeconomically matched populations as a starting point a historical land use survey, historical air quality reports, air quality monitoring, dispersion modelling data, and questionnaire data, were examined.

RESULTS The populations in zones A, B, and C were similar for socioeconomic indicators and smoking history. Areas currently closest to industry had also been closest for most of the 20th century. Historical reports highlighted the influence of industrial emissions to local air quality, but it was difficult to follow spatial pollution patterns over time. Whereas contemporary NOx and benzene concentrations showed no geographical variation, dispersion modelling of emissions (116 industrial stacks, traffic, and domestic sources) showed a gradient associated with industry. The presumed exposure gradient of areas by proximity to industry (A>B>C) was evident for all of zone A and most of zones B and C.

CONCLUSIONS It was feasible to assemble a picture of community exposure by integration of measurements from different sources. Proximity of residence was a reasonable surrogate for complex community exposure.

  • exposure assessment
  • community exposure
  • industrial pollution

Statistics from

In environmental epidemiology studies the assessment of exposure of a population remains both an on going area of concern when interpreting results and a major methodological challenge at the design stage. This is especially the case for retrospective exposure assessment: gaps in the spatial and temporal coverage, the relative crudity of past measurement techniques, and changes in techniques are among the many problems; and exposure assessment has therefore been earmarked as an area of future research.1 2

To analyse the associations between exposure to industrial air pollution and the health of residents living close to industries a comprehensive picture of past and contemporary exposure is required. However, to characterise air pollution and its variability in time and space, necessitates reliance largely on data collected for different purposes—for example, to fulfil statutory obligations. In the end, a judgement has to be made on how far the chosen exposure measures are likely to reflect the exposure of the population.

Based on the study of detailed accounts of the validity and reliability of exposure measurements and a systematic analysis of differential misclassification of exposure,2-6 the criteria we used to make judgements on whether the exposure surrogate residential proximity to industry was valid were: (a) Have we been able to characterise the temporal and spatial pattern of exposure in our study areas? (b) Have we been able to disentangle industrial sources from other pollution sources? (c) Have different methods of exposure assessment complemented one another to draw an overall picture? (d) How much have the data shown about the true exposure of populations in the past? (e) Was the initial assumption and design based on proximity of industry as a surrogate for residential exposure vindicated?

Often, polluting industries are located in areas with poor populations,7-9 among whom high smoking rates, poor housing conditions, and unemployment prevail. Because these socioeconomic characteristics are strong determinants of both exposures and ill health several approaches have been developed to take them into account at the design and analysis stage of epidemiological studies.8 10-18 We chose socioeconomically matched populations with the Townsend deprivation score and additional census variables, therefore correcting for socioeconomic influences at the design stage.

In summary, there is currently no single method of assessment of exposure that could conceptually or indeed in practice be considered as a gold standard. The question we consider in this paper is whether we can build up a useful picture of exposure of a community by combining data from various sources? We examined different methods of assessing exposure supplementing data with evidence from surveys and descriptive accounts of past pollution to create a picture of past and present exposure covering more than 40 years.

The paper arises out of an epidemiological study into the association between proximity of residence to an industrial complex and acute and chronic health outcomes.8 19 It was evident at an early stage that due to resource and conceptual constraints, direct contemporary measurements of exposure would be unable to capture the complex and varying exposures in a community, let alone exposures in the past. The study design was based, therefore, on proximity of residence as a surrogate for exposure. It was then necessary to verify and validate how good a surrogate of exposure the selected areas were.


The research underpinning this paper was driven by the hypothesis that populations living close to industrial operations are likely to experience greater pollution derived from the industry than those living further away. Previous research in the area had highlighted health differentials for all cause and respiratory mortality under the age of 65 in populations experiencing similar levels of socioeconomic deprivation,20-22 and differentials in exposure to air pollution were one of the possible explanations.

Teesside is an industrial conurbation with a population of over 400 000 (fig 1) characterised by large petrochemical operations as well as steel and coke works, which have been continuous for 150 years. Chemicals and oil based operations developed after 1920. Although industry has tended to relocate down river, moving away from residential populations, even in the 1980s and 1990s sizeable populations still lived close to industrial operations.

Figure 1

Housing estates in zones A, B, and C and selected monitoring sites for smoke and SO2.

The epidemiological study was based on a comparison of areas matched for socioeconomic characteristics. Neighbourhoods were grouped into near, intermediate, and further from industry reflecting the hypothesised exposure gradient. The hypothesised exposure gradient (A>B>C) was taking into account that concentrations of pollution were highest when no dispersion was taking place due to low wind speeds. The focus of this paper is what kind of picture of community exposure is it possible to create in disaggregated small neighbourhoods, and how far did this picture support the assumptions of residential proximity as a surrogate for exposure?

The identification of areas of varying proximity to industry involved a stepwise process to distinguish socially homogeneous localities.8 15 17 Nineteen housing estates in Teesside (in 1991 the population was 77 330) were selected. The other approaches for exposure assessment encompassed a historical land use survey, air quality monitoring including a review of historical pollution data and local authority reports, dispersion modelling, and a postal questionnaire survey.

Historical land use survey

A historical land use survey investigated the possibility that the location of industrial developments and closures affected the exposure of populations differently in the study zones. If in the past, industries had existed in areas of the study classified as intermediate or further from current industrial complexes serious exposure misclassification could have been introduced.


A land use survey based on local government archives was carried out for four areas encompassing our study area. Official annotated Ordnance Survey maps held in local authorities and archive material were used to produce “snapshot” maps for the years 1900, 1930, 1960, and 1990. The categories and subcategories of land use for our survey were those used by local authorities to classify potentially contaminated land.17


Of those areas closest to industry (zone A), residential development in neighbourhood 1 (Dormanstown) began in the 1920s on a green field site, the period after the war was marked by continued urban and industrial development (table 1). Area 3 (Grangetown South) remained purely residential, whereas areas 2 and 4 (Grangetown North and South Bank) have experienced large scale iron and steel production close to housing since 1900. Area 5 (Portrack and Tilery) had heavy industry alongside residential developments for most of the 20th century. Most of the areas defined as intermediate (zone B) or further away from industry (zone C) were green field sites until new housing estates were built. Early industrial developments were present only in three estates: Whinney Banks (area 13, zone B), Mandale (area 15, zone C), and Parkfields (area 16, zone C). Hemlington (area 14, zone C) had no housing until the late 1960s.

Table 1

Land use within 500 m of study areas at four times

Overall, the land use survey confirmed the extent to which areas currently closest to industry had been close to industry earlier, indicating the appropriateness of areas in zone A and few anomalies in our definition of zones B and C.

Air quality monitoring


Air quality monitoring data were collected from the following sources: (a) current routinely collected data; (b) surveys commissioned for this study; (c) historical, routinely available data from the annual reports of the medical officer of health and pollution control committee reports; and (d) the national air quality archive (partly overlapping with (a) and (c)).17 19 23-34 Here we present summaries of findings for smoke, sulphur dioxide, and insoluble solids. For contemporary monitoring we chose nitrogen dioxide and benzene to allow comparisons to be made between monitored data and data from dispersion modelling. Emission data for dispersion modelling were available only for nitrogen dioxide and benzene.


Historical data on smoke, sulphur dioxide, and insoluble solids

From much information retrieved from historical documents it was possible to observe trends of falling levels of pollution over time, but it was difficult to follow spatial patterns over time. For example, average monthly deposits of insoluble solids showed a sharp gradient between industrial (I), semi-industrial (SI), and residential (R) areas in tonnes per square mile: 1962 22 (I), 11 (SI), 6 (R); 1964, 18 (I), 9 (SI), 7 (R); 1966, 25 (I), 12 (SI), 7 (R).8 32 33 Table2 summarises the variation in smoke and sulphur dioxide in 1968. The site closest to industry (a) had highest levels, while lowest levels were found well away from industry (e). The high readings were in South Bank close to industrial operations, which was also frequently mentioned in local documentation as heavily polluted.24 28-31 35 36

Table 2

Mean concentrations of smoke2-150and SO2 in Teesside 1968 in summer and winter in μg/m3 (data source8 19)

During the late 1970s and early 1980s the number of monitoring stations was reduced and replaced by complex real time monitoring equipment. This modern equipment allowed measurements at lower levels of pollution with high temporal resolution. However, monitoring data were no longer available across a range of sites.

Contemporary monitoring of nitrogen dioxide

The spatial resolution of monitoring points for nitrogen dioxide was not sufficient to detect localised peaks—for example near busy roads—and nitrogen dioxide concentrations were found to vary little across the 10 monitoring points in the study area in 1993 (table3).

Table 3

Monitored concentrations of NO2 in Teesside (ppb)

Contemporary monitoring of benzene

Benzene concentrations at roadsides were about double those of the urban background and about four times higher than rural backgrounds (table 4). A diffusion tube survey in 1993 assessed ambient concentrations of volatile organic compounds in the areas under study. The range of measurements for benzene (0.5–7.4 ppb) was consistent with findings of other studies.25 26 28 Highest readings were in neighbourhoods nearest to industry. The mean in 1993 was 1.3 ppb (London 4.6 ppb).25 27 Derwent et al reported analyses of many short term excursions in a time series and indicated that benzene concentrations were increased when the wind was blowing from the north east or east downwind from one major chemical complex, pointing to potentially relevant contributions.25 Another approach to assess the relative impact of airborne volatile organic compounds other than traffic is to calculate the ratio of the compound and nitrogen dioxide.17 In the 1993 diffusion tube survey the benzene/NO2 ratio varied between 0.07 and 0.10 for all sites with the exception again of South Bank (area 4) and Grangetown (area 2) where ratios were 0.17 and 0.18 respectively, pointing to contributions from stationary sources.

Table 4

Monitored concentrations of benzene in Teesside (ppb)

Dispersion modelling of emissions


Air quality modelling used a Gaussian plume model (US EPA's industrial source complex short term model ISCST2) to calculate mean concentrations at ground level. The model was run for an area of 20 km by 15 km divided into a grid of 300 cells, of 1 km2 each (grid references NZ 405135 to 595135 to NZ 595275, fig 1) with hourly meteorological and emissions data. The model required data for atmospheric stability and mixing height, variables which are collected only at a few sites. Data from the nearest weather centre with full data was about 55 km north of the study area. One way to avoid the limitations of this approach would have been to compare local weather data with those from the location 55 km away, this was not done in this case.

For 116 industrial sources, data on stack height and diameter, gas exit temperature, gas exit velocity, and emission rate were provided by industry. Emission rates in the form of annual totals were extracted from the public register or were estimated on the basis of the mass burned for one natural gas based power station and several municipal waste and clinical incinerators.38

The model did not allow the inclusion of line sources, therefore road traffic was treated as an area source. Hourly exhaust emissions were estimated by summing the contribution from each road segment in each cell. This has been documented as a feasible option.39Rates of emissions of oxides of nitrogen (NOx) were estimated as a function of vehicle, road, and fuel, while benzene emission rates were estimated as a function of road type only. With natural gas as the predominant domestic fuel in the study area (gas zone A: 41%, B: 39%, C: 47%; coal zone A: 1%, B: 1%, C: 1%) domestic emissions were also treated as area sources. The total emission for each grid cell was estimated as the product of the average emission per household and the number of households in the cell from the 1991 census of population.


The average NOx concentration across the study area was estimated at 13.2 ppb, and the maximum value was predicted at 26.9 ppb within the area of a major chemical complex (fig 2 A, NZ 585215). Estimated concentrations were low with slightly higher values in the vicinities of two major chemical works. The model predicted an annual mean of 0.3 ppb for benzene with slightly higher values in Middlesbrough town centre; the maximum of 3.0 ppb occurred in the area of a chemical works (fig 2 B, NZ 575205).

Figure 2

(A) Modelled annual mean NOx concentrations (ppb) 1993. (B) Modelled annual mean benzene concentrations (ppb) 1993.

Table 5 shows the modelled annual mean contributions for the 19 study areas. A gradient of concentrations of NOx attributable to industrial activities was found, with greatest relative contributions in areas closest to industry. Concentrations of NOx in area 13 (Winney Banks, zone B) were higher than expected, this may be accounted for by proximity to a busy road interchange.

Table 5

Modelled mean annual NOxand benzene concentrations 1993 in study areas (ppb)

For benzene, industrial contributions to total concentrations were highest in area 1 (Dormanstown). Contributions of non-industrial sources for benzene varied only by a factor of about 3, whereas the contribution from industry varied by a factor of almost 30. An overall geographical gradient across estates was therefore apparent. Variations in total benzene concentration across the region were greater than for NOx. Monitored and modelled concentrations of benzene and nitrogen dioxide agreed well.

Indoor exposure, occupational exposure, and perception of exposure

As well as monitoring and modelling we incorporated evidence of people's own perceptions about pollution in living and working conditions. On the face of it such data may seem too limited in themselves. However, we think that they assist in building up a comprehensive picture of population exposure.


Of the 19 housing estates in Teesside, 12 (population 52 373) were the basis for a postal community survey. The questionnaire included items on socioeconomic circumstances, personal behaviour, health, and perception of risk factors influencing health.8 15 17


Data from the 1991 census and the postal survey showed that study areas experienced similar levels of socioeconomic deprivation.8 15 17 Table 6 shows selected data about living circumstances including surrogate indicators for occupational exposure, indoor air pollution, and perception of environmental pollution. The occupational experience of working in dusty industries varied for men, but not for women. Data on levels of smoking, and heating and cooking methods provided indirect indicators of indoor air pollution. Levels of smoking were similar across the zones. For pollution from heating and cooking, differences were found, but gradients were not in line with the study hypothesis of an exposure gradient near>intermediate>further. A steep gradient was found for the contemporary level of worry about air pollution and smells, with highest levels in zone A closest to industry.

Table 6

Comparability of study areas in zones A, B, and C and living circumstances (%)


With proximity of residence to industry as the basic design feature for assessment of exposure we have attempted to verify and validate the suitability of the area choices to reflect a suspected exposure gradient across zones so that near >intermediate> further. Our discussion centres around the five criteria outlined in the introduction.


Despite considerable effort we succeeded only in creating a jigsaw puzzle of community exposure with several of the pieces missing. However, a consistent pattern emerged. The evidence showed that residential exposure in areas close to industry has been and possibly still is, greater than in areas further away from industry. In particular zone A had higher exposures both in the past and in the present than did zones B or C. There was no indication that any of the boundaries of zone A should have been different to avoid misclassification of exposure. The hypothesised exposure gradient across Teesside was less clear cut for boundaries between zones B and C and north of the river Tees between zones A and C. One estate (area 16, Parkfields) might have been better included in zone A than in C.

The limited available data on smoking and use of fuels did not indicate a gradient in indoor air pollution indicting residential exposure in the hypothesised direction. Any variation in indoor pollution was therefore unlikely to explain a gradient in exposure to industrial air pollution in the direction A>B>C.

Although the overall trend in smoke and sulphur dioxide concentrations was dominated by the expected downward trend the data also suggested that this decline lagged behind the overall trend in those areas closest to industry. Because historical monitoring techniques for smoke were based upon the black staining of filters, appropriate for pollution dominated by combusted coal, the contribution of traffic to the long term trend over time was not easily documented. A key finding from dispersion modelling was that contributions from industry varied across the study area whereas contemporary contributions from non-industrial sources showed only limited variation.


Although historical documents consistently indicated industrial contributions as common knowledge, the contemporary contribution of industrial emissions was much harder to document. The strongest evidence in favour of a continuity of industrial contributions came from dispersion modelling, whereas the monitoring data carried messages of similarity rather than differences across the study area. Modern real time monitoring has improved the temporal resolution of data, but spatial resolution is now poor and predictions of spatial patterns have to be largely based on back trajectory analysis. The only state of the art monitoring site currently operating in Teesside is located in zone B.

Overall, it has been easier to assess the contributions of industrial pollution across the area historically than now due to the limited spatial resolution of current monitoring.


Both monitoring and modelling of air pollution were able to provide information on single pollutants. Although dispersion modelling can lead to a circular argument in the attribution to point sources, monitoring data supported the findings of modelling. Monitoring data on the other hand did not allow a detailed analysis of the levels of spatial resolution, but this was available from dispersion modelling. Exposure assessment by proximity to industry, the historical land use survey and the annual reports of the medical officer of health dealt with exposures to industrial pollutants more generally. If we had used other methods of exposure assessment—for example biological indicators, we would have been able to capture recent exposures (days, weeks) in more detail, but would still have been unable to capture exposures to unknown mixtures of pollutants of 20 or 30 years ago. We concluded that a range of exposure assessment methods was both essential and enabled us to build up a picture of historical and contemporary exposure.


With exposure assessment in epidemiological studies we attempt to get as close as possible to the true exposure of populations, avoiding exposure misclassification and other biases. The questionnaire survey highlighted the strong effect of sex with large differences in the proportion of men and women who worked in dusty industries. Historical documents pointed to the importance of variations in employment rates. During the 1980s the direct link between residence and employment was weakened with those working in heavy industry tending to commute from further away. We would argue that male exposure to air pollution in the past would have been dominated by occupational rather than residential exposure, whereas today residential exposure is more likely to dominate. By contrast women's past as well as contemporary exposure is likely to be dominated by residential proximity to industry. The available data on contemporary indoor air pollution did not indicate higher concentrations in areas near industry in Teesside. However, no data were available on heating and cooking patterns in the past. Overall therefore, for women the relative contribution of industrial emissions to their total exposure is likely to have been considerably greater than for men.


We suggest that the use of proximity of residence to industry is a defensible surrogate for true exposure of the community. The choice of areas was generally supported by other sources of information on exposure. In particular the neighbourhoods assigned to zone A seemed to be justified. At the same time the boundaries between the intermediate and further zones were less clear. The evidence suggested that three of the 19 estates (in zones B and C) may have been mis-allocated. A recent study with the proportion of land used by industry per ward as a proxy for industrial air pollution found that after controlling for socioeconomic deprivation a greater proportion of industrial land use was associated with higher mortality.40 Residential proximity was the only method that allowed matching on socioeconomic factors thereby accounting for the major contributors to ill health at the design stage. Monitoring and dispersion modelling data indicated variations in pollution which were compatible with the idea of land use and proximity being surrogates for exposure.

We do not know how feasible the comprehensive data collection we were able to use here would be elsewhere. The picture of ambient air pollution was more than the sum of its component parts. We used residential proximity to industry as the initial method of assessing exposure. With this approach in otherwise socioeconomically matched areas to assess exposure of populations near industry has the advantage that boundaries can reflect complex information from other data sources such as historical land use and employment patterns. Yet, proximity did not allow us to measure exposure. But the integration of measurements from many different sources made it feasible to assemble a picture of community exposure that verified the assumptions implicit in the use of proximity.


The contributions of Ann Rooke and Paula Rutter in providing the maps, Lesley Sharpe for the dispersion modelling, Debbie Nash for the land-use survey, and Jeff Duffield, Catherine Hall, Jim Longstaff, Anthony Luke, David Sutherland, and Jacqui Tate for contributions to the chapter containing environmental data in the TEES report are gratefully acknowledged. Funding was received from Middlesbrough Borough Council (City Challenge), South Tees Health Authority, Cleveland Family Health Services Authority, Cleveland County Council, and Northern and Yorkshire Regional Health Authority.


View Abstract

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.