Article Text
Abstract
Objectives To identify risk factors that contribute to outbreaks of COVID-19 in the workplace and quantify their effect on outbreak risk.
Methods We identified outbreaks of COVID-19 cases in the workplace and investigated the characteristics of the individuals, the workplaces, the areas they work and the mode of commute to work, through data linkages based on Middle Layer Super Output Areas in England between 20 June 2021 and 20 February 2022. We estimated population-level associations between potential risk factors and workplace outbreaks, adjusting for plausible confounders identified using a directed acyclic graph.
Results For most industries, increased physical proximity in the workplace was associated with increased risk of COVID-19 outbreaks, while increased vaccination was associated with reduced risk. Employee demographic risk factors varied across industry, but for the majority of industries, a higher proportion of black/African/Caribbean ethnicities and living in deprived areas, was associated with increased outbreak risk. A higher proportion of employees in the 60–64 age group was associated with reduced outbreak risk. There were significant associations between gender, work commute modes and staff contract type with outbreak risk, but these were highly variable across industries.
Conclusions This study has used novel national data linkages to identify potential risk factors of workplace COVID-19 outbreaks, including possible protective effects of vaccination and increased physical distance at work. The same methodological approach can be applied to wider occupational and environmental health research.
- COVID-19
- Epidemiology
- Public health
- Statistics
- Disease Outbreaks
Data availability statement
Data are available on reasonable request. The data have not been made available due to the sensitivity of workplace-level data. An application for data access can be made to the UK Health Security Agency. Data requests can be made to the Office for Data Release (https://www.gov.uk/government/publications/accessing-ukhsa-protected-data/accessing-ukhsa-protected-data) and by contacting DataAccess@ukhsa.gov.uk. All requests to access data are reviewed by the Office for Data Release and are subject to strict confidentiality provisions in line with the requirements of: the common law duty of confidentiality, data protection legislation (including the General Data Protection Regulation), Caldicott principles, the Information Commissioner’s statutory data sharing code of practice, and the national data opt-out programme. Access to the National Population Database can be requested by contacting: npd@hse.gov.uk.
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
Statistics from Altmetric.com
WHAT IS ALREADY KNOWN ON THIS TOPIC
There are many risk factors that affect severity of disease caused by SARS-CoV-2 infection for individual infections. However, despite frequent reports of COVID-19 outbreaks in the workplace, there is limited evidence on factors that contribute to the risk of SARS-CoV-2 transmission in the workplace, with existing studies focusing on high-resolution analysis of individual workplace outbreaks with small sample sizes.
WHAT THIS STUDY ADDS
This is the first study to use large national data linkages to consider the association between where workers lived, worked and modes of commute and the risk of COVID-19 outbreaks in the workplace. Increased levels of vaccination and reduced physical proximity are shown to reduce the risk of workplace outbreaks. Increased levels of deprivation in the areas where workers lived showed increased risk of workplace outbreaks.
HOW THIS STUDY MIGHT AFFECT RESEARCH, PRACTICE OR POLICY
The findings contribute to the evidence base around social distancing measures in workplaces and vaccination at reducing transmission/infection. By identifying other risk factors associated with workplace outbreaks, this can inform workplace health and safety messaging. The innovative data linkages will support future workplace health research.
Introduction
Throughout the COVID-19 pandemic, outbreaks of cases in workplaces have frequently been reported worldwide.1–3 Some industries have seen elevated rates of outbreaks, including healthcare settings, where the risk of exposure to patients with COVID-19 is high, and essential services, where work is unable to be carried out from home. The impact of workplace outbreaks is twofold. First, workplace outbreaks contribute to the prevalence of COVID-19 in the community. This is evident in the role workplace closures played in reducing transmission.4 The second impact is on the workforce. Workplace outbreaks result in potentially large proportions of the workforce being unable to work effectively,5 which can have financial impacts on the company, the economy6 and on workers themselves.7
The frequency and size of outbreaks vary by time and space, depending on the community prevalence of COVID-19. This is the main exposure contributing to outbreaks in the workplace, but there are other potential exposures that could amplify the risk of outbreaks occurring for given levels of COVID-19 prevalence. Demographic factors that have been associated with risk of infection include age, gender, ethnicity and deprivation.8 Workplace factors that are likely to influence outbreak risk include physical proximity of workers, physical environment, including ventilation in the workplace, work activities, contract stability of the workforce and characteristics of the local area.
Workplaces present opportunities for a range of infectious diseases to spread, including respiratory syncytial virus,9 influenza and norovirus.10 Infectious diseases can be introduced through providing services to infectious individuals or workers attending workplaces while infectious. However, to study the influence of risk factors on potential transmission requires disease surveillance data at levels of detail previously unavailable. For COVID-19, however, mass testing of both the community and the workforce has provided a proxy measure for prevalence of infection in the community and the rate of outbreaks in workplaces, providing a unique opportunity to address questions on the risk factors associated to workplace outbreaks of COVID-19.
To mitigate the workplace impact of future waves of COVID-19, and to ensure accurate and reliable health and safety messaging can be issued to workplaces, it is essential to understand the different risk factors associated with these outbreaks. By using a combination of employee, workplace and population characteristics, our objective was to identify risk factors associated with workplace outbreaks of COVID-19 in different industry sectors. The findings of the study aim to support future pandemic preparedness.
Methods
Setting
This study examined variation in workplace COVID-19 outbreak rates by Middle Layer Super Output Area (MSOA—geographical areas with resident populations between 5000 and 15 000 persons11) in England according to workplace characteristics, the characteristics of the local population of the workers, and the neighbourhood characteristics around the workplace. We considered three time periods for the study: 20 June 2021–30 September 2021 (Delta variant dominant strain), 7 December 2021–20 February 2022 (Omicron variant dominant strain),12 and 20 June 2021–20 February 2022 (ie, overall, encompassing both periods and the intervening period). The period between Delta and Omicron was not assessed independently due to a mixture of variants circulating. These study periods were considered since testing policy was relatively stable.
Data sources
We used population data to investigate potential risk factors for workplace outbreaks by sector. The data describe demographics of employees and residents in England; characteristics of workplaces in England and COVID-19 testing and vaccination data.
Demographic characteristics were obtained from the 2011 Census conducted by the ONS.13 The variables extracted from the census were age, gender, ethnicity and modes of commute to work. We also used data on indices of multiple deprivation (IMD) from the Department for Levelling Up, Housing and Communities.14
Workplace COVID-19 clusters were identified from the UKHSA Contact Tracing and Advisory Service (CTAS) data.15 These data collected information on individuals who tested positive for COVID-19 in England and were contacted by the contact tracing service. For individuals who tested positive (and attended their workplaces 3–7 days prior to their positive test), their places of work were coded using the unique property reference numbers. This enabled workplace clusters to be counted based on two or more cases of self-reported COVID-19 occurring at the same workplace building within a rolling 6-day period, using the method described in Abbey et al.16 We use clusters as a proxy measure for workplace outbreaks in this analysis.
Mass testing data from UKHSA was used to measure community case-incidence over time.17 During the study period, 20 June 2021–20 February 2022, testing policy was constant across England. Therefore, infection incidence (the number of new infections each day) is assumed to be proportional to case incidence (the number of positive tests each day) across this period, allowing case incidence to be used as a proxy for the force of infection in the community. The National Immunisation Management System data from UKHSA, which contains COVID-19 vaccination records,18 provides data on all vaccination doses given in England, from which we calculated the proportion of working age individuals vaccinated.
The National Population Database (NPD)19 workplace layer is derived from the Inter-Departmental Business Register20 and includes estimates of the number and location of workplaces and workers in Great Britain by industry type using the Standard Industrial Classification (SIC). The number of workplaces in each Workplace Zone (a geographical area that is nested within an MSOA) was calculated by industry type, and subsequently aggregated to MSOA level. Workplaces with identified COVID-19 outbreaks were linked to the NPD workplace layer using business name, address and postcode information16 so that an SIC code could be applied. Workplace information was aggregated into 34 COVID-19-related sector groups,21 which were then aggregated into 11 broad industry sectors. Of the total number of workplace outbreaks detected, 42% were excluded since these could not be reliably linked to a specific industry sector when merging CTAS data with NPD data.
Other workplace data used in this study included data on physical proximity (mean proximity score) in the workplace at the two-digit SIC grouping level,22 the proportion of workers on permanent contracts at the two-digit SIC grouping level,23 and the classification of each workplace MSOA by mobility class,24 which provides a refined definition of rural/urban classification.
Data preparation
Details of data preparation are contained in online supplemental material. The variables used in this study fall into five broad categories: workplace, COVID-19 case and outbreak rates, employee (characteristics of the employees, restricted to those who travel into workplaces), work commute modes and neighbourhood (characteristics of people who live near the workplace). Online supplemental table S1 lists the variables and groupings.
Supplemental material
Statistical methods
Our outcome of interest was the workplace COVID-19 outbreak rate in the MSOA, which we defined as the number of active outbreaks in workplaces in an MSOA divided by the total number of workplaces in that MSOA. Since the number of active outbreaks is an integer valued count, these can be considered as samples from a counting process, such as a negative binomial distribution. Therefore, instead of modelling the outbreak rate directly, we modelled the time-varying number of active outbreaks in the workplace as the outcome variable of a negative binomial generalised linear model, and introduced the log transformed number of workplaces as an offset to transform this into a rate.
The risk factors that we considered fall into three distinct groups: workplace, employee and work commute characteristics. These risk factors were selected by health and safety experts for examination in the current study. The full list of risk factors examined is shown in online supplemental table S1, alongside common confounder variables.
For each risk factor, we fitted three models. The first included only the risk factor (model 1: unadjusted). The second extended model 1 by including the neighbourhood variables (characteristics of individuals living near the workplace) and the log-transformed 7-day lagged: values of COVID rates among employees, outbreak rate in that industry and outbreak rate in other industries (model 2: minimal adjusted). The third extended model 2 by adding variables from other risk factor groups which are plausible causal antecedents of the risk factor of interest, that is, they do not occur later down the causal pathway (model 3: priority risk factor-fully adjusted). Our confounder sets were decided on by discussion among the author team, which included consideration of plausible directed acyclic graphs (DAGs) representing the relationships between the study variables (online supplemental figure S1). Acknowledging the complexity of the underlying process and the challenge of identifying a single DAG, we selected confounder sets which would be compatible with the principles described by VanderWeele.25 Further details on the statistical models are provided in online supplemental material.
Note that models 1 and 2 included the same explanatory variables for all risk factors. Model 3 used different explanatory variables depending on which group of risk factors the factor of interest falls within. When measuring the effect of vaccination, we considered the periods where the Delta and Omicron variants were dominant in the population independently, since different COVID-19 variants have different levels of vaccine effectiveness.26
Each industry was modelled independently, because different testing policies across industry sectors lead to different case ascertainment rates. Therefore, outbreaks are not directly comparable across industry sectors. For education, we removed school holiday periods, excluding an extra week after the holiday period to account for delays from infection to cases being detected. This resulted in outbreaks between 18 July 2021 and 12 September 2021, 24/ October 2021 and 31 October 2021, 19 December 2021 and 9 January 2022 being excluded for education.
Results
Descriptive results
Across the study period, 1149 007 workplaces were included. The study period contains 36 weeks, and we have 6721 distinct MSOAs and 11 industry sectors. Altogether, there were 801 874 data points, where each data point corresponds to a unique combination of MSOA, week and industry. Note that not all permutations are included, since for some MSOAs not all industry sectors are present.
Tables 1 and 2 show a descriptive summary of the data used in this study by industry. The mean is based on the number of workplaces, rather than the number of employees. Figure 1 shows the outbreak rate at a national level over time.
Main results
Tables 3 and 4 show the results for the priority risk factor fully adjusted model. Online supplemental tables S2 and S3 show the results for the unadjusted and minimal adjusted models for all sectors, respectively. Table 3 contains results for the five industry sectors identified as high importance by HSE: education; human health and social work activities; services; transport, distribution and warehousing; and manufacturing, with the remaining six industry sectors in table 4. Comparing the unadjusted, minimal adjusted and the priority risk factor fully adjusted model, some of the results are substantially different, demonstrating that the identified causal structure has a large influence.
Workplace characteristics
Across all industries with sufficient data available, increasing physical proximity in the workplace was associated with increased risk of outbreaks, except for construction where the effect was not statistically significant (table 4). Increasing the proportion of workers on permanent contracts was associated with reduced risk in all industries except construction, where it was associated with an increase in risk (table 4). The relationship with mobility class varied across the industries but was generally found to be a significant risk factor.
Employee vaccination
Increasing the cumulative proportion of employees with two or more doses of vaccination, during the Delta and Omicron periods, was associated with reduced risk (or no significant effect) for all industries except education (table 3). In education, vaccination was associated with increased risk during Delta and no significant effect during Omicron.
Demographic characteristics
Across every industry, having higher proportions of individuals in the 60–64 age group, in comparison to the baseline group (30–44), was associated with substantially reduced risk of outbreaks (or no significant effect). In all industries, a higher proportion of individuals in either the 18–29 age group or the 44–59 age group, relative to the baseline group, was associated with an elevated risk. The presence of higher proportions of individuals from black/African/Caribbean ethnicities, relative to the baseline group (white ethnicity), were associated with increased outbreak risk (or no significant effect) in all industries. Mixed/multiple/other were associated with reduced risk, relative to the baseline group, in all industries except public service activities and agriculture, forestry and fishing (table 4). The trend across the remaining groups of ethnicities varied across industries. A higher proportion of female workers was associated with reduced risk (or no significant effect) in all industries except for education, transport, distribution and warehousing, and waste management and remediation. In all industries, increasing the IMD quintile (reducing deprivation) was associated with reduced risk of workplace outbreaks, except for mining and quarrying and agriculture, forestry and fishing, where the effect was not significant (table 4).
Modes of commute to work
Work commute modes generally had a significant association with the risk of workplace outbreaks across all industries, though the association varied substantially across industries. Taxi (or vehicle passenger) was associated with increased risk (or no significant effect) across all industries except education, utilities and public service activities (tables 3 and 4). Trains were associated with reduced risk (or no significant effect) in all industries. The results for bus, tram or metro use varied.
Discussion
Key results
We set out to identify potential risk factors associated with workplace outbreaks of COVID-19 in different industry settings. We found increased physical proximity in the workplace was associated with increased risk of workplace outbreaks in most industries. Airborne transmission of COVID-1927 will be highly dependent on physical proximity, which likely contributes to the reduced outbreak risk in industries with lower physical proximity. Other workplace characteristics, such as the proportion of workers on permanent contracts and the mobility class of the local MSOA, were found to have substantial variation across industries.
In most industries, increased vaccination uptake was associated with reduced risk of workplace outbreaks. Despite primarily protecting against severe disease rather than infection, vaccination has been shown to reduce the length of the infectious period in infected individuals,28 which could reduce the length of time workers are exposed to infectious individuals, thus reducing the chance of infection. During the Delta period, vaccination was associated with increased risk in Education. However, this is likely driven by the lack of vaccination in students, with children under 16 ineligible for vaccination during the Delta study period,29 who are the main exposure for individuals working in education. Combined with the prevalence of Delta growing during the study period, the model cannot distinguish between the impact of increased vaccine uptake (which increases monotonically with time) and increased transmission risk.
Employee demographic risk factors varied across industry, but for most industries, a higher proportion of black ethnicities, 45–59 years, living in deprived areas (areas with reduced IMD), were associated with increased outbreak risk, and a higher proportion of 60–64 years in the workforce were associated with reduced outbreak risk in the majority of industries. The increased risk associated with a higher proportion of black ethnicities is consistent with community-based studies30 and could be linked to increased risk of severe infection leading to higher ascertainment rates. The increased risk associated with more deprived areas has been observed in other studies,31 potentially driven by larger household sizes or necessity to work while unwell. The reduced risk in the oldest age group may be related to increased public health messaging around COVID-19 risks for vulnerable individuals during the pandemic.32
Workplaces where a higher proportion of employees were expected to use Taxis (or travel as a passenger in a private car) and other forms of transport were found to have increased outbreak risk for many industries, with trains or single occupancy typically associated with the lowest risk. Some of the reduced risk associated with train travel could be due to increased awareness of social distancing on trains (after strict social distancing measures had been in place33), or repeat infections over time on public transport leading a higher level of natural immunity at the time of the study.
Limitations
This analysis used multiple data sources linked by geographical unit (MSOA) with, on average, 2532 workers per unit. Ideally, analysis would be conducted at an individual or workplace level, but such data are not readily available at the national level. Therefore, the employee demographics had to be approximated, using the expected demographics of employees in the MSOA where people live and industry distribution in the MSOA where people work. If some workplaces have very atypical employee demographics, this would not be captured. Additionally, because the required variables were not yet available in the 2021 Census, the 2011 Census was the closest data that could be used to capture the demographic variables. If some locations have seen substantial changes since 2011, this may not accurately reflect the current conditions. The nature of the data creates the possibility of ecological bias, whereby relationships observed at population-level differ from those at the level of individuals or workplaces.
Motivated by causal assumptions elucidated using DAGs, we have adjusted for key confounders. However, we have not considered causal relationships between variables in the same group (employee characteristics, workplace characteristics and modes of work commute). Moreover, the challenge of creating a DAG capable of capturing the complex network of factors relating to this question is high. We fitted a series of models for each candidate risk factor to investigate the impact of different confounder sets on the estimated relationships.
The outcome variable in our analysis, workplace outbreaks of COVID-19, uses clusters of COVID-19 cases in the workplace as a proxy. Some of these may be linked to false positive tests, and some may be community-acquired infections rather than workplace outbreaks. We identified workplace ascertainment rate as an important factor. However, this variable is unobserved, so we were unable to compare risk factors across industry sectors. We assumed that the ascertainment rate of workplace COVID-19 cases depends only on the type of workplace and not the location. There may be some variation due to local test availability which is not captured here.
Data on workplace physical proximity and the proportion of employees on permanent contracts (permanence) is only available at the two-digit SIC grouping level. Therefore, we assume that all workplaces in each two-digit SIC grouping have the same physical proximity and permanence. Therefore, the estimated effect size reflects the risk in the different sub-industries in each industry sector, rather than individual workplaces. For some industry sectors (agriculture, forestry and fishing, human health and social work, mining and quarrying, public service activities, and waste management and remediation), there are too few two-digit SIC groupings, so physical proximity and permanence are not considered for these industries.
We have only been able to assess risk factors where data were available. Many other risk factors will influence the risk of COVID-19 outbreaks in the workplace, including employee behaviour and workplace specific variables such as ventilation or humidity. Measuring such variables at a population level, as required by this study, is likely infeasible. Such analysis relies on site-specific outbreak investigations.2 34
Interpretation
This work identifies risk factors associated with risk of COVID-19 outbreaks in workplaces for different industry settings. This study was performed at a population level, so does not reflect individual level risk. The employee characteristics and work commute modes show how the expected characteristics of the local workforce are associated with the risk of workplace outbreaks, rather than the risk of being involved in an outbreak for individuals. The workplace characteristics reflect how the characteristics of sub-industries within the industry sector are associated with the risk of workplace outbreak occurrence. The mobility class characteristics reflect how the local area where the workplace was located is associated with the risk of workplace outbreaks.
Generalisability
Although this study focused on COVID-19, the methods used may be generalisable to other communicable and non-communicable diseases. The range of potential risk factors identified, and methods for linking these data at a population level, are likely to be relevant for many diseases. However, the influence of each risk factor will vary across different diseases.
Conclusion
Workplace outbreaks during the COVID-19 pandemic had substantial health and economic impacts. The novel national data linkages in this study have enabled the identification and investigation of workplace settings with increased risk of COVID-19 outbreaks in England. This innovative approach to investigate multiple and complex social, environmental and behavioural determinants associated with where people live and work, can support future pandemic preparedness and is applicable to wider occupational and environmental health research more generally.
Data availability statement
Data are available on reasonable request. The data have not been made available due to the sensitivity of workplace-level data. An application for data access can be made to the UK Health Security Agency. Data requests can be made to the Office for Data Release (https://www.gov.uk/government/publications/accessing-ukhsa-protected-data/accessing-ukhsa-protected-data) and by contacting DataAccess@ukhsa.gov.uk. All requests to access data are reviewed by the Office for Data Release and are subject to strict confidentiality provisions in line with the requirements of: the common law duty of confidentiality, data protection legislation (including the General Data Protection Regulation), Caldicott principles, the Information Commissioner’s statutory data sharing code of practice, and the national data opt-out programme. Access to the National Population Database can be requested by contacting: npd@hse.gov.uk.
Ethics statements
Patient consent for publication
Ethics approval
This study was conducted as part of the PROTECT National Core Study. Work was undertaken in line with national data regulations.
Supplementary materials
Supplementary Data
This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.
Footnotes
Twitter @CamillaWoodrow
Contributors Guarantor: CEO. Conceptualisation: CEO, RA, TW, OD, MG, JW, TA and YC. Literature review: RA, MG, JW, TA and YC. Data curation: CEO, RC, RP, JD, OJ, TB and CW-H. Formal analysis: CEO, RP and MT. Methodology: CEO, RP, OJ, MT, MG, JW and OD. Software: CEO, RP, MT, OJ, CW-H, TB, RC and JD. Visualisation: CEO. Writing–original draft preparation: CEO, OD, RA, JW, MG, TA and YC. Writing–review and editing: CEO, OD, RA, JW, MG, TA, YC and TW.
Funding This work was funded through the PROTECT (Partnership for Research in Occupational, Transport and Environmental COVID Transmission) COVID-19 National Core Study on Transmission and Environment, managed by the Health and Safety Executive on behalf of HM Government.
Disclaimer The contents of this paper, including any opinions and/or conclusions expressed, are those of the authors alone and do not necessarily reflect Health and Safety Executive policy or UK Health Security Agency or Office the National Statistics views.
Competing interests None declared.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.