Article Text

Download PDFPDF

Systematic review and meta-analysis of interventions aimed at enhancing return to work for sick-listed workers with common mental disorders, stress-related disorders, somatoform disorders and personality disorders
  1. Mai Bjørnskov Mikkelsen1,
  2. Michael Rosholm2,3
  1. 1 Department of Psychology and Behavioural Sciences, Aarhus University, Aarhus, Denmark
  2. 2 Department of Economics and Business Economics, Aarhus University, Aarhus, Denmark
  3. 3 IZA, Bonn, Germany
  1. Correspondence to Professor Michael Rosholm, Department of Economics and Business, Aarhus University, Aarhus DK8210, Denmark; rom{at}econ.au.dk

Abstract

Objectives Mental disorders are associated with significant functional impairment, sickness absence and disability. The consequences of sickness absence warrant investigation into interventions aimed at enhancing return to work (RTW) for workers with mental disorders. The present systematic review and meta-analysis aim to synthesise evidence on the effectiveness of interventions aimed at enhancing RTW in sick-listed workers with mental disorders.

Methods EconLit, Embase, PsychInfo, PubMed, Svemed+ and Web of Science were searched for peer-reviewed, randomised or controlled studies assessing employment-related outcomes of interventions. A meta-analysis was conducted and meta-regressions were performed to explore prespecified potential sources of heterogeneity between studies.

Results The literature search yielded 3777 publications of which 42 (n=38 938) were included in the systematic review and 32 (n=9459) had appropriate data for the meta-analysis. The pooled effect size (95 % CI) was 0.14 (0.07 to 0.22). Meta-regressions revealed that the heterogeneity could not be attributed to study quality, timing of the intervention or length of the intervention. However, it could be partly explained by number of components included in the intervention, if the intervention included contact to the work place and by the disorder targeted by the intervention.

Conclusions The results reveal strong evidence for interventions including contact to the work place and multicomponent interventions and moderate evidence for interventions including graded RTW. In addition, the results provide strong evidence for interventions targeting stress compared with interventions targeting other mental disorders. The findings point to important implications for policy and design of future interventions.

  • sickness absence
  • mental health
  • intervention studies

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Introduction

Mental disorders are prevalent in the labour force in developed countries and associated with significant personal and public expenses.1 Inherent to the definition of mental disorders is that they imply significant distress and functional impairment for the person affected.2 3 This is mirrored in the strong association between mental disorders and low work function as well as increased sickness absence.4 Common mental disorders are the leading cause of sickness absence in the majority of high-income countries and the main impetus behind 1/3 of all disability benefits.5

The consequences of reduced work functioning and increased sickness absence should not be disregarded. For the individual, work often constitutes an important source of social integration and social identity.6 Further, work can afford structure, purpose and meaning in daily life for both healthy individuals and individuals with mental health problems.7 8 Thus, at the personal level, absence from work may have consequences for social and psychological well-being. For society, sickness absence due to mental disorders represents a significant financial burden comparable to or exceeding that of physical disabilities (eg, blindness, musculoskeletal disorders9 10). This is likely due to a combination of increased prevalence of sickness absence, low probability of return to work (RTW) following sickness absence and a high risk of recurrent sickness absence found in workers with mental disorders.4

The clinical, social and economic consequences of sickness absence and reduced work functioning in workers with mental disorders have increasingly been recognised by policy makers and researchers alike. This has led to development of a variety of interventions aimed at increasing RTW among sick-listed workers with mental disorders. Although the primary goal of these interventions is the same (ie, facilitating RTW), the contents and methods vary, and no gold standard exists.4 At a general level, these interventions often contain one or more of the following four components: (1) Organisational change, defined as enhanced collaboration or integration of central partakers, (2) Graded RTW, (3) Therapeutic elements, for example, therapeutic conversations or therapy and (4) Contact to the work place. Whether each of these components by themselves or in combination significantly increase RTW among sick-listed workers with mental disorders has yet to be established.

Within the field of occupational medicine, several reviews and meta-analyses exist of RTW-interventions for sick-listed workers with musculoskeletal disorders and other non-communicable diseases11 12. However, only a handful of reviews and meta-analyses have been conducted for interventions aimed at RTW for sick-listed workers with mental health disorders. Only three of these reviews included a meta-analysis,13–15 and two of these were disorder-specific (ie, aimed at adjustment disorders13: or depression14). Nigatu and colleagues15 included a broader spectrum of mental disorders in their review and meta-analysis. The authors reviewed the literature on interventions aimed at enhancing RTW in sick-listed workers with common mental disorders, CMD (defined by the authors as depressive disorders, anxiety disorders and adjustment disorders). The authors concluded that there is no evidence to support the effectiveness of available interventions. However, this conclusion was based solely on interventions involving elements of Cognitive Behavioral Therapy (CBT). Thus, the authors excluded interventions that did not include CBT-elements and that were not aimed at sick-listed workers with CMD. As depressive disorders and anxiety disorders only make up 2 out of the 10 most high-prevalence and high-cost mental disorders affecting workers,16 it may be beneficial to adopt a more comprehensive approach, addressing a larger segment of mental disorders and a wider variety of RTW-interventions.

The aim of the present review and meta-analysis was to collate and update the existing evidence for interventions aimed at facilitating RTW in sick-listed workers with mental disorders. Interventions aimed at a broad spectrum of high-prevalence, high-cost mental disorders (ie, anxiety disorders, depressive disorders, adjustment disorders, stress-related disorders, personality disorders and somatoform disorders) were included.

Methods

A systematic review and meta-analysis for studies of interventions aimed at enhancing RTW for sick-listed workers with anxiety disorders, depressive disorders, adjustment disorders, stress-related disorders, personality disorders and somatoform disorders were performed.

A review protocol was prospectively registered with Prospero (registration number: CRD 42017070410). Our review conforms to the PRISMA guidelines for systematic reviews and meta-analyses.

Search strategy

An extensive search of the electronic databases EconLit, Embase, PsychInfo, PubMed, Svemed+ and Web of Science from January 2000 to June 2017 was conducted. The cut-off date of January 2000 was chosen to ensure relevance of the findings for current occupational and diagnostic practices and policies. The searches were limited to peer-reviewed English-language publications and either randomised controlled trials or controlled trials. Reference lists of identified literature and newer publications citing identified literature were reviewed. The search terms were established using the PICO model (see online supplementary appendix 1).

Supplemental material

Study selection

For inclusion, the studies had to be peer-reviewed, randomised or controlled studies assessing employment-related outcomes of interventions aimed at sick-listed workers with anxiety disorders, depressive disorders, adjustment disorders, stress-related disorders, personality disorders and/or somatoform disorders. When studies were aimed at more than one of these disorders, they were classified as targeting sick-listed workers with CMDs.

Employment-related outcomes were defined broadly as (1) time until RTW, (2) proportion of participants achieving RTW, (3) number of sick leave days and (4) self-reported work-readiness. Previous systematic reviews have been used as a foundation for this review, but were not formally included. Grey literature, single case studies and qualitative studies were excluded.

The selection process consisted of three stages. First, the two authors independently screened the titles and abstracts of retrieved studies to determine eligibility. Second, the two authors independently read the full text articles of identified studies to assess whether studies should be included. Third, the authors reviewed reference lists and newer publications citing identified literature. Interobserver agreement was calculated at each stage and disagreements were resolved by consensus.

Quality appraisal

To establish a reliable rating system, the two authors jointly reviewed two studies and developed a quality assessment checklist based on The National Institutes of Health Quality Assessment Tool for Controlled Intervention Studies (see online supplementary appendix 2). The methodological quality of the remaining studies was assessed independently by the two authors according to this quality assessment checklist. Interobserver agreement was assessed by calculating the kappa index (κ) with 95% CI. Disagreements were resolved through discussion. The summary score for each study was calculated (minimum: 1, maximum: 10) and categorised into three categories. Studies with a score of ≥9 were considered of high quality, studies with a score of 6–8 were considered of fair quality and studies with a score of 5 were considered of poor quality. These categories were used to evaluate whether outcomes significantly varied according to study quality and to determine which weight studies should be given in the synthesis of the findings.

Supplemental material

Data extraction and coding

The two authors independently used a prespecified data-extraction sheet to extract data from the included studies (see online supplementary appendix 3). Accordingly, data were extracted at five levels: study, participants and controls, design and method, measure of interest and outcome and effect. In addition, intervention types were coded according to four components: (1) organisational change, that is, enhanced collaboration or integration of central partakers, (2) graded RTW, (3) therapeutic elements, for example, therapy or therapeutic support and (4) work place contact before RTW, for example, meetings with the sick-listed worker and a representative of the employer at the workplace. When studies evaluated more than one intervention with no overlap in participants, each treatment group were included separately and contrasted with the control group.

Supplemental material

Statistical procedure

The primary outcome measure used in the meta-analysis was time until RTW. When time until RTW was not available, the proportion of participants who had achieved RTW measured as close to the end of the intervention as possible was used. Studies using alternative outcome measures were included in the meta-analysis if the outcomes could be converted to standardised effect sizes for mean difference between the intervention group and the control group. Effect sizes were calculated as Glass’ delta (Δ), where the mean difference between the intervention-group (1) and the control group (2) is divided by the SD of the control group:Embedded Image . According to Kline,17 this is an appropriate method when interventions affect both the central tendency and variability in the population distribution. The I2 statistic was calculated to assess heterogeneity between studies. A random effects meta-analysis was applied, as it allows for heterogeneity in the analysis of the overall effectiveness of the interventions. Meta-regression analyses were performed to investigate prespecified moderators of effectiveness (ie, target population, intervention content, quality and country of origin). Risk of publication bias was assessed by funnel plot analysis and the Egger’s test.

Results

A PRISMA flow diagram of the search results and reasons for study exclusion can be found in supplementary material (see online supplementary figure 1). The literature search yielded 4104 potentially relevant publications (3777 after removal of duplicates), while the review of reference lists and newer publications citing the identified literature yielded an additional 4 potentially relevant publications. After review of titles and abstracts, 118 full-text articles were retrieved for further consideration. Following review of the full-text articles, 76 articles were excluded, leaving 42 articles (n=38 938) to be included in the present systematic review (see table 1). Agreement about inclusions and exclusion was high. The two authors were in 97.5% agreement about inclusion and exclusion following the preliminary screening of titles and abstracts and in 95% agreement of inclusion and exclusion following review of full-text articles.

Supplemental material

Table 1

Quality ratings of included studies

Of the 42 articles that met the criteria for inclusion and exclusion, 3 articles described studies using the same data as other included articles. For these, data were extracted from the most recent article with relevant data. Thus, the final sample consisted of 39 studies comprising 31 randomised controlled trials (RCTs) and 8 controlled trials. The studies were published in scientific journals between 2003 and 2017.

Systematic review

The quality assessment revealed that 10 of the included studies could be characterised as high-quality studies, 23 studies could be characterised as fair quality studies and 6 could be characterised as low-quality studies. All RCTs were evaluated to be of fair or high quality. Two controlled studies were assessed to be of fair quality, while the remaining controlled studies were evaluated to be of poor quality. The two authors were in 97.5% agreement about the classification of studies into high quality, fair quality and low quality categories. The intercoder agreement for exact scores on the quality checklist was κ=0.5, indicating moderate agreement. Table 1 provides an overview of included studies and specific quality ratings of these studies. The majority of the studies were conducted with participants with CMDs and stress-related disorders, while a few studies were conducted exclusively with participants with depression and somatoform disorders. None of the included studies were on interventions aimed at sick-listed workers with personality disorders.

To provide structure for the systematic review, the included studies are reviewed according to study quality and design. Thus, high-quality RCT studies, fair-quality RCT studies and controlled studies are reviewed in turn.

Summary of high quality RCT-studies

Ten RCT studies were characterised as being of high quality. Table 2 provides an overview of included studies including intervention characteristics and effectiveness. Three of the high-quality studies found a positive effect of the studied intervention,18–20 while seven high-quality studies report no effect.21–27

The content and methods employed in these high quality interventions are very different, and no obvious conclusions can be drawn based on these studies alone. However, an interesting observation can be made regarding the target group. No diagnosis-specific intervention were found to have an effect, while three interventions aimed at workers with CMDs were found to be effective.

Summary of fair quality RCT studies

Twenty-one RCT studies were characterised as fair quality (see table 2). Seven fair-quality studies found a positive effect of the examined intervention,28–34 while 14 fair quality studies showed no effect.35–50

As with the high-quality RCTs, the fair-quality RCTs comprise very different interventions. However, a few common characteristics are apparent: the majority of the effective interventions were aimed at newly or short-term sick-listed workers (<10 weeks at the time of inclusion; 5/7 studies), were disorder specific (7/7 studies), included contact to the work place (6/7 studies), often in combination with therapy), were delivered by a multidisciplinary team (4/7 studies) and included two or more components (5/7 studies).

Characteristics of the fair-quality RCTs that did not show effect can be used to nuance the highlighted characteristics for effective studies. About half of the ineffective studies were aimed at newly or short-term sick-listed workers (7/14 studies) and included a multidisciplinary team (6/14). The majority of the ineffective studies were aimed at workers with CMDs (9/14), did not include contact to the work place (12/14) and included only 0 or 1 of the four identified components (13/14 studies).

Overall, the findings from the fair quality RCTs suggest that disorder-specific interventions, interventions including contact to the work place as a component and interventions with more than one component are more likely to be effective.

Summary of controlled studies

Eight controlled studies were included in the present review (see table 2). Two controlled studies were characterised as fair quality,51 52 while the remaining six studies were of poor quality.53–59 Five controlled studies found a positive effect of the examined intervention,51 52 57–59 while three controlled studies showed no effect.53–56 Study characteristics varied greatly across controlled studies. No convincing patterns of characteristics emerge from the overview of effective and ineffective controlled studies.

Table 2

Overview of study characteristics and effectiveness

Meta-analysis

For the meta-analysis, primary authors were contacted in order to obtain missing data for 10 studies.18 26 30 33 35–37 46 50 59 We received data from authors of four studies,18 33 35 36 46 while two authors were not able to comply with the request.37 50 The remaining authors could not be reached. Thus, appropriate data were available from 33 studies. One of these studies52 was excluded, as the reported results shifted from significantly negative to significantly positive at different points in time during follow-up. Three of the included studies29 41 42 tested the effects of more than one intervention or estimated the effect of the intervention on more than two groups. Thus, the final sample included in the meta-analysis consisted of 32 studies with 35 effect sizes.

Overall effect

The result of the random effects meta-analysis is summarised in figure 1.

Figure 1

Results of the meta-analysis. Note: Test for heterogeneity, χ² = 105.72 (df=34), p=0.000. Proportion of variation in effect sizes that can be attributed to heterogeneity, I-squared: 67.8 %. Test of effect size=0: Z=3.88, p=0.000. The overall effect size of 0.14 can be considered small according to the classification of Cohen.60 An effect size of 0.14 corresponds to an increase in return to work at three month follow-up from, for example, 36% to 43%.

The pooled effect size (95% CI) was 0.14 (0.07 to 0.22) with considerable heterogeneity (I2=67.8%). This result was highly significant (p<0.001), implying that the interventions generally shortened time until RTW and increased the fraction of participants who had achieved RTW at follow-up. The test of heterogeneity was significant, indicating that two-thirds of the variation in effect sizes across studies was likely due to other sources than sampling error alone (eg, study design, intervention content, target population). These findings remained unaltered when the meta-analysis was rerun without studies of poor quality (effect size=0.12, p<0.001).

Meta-regressions were performed for the meta-analysis with all 32 studies (35 effect sizes) to assess potential sources of heterogeneity. Results from the meta-regressions are presented in the following section.

Nineteen of the included studies found positive effects of the tested interventions, but only 11 of these were significant. Six studies found negative effects, but only two of these effects were significant. Ten studies found no effect. The most effective interventions were presented by Netterstrøm et al,32 33 who found an increase in proportion of participants achieving RTW in the intervention group (66% and 67%) compared with the control group (36% in both studies). These effects are substantial, corresponding to almost a doubling of the proportion of participants achieving RTW in the intervention group compared with the control group.

Meta-regression estimates of moderators of effect sizes

Meta-regressions were performed to assess the moderating influence of study and population characteristics on effect sizes. In addition, the meta-regressions provide an opportunity to test the robustness of the findings from the systematic review, where effect size and sample size were not taken into consideration. Table 3 provides results from meta-regressions estimating the moderating effect of diagnosis (reference category: CMD), country of origin (reference category: Netherlands), number of components included in an intervention (reference category: 1), specific components included in an intervention (reference category: no specific components) and study quality (reference category: fair).

Table 3

Meta-regression estimates of effect size moderated by diagnosis, country of origin, number of components in interventions, type of components in interventions and study quality

As evident from table 3, interventions aimed at workers sick-listed due to stress showed the largest effects. Effect sizes from studies aimed at workers with the remaining diagnoses did not appear to be significantly different from the effect sizes from studies aimed at workers with CMD. Regarding country of origin, the effectiveness of interventions did not appear to vary systematically across countries. There was a non-significant tendency for the most effective studies to have originated from Denmark and other countries (Norway and Germany) as opposed to Sweden and the Netherlands. As for other population characteristics, meta-regressions were performed to estimate the moderating effect of age and the proportion of women. These variables had no significant moderating impacts on effect sizes.

Table 3 further shows that effect sizes varied according to number of components in the interventions. Interventions with 0 components did not significantly differ from interventions with 1 component (which had an effect size of 0.00). However, interventions with 2 and 3 or more components were associated with significantly larger effect sizes (0.36 and 0.25, respectively) compared with interventions with 1 component. In addition, interventions that included contact to the work place showed significantly larger effect sizes compared with interventions including none of the identified components. Interventions including the remaining components did not significantly differ from interventions including no components. Regarding other intervention characteristics, meta-regressions were conducted to estimate the moderating effect of timing of the intervention, the length of intervention and the study quality rating, but these variables were not associated with significant moderating impacts on effect sizes.

Results from the overall analysis and the meta-regressions remained robust when the analyses were rerun omitting studies of poor quality and omitting interventions aimed at sick-listed workers with stress-related disorders. Omitting studies aimed at sick-listed workers with stress-related disorders reduced the effect size to 0.07 (p=0.03), but the qualitative results from the meta-regressions remained with the exception that the component graded RTW became significant and contact to the work place became insignificant.

Publication bias

Publication bias was assessed by a funnel plot (see online supplementary figure 2). A visual inspection of the plot revealed no clear asymmetry indicating no publication bias and Egger’s test for asymmetry in small study effects revealed no asymmetry (p=0.13).

Supplemental material

Levels of evidence based on systematic review and meta-analysis

In this section, a summary and integration of the evidence from the meta-analysis and the systematic review is provided.

The estimation of level of evidence is based on the following classification:

  • Strong evidence (++/−−): The meta-analysis and the majority (more than 50%) of the included studies of fair and high quality studies found a significant effect.

  • Moderate evidence (+/–): The meta-analysis or the majority of the included studies of fair and high quality found a significant effect.

  • Limited or contradictory evidence (?): None of the above criteria were met. No definite conclusions can be made.

Table 4 provides a summary of the evidence based on the meta-analysis and the systematic review.

Table 4

Summary of evidence from the meta-analysis and the systematic review

Discussion

In the present review and meta-analysis, we evaluated the effectiveness of employment related interventions for sick-listed workers with mental disorders. Overall, the results revealed a significant, but relatively small, positive effect of the included interventions. The effect size is similar to the effect size reported by Nigatu et al 15 in their review of 16 studies testing CBT-interventions aimed at RTW for sick-listed workers with CMDs. However, other disease-specific reviews report diverging findings. Niewenhuijsen et al 14 reviewed 23 studies of work-related interventions for workers with depression and found moderate effects across studies, while Arends et al 13 reviewed nine studies and found no effect of CBT and problem solving therapy on time until full RTW for sick-listed workers with adjustment disorders.

The average effect size of 0.14 can be considered small according to the classification of Cohen,60 but effect sizes of this order still carry considerable economic value. A couple of examples may illustrate this; in the two intervention studies by Netterstrøm et al.,32 33 36% of the control group had returned to work at 3 month follow-up. An effect size of 0.14 implies an increase by 7 percentage points to 43% (0.36+0.14×sqrt(0.36×(1–0.36))) in this case. That is, at a given point in time, 20% more workers in the intervention group have returned to work compared with the control group. Assuming for this example, a constant increase in the exit rate from sick-listing to work of 20% implies a 17% reduction in the average time until RTW (the average time until RTW is proportional to the inverse of the exit rate), suggesting a considerable impact on the public budget. Turning to interventions studying time until RWT, the SD in the control group varies, but an effect size of 0.14 in most cases implies a reduction in time until RTW by 15–30 days on average. Effects of this magnitude have large implications for the sick-listed individual, and they lead to large budget savings for government and/or employers, depending on who finance sickness insurance payments. Hence, employment-related interventions aimed at sick-listed workers with mental disorders are of considerable economic value to society.

Regarding intervention characteristics, we found strong evidence for including contact to the work place and for including more than one of the identified components. In addition, we found moderate evidence for including graded RTW, which by definition also implies contact to the work place. These results are robust and emphasise the need to focus on incorporating contact to the work place as well as one or more additional component(s) in intervention designs.

Research on the specific content of successful RTW-interventions for mental disorders is lacking, and only one other study has addressed intervention content.14 Consistent with the findings in the present review, Niewenhuijsen et al 14 found moderate evidence for including several components in RTW-interventions aimed at sick-listed workers with depression and reported that adding a work-related intervention to a clinical intervention significantly reduced the number of sick-leave days. Common to graded RTW and contact to the work place is that they allow sick-listed workers to refamiliarise themselves with their work place and their work activities. By doing so, the worker is effectively practicing management of their work situation and work functions in situ. This may contribute to the relative effectiveness of these components compared with therapy and organisational change, which by themselves include no contact to the work place.

For population characteristics, we found strong evidence for targeting stress, compared with other mental disorders or groups of disorders (ie, CMDs, depression, somatoform disorders, anxiety and personality disorders). This finding may reflect the small number of identified studies targeting other specific mental disorders (depression: 6, somatoform disorders: 2, anxiety: 1, personality disorders: 0), but it may also be attributed to the nature of stress. Stress can be considered a fundamentally different disorder compared with the other included disorders. Stress is intrinsically tied to perceived external stressors. When these stressors are removed, the stress symptoms often dissipate. The same cannot be said for depression, anxiety, somatoform disorders or personality disorders. In addition, although some overlap is apparent, every mental disorder is associated with unique symptoms and unique impairment.2 3 This further speaks to the challenge of designing an intervention flexible enough to address the needs associated with several different disorders. However, this is exactly the type of intervention requested by local government or other institutions interacting with the sick-listed worker to help them back to work. In local labour markets, they typically do not have sufficient scale to train case workers and/or therapists in interventions aimed at small groups.

The present review and meta-analysis provide new insights for researchers and policymakers alike, attempting to conceive employment related interventions aimed at broader groups of sick-listed workers with mental disorders, and they represent a much needed starting point for planning future research and policy. The results provide convincing evidence for the effectiveness of work related interventions for sick-listed workers with mental disorders and point to key ingredients in successful interventions that may boost effect sizes considerably beyond the average, namely contact to the work place and using more than one active component.

The overall quality of the present review and meta-analysis might be affected by the following limitations. First, despite the aim to include interventions targeting a broad spectrum of mental disorders, only two studies pertaining exclusively to somatoform disorders, one study pertaining to anxiety and no studies targeting personality disorders were obtained. Second, the majority of the included studies were conducted with small samples (27 out of 42 studies had sample sizes<200). Third, due to missing data, the foundations for the meta-analysis and the systematic review differ slightly and results should be interpreted in the light of this discrepancy.

Conclusion

In the present review and meta-analysis, we assessed the effectiveness of interventions aimed at enhancing RTW for sick-listed workers with mental disorders. The results revealed an overall significantly positive and economically important effect of interventions aimed at enhancing RTW for sick-listed workers with mental disorders. We demonstrated the importance of contact to the work place, graded RTW and the use of multicomponent interventions as key ingredients in successful interventions. These findings may inform design of future interventions and may thus represent a step towards establishing systematic guidelines for employment related interventions aimed at sick-listed workers with mental disorders.

References

Footnotes

  • Contributors Review conception and design: MBM and MR. Acquisition of data: MBM and MR. Systematic review and data-interpretation: MBM and MR. Meta-analysis: MR. Drafting of manuscript: MBM. Critical revision: MBM and MR.

  • Funding The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.

  • Competing interests None declared.

  • Patient consent Not required.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Data sharing statement Data available in reviewed articles.

  • Preregistration Prospero: CRD42017070410.