Reliability of Standard Health Assessment Instruments in a Large, Population-Based Cohort Study

doi:10.1016/j.annepidem.2006.12.002

Annals of Epidemiology

Volume 17, Issue 7, July 2007, Pages 525-532

https://doi.org/10.1016/j.annepidem.2006.12.002 Get rights and content

Purpose

The Millennium Cohort Study began in 2001 using mail and Internet questionnaires to gather occupational and environmental exposure, behavioral risk factor, and health outcome data from a large, population-based US military cohort. Standardized instruments, including the Patient Health Questionnaire, the Medical Outcomes Study Short Form-36 for Veterans, and the Posttraumatic Stress Disorder (PTSD) Checklist–Civilian Version, have been validated in various populations. The purpose of this study was to investigate internal consistency of standardized instruments and concordance of responses in a test-retest setting.

Methods

Cronbach alpha coefficients were used to investigate the internal consistency of standardized instruments among 76,742 participants. Kappa statistics were calculated to measure stability of aggregated responses in a subgroup of 470 participants who voluntarily submitted an additional survey within 6 months of their original submission.

Results

High internal consistency was found for 14 of 16 health components, with lower internal consistency found among two alcohol components. Substantial test-retest stability was observed for stationary variables, while moderate stability was found for more dynamic variables that measured conditions with low prevalence.

Conclusions

These results substantiate internal consistency and stability of several standard health instruments applied to this large cohort. Such reliability analyses are vital to the integrity of long-term outcome studies.

Introduction

Standardized instruments are often used in survey research. Many of these instruments are devised in clinic settings where health assessment is completed by trained health care professionals. However, prohibitive cost and relative ease make participant-assessed outcome measures a more feasible approach to obtain constructs describing functional and mental health outcomes. With these more convenient measures of health increasingly used as primary outcomes in epidemiologic studies, selecting an appropriate assessment tool involves careful review of the many standard survey instruments available. Special consideration of whether the instruments meet the requirements of the proposed application is critical to interpretation of collected data (1). Reliability and validity of these instruments are often tested thoroughly in populations or settings in which the instrument was originally created 2, 3. However, many questionnaires incorporate standardized survey instruments in populations that may be different from those for which the instrument was intended. In these studies, it is important to establish a level of confidence in the information being ascertained prior to declaring the instrument appropriate for the targeted population.

The Millennium Cohort, the largest cohort study ever undertaken by the US Department of Defense, was launched in 2001 to gather health outcome information along with occupational and environmental exposures employing a longitudinal approach 4, 5. In the first panel of enrollment, more than 77,000 participants joined the 22-year-long study, filling out either a mailed survey or an identical Web-based survey. The Millennium Cohort Study questionnaire is composed of more than 60 multipart questions comprising more than 400 individual data points, including questions from standardized instruments such as the Medical Outcomes Study Short Form 36-item for Veterans (SF-36V) 6, 7, the Primary Care Evaluation of Mental Disorders (PRIME-MD) Patient Health Questionnaire (PHQ) 2, 8, 9, the Posttraumatic Stress Disorder (PTSD) Checklist–Civilian Version (PCL-C) 3, 10, and the CAGE questionnaire to assess problematic drinking behavior (11), as well as questions that target areas such as medical history, vaccinations, environmental exposures, and occupation. Although the concordance of test-retest responses and internal consistency of the standard instruments have been established 6, 7, 8, 9, 10, tests of reliability of these constructs have not been performed in a large, population-based cohort where multiple independent instruments are presented simultaneously. The purpose of this study, therefore, was to establish the reliability as measured by concordance in a test-retest setting and internal consistency of several standardized instruments in a large, population-based military cohort.

Section snippets

Study Population

The invited Millennium Cohort Study participants were randomly selected from all US military personnel serving in the Army, Navy, Coast Guard, Air Force, and Marine Corps as of October 1, 2000. The population-based sample represented approximately 11% of the 2.3 million men and women in service and, oversampled for those who had been previously deployed, were US Reserve and National Guard personnel, and female service members, to ensure sufficient power to detect differences in smaller

Results

Of the 77,047 Millennium Cohort Panel 1 participants, 76,742 (99.6%) had complete demographic and military characteristic data. This population included 73% men, 73% born between 1960 and 1979, 49% without any college experience, 63% married, 70% white non-Hispanic, 77% enlisted personnel, 57% active duty personnel, 48% Army, 20% working as functional support specialists, and 20% combat specialists (Table 2).

Levels of internal consistency among standardized survey scales, as measured by

Discussion

Standardized instruments are often employed to enhance the value of epidemiologic survey research. Diligence in establishing consistency and comparability to promote confidence in results will become increasingly more important. While the use of established survey instruments may be an enticing addition in pursuit of quality health metrics, suboptimal performance in varying populations may be found instead. In this study, the internal consistency of well-known instruments (PHQ, SF-36V, CAGE,

References (32)

E.B. Blanchard et al.
Psychometric properties of the PTSD Checklist (PCL)
Behav Res Ther
(1996)
M.A.K. Ryan et al.
Millennium cohort: enrollment begins a 21-year contribution to understanding the impact of military service
J Clin Epidemiol
(2007)
R.L. Spitzer et al.
Validity and utility of the PRIME-MD Patient Health Questionnaire in assessment of 3000 obstetric-gynecologic patients: the PRIME-MD Patient Health Questionnaire Obstetrics-Gynecology Study
Am J Obstet Gynecol
(2000)
W.D. Thompson et al.
A reappraisal of the kappa coefficient
J Clin Epidemiol
(1988)
R. Fitzpatrick et al.
Evaluating patient-based outcome measures for use in clinical trials
Health Technol Assess
(1998)
R.L. Spitzer et al.
Validation and utility of a self-report version of PRIME-MD: the PHQ Primary Care Study. Primary care evaluation of mental disorders
JAMA
(1999)
G.C. Gray et al.
The Millennium Cohort Study: a 21-year prospective cohort study of 140,000 military personnel
Mil Med
(2002)
J.E. Ware et al.
SF-36 Health Survey: manual and interpretation guide
(2000)
J.E. Ware et al.
The MOS 36-Item Short-Form Health Survey (SF-36). I. Conceptual framework and item selection
Med Care
(1992)
R.L. Spitzer et al.
Utility of a new procedure for diagnosing mental disorders in primary care. The PRIME-MD 1000 Study
JAMA
(1994)

Weathers FW, Litz BT, Herman DS, Huska JA, Keane TM. The PTSD Checklist (PCL): reliability, validity, and diagnostic...

J.A. Ewing

Detecting alcoholism. The CAGE questionnaire

JAMA

(1984)

J.R. Fann et al.

Validity of the Patient Health Questionnaire-9 in assessing depression following traumatic brain injury

J Head Trauma Rehabil

(2005)

A.J. Means-Christensen et al.

An efficient method of identifying major depression and panic disorder in primary care

J Behav Med

(2005)

D. Jones et al.

Health status assessments using the Veterans SF-12 and SF-36: methods for evaluating outcomes in the Veterans Health Administration

J Ambul Care Manage

(2001)

J.E. Ware et al.

SF-36 Physical and Mental Health Summary Scales: A user's manual

(1994)

Cited by (103)

The Millennium Cohort Study: The first 20 years of research dedicated to understanding the long-term health of US Service Members and Veterans
2022, Annals of Epidemiology
The Millennium Cohort Study, the US Department of Defense's largest and longest running study, was conceived in 1999 to investigate the effects of military service on service member health and well-being by prospectively following active duty, Reserve, and National Guard personnel from all branches during and following military service. In commemoration of the Study's 20th anniversary, this paper provides a summary of its methods, key findings, and future directions.
Recruitment and enrollment of the first 5 panels occurred between 2001 and 2021. After completing a baseline survey, participants are requested to complete follow-up surveys every 3–5 years.
Study research projects are categorized into 3 core portfolio areas (psychological health, physical health, and health-related behaviors) and several cross-cutting areas and have culminated in more than 120 publications to date. For example, some key Study findings include that specific military service-related factors (e.g., experiencing combat, serving in certain occupational subgroups) were associated with adverse health-related outcomes and that unhealthy behaviors and mental health issues may increase following the transition from military service to veteran status.
The Study will continue to foster stakeholder relationships such that research findings inform and guide policy initiatives and health promotion efforts.
Sexual health difficulties among service women: the influence of posttraumatic stress disorder
2021, Journal of Affective Disorders
Citation Excerpt :
Mental disorders were assessed at Time 1. Probable PTSD was measured using the PTSD Checklist−Civilian Version (PCL-C), a validated instrument used to rate the severity of symptoms (Blanchard et al., 1996) that has demonstrated good internal consistency (Cronbach's =0.94) in this cohort (Smith et al., 2007). Based on criteria from the Diagnostic and Statistical Manual of Mental Disorders 4th edition (DSM-IV), probable PTSD was defined as reporting a moderate or higher level of at least one intrusion symptom, three avoidance symptoms, and two hyperarousal symptoms (Diagnostic and statistical manual of mental disorders 4th ed.
Background Sexual health among service women remains understudied, yet is related to health and quality of life. This study examined if the associations between recent combat and sexual assault with sexual health difficulties were mediated by mental disorders and identified factors associated with sexual health difficulties among service women.
Methods Data from two time points (2013 and 2016) of the Millennium Cohort Study, a large military cohort, were used. The outcome was self-reported sexual health difficulties. Mediation analyses examined probable posttraumatic stress disorder (PTSD) and major depressive disorder (MDD) as intermediate variables between recent combat and sexual assault with the sexual health difficulties. Multivariable logistic regression modeling was used to examine the association of demographic, military, historical mental health, life stressors, and physical health factors with sexual health difficulties.
Results Of the 6,524 service women, 13.5% endorsed experiencing sexual health difficulties. Recent combat and sexual assault were significantly associated with sexual health difficulties. Probable PTSD mediated the associations of recent combat and sexual assault with sexual health difficulties; probable MDD did not mediate these relationships. Other significant factors associated with sexual health difficulties included enlisted rank, historical mental disorders, childhood trauma, and disabling injury.
Limitations Use of self-reported data, outcome not assessed using a standardized measure and future studies may benefit from examining other mediators.
Conclusion Our findings that combat and sexual assault may have negative effects on service women's sexual health suggest that treatment options and insurance coverage for sexual health problems should be expanded.
A pilot multisite study of patient navigation for pregnant women with opioid use disorder
2019, Contemporary Clinical Trials
The opioid crisis continues to affect pregnant and postpartum women the United States, with the number of pregnant women diagnosed with opioid use disorder (OUD) quadrupling over the last decade. The associated increase in morbidity and mortality among mother and baby warrants prompt, targeted intervention efforts that improve engagement, linkage of care, and treatment retention. Patient navigation (PN) is a chronic care intervention that can directly address this need by helping women identify medical, behavioral, and psychosocial care goals. Moreover, PN can assist women in preparing for, engaging in, and maintaining patient participation in necessary services. Specifically, PN includes strengths-based case management, 1-1 clinical support, motivational interviewing, and addiction-relapse prevention programming. The objective of this article is to present the study protocol of a pilot multisite randomized clinical trial, entitled: Optimizing Pregnancy and Treatment Interventions for Moms 2.0 (OPTI-Mom 2.0; NCT03833245). In this study, we build upon a proof-of-concept study, employing evidence-informed frameworks for protocol and intervention expansion in order to construct a PN intervention tailored for pregnant women with OUD in central Utah and southwestern Pennsylvania. Our protocol provides an initial framework of a potentially impactful intervention and may guide development of future programs. Importantly, this study further establishes the evidence-base—with potential to ameliorate serious adverse opioid-related outcomes and improve health for women and their children.
A community pharmacy-led intervention for opioid medication misuse: A small-scale randomized clinical trial
2019, Drug and Alcohol Dependence
Citation Excerpt :
The two-item pain subscale asked about level of bodily pain and pain-related physical functioning and is scored on a 0–200 scale. We assessed depression using the Patient Health Questionnaire (PHQ) depression subscale, a valid mental health assessment with demonstrated reliability (Hides et al., 2007; Smith et al., 2007; Spitzer et al., 1999, 2000). This subscale is scored on a 5-point scale (0=none-minimal; 1=mild; 2=moderate, 3=moderately severe; 4=severe).
Stemming the opioid epidemic requires testing novel interventions. Toward this goal, feasibility and acceptability of a Brief Motivational Intervention-Medication Therapy Management (BMI-MTM) intervention was examined along with its impact on medication misuse and concomitant health conditions.
We conducted a two-group randomized trial in 2 community pharmacies. We screened patients for prescription opioid misuse at point-of-service using the Prescription Opioid Misuse Index. Participants were assigned to standard medication counseling (SMC) or SMC + BMI-MTM (referred to as BMI-MTM herein). BMI-MTM consists of a pharmacist-led medication counseling/brief motivational session and 8-weekly patient navigation sessions. Assessments were at baseline, 2-, and 3-months. Primary outcomes included feasibility, acceptability, and mitigation of opioid medication misuse. Secondary outcomes included pain and depression. Outcomes were analyzed with descriptive and multivariable statistics (intent-to-treat [ITT] and adjusted for number of sessions completed [NUMSESS]).
Thirty-two participants provided informed consent (74.4% consent rate; SMC n = 17, BMI-MTM n = 15; 3-month assessment retention ≥93%). Feasibility was demonstrated by all BMI-MTM recipients completing the pharmacist session and an average of 7 navigation sessions. BMI-MTM recipients indicated ≥4.2 (5 maximum) level of satisfaction with the pharmacist-led session, and 92.4% were satisfied with navigation sessions. Compared to SMC at 3-months, BMI-MTM recipients reported greater improvements in misuse (ITT: Adjusted Odds Ratio [AOR] = 0.13; 95% CI = 0.05, 0.35, p < 0.001. NUMSESS: AOR = 0.05; 95% CI = 0.01, 0.25; p < 0.001), pain (ITT: В = 8.8, 95% CI=-0.95, 18.5, p = 0.08; NUMSESS: В = 14.0, 95% CI = 3.28, 24.8, p = 0.01), and depression (ITT: B= -0.44; 95% CI=-0.65, -0.22; p < 0.001. NUMSESS: B= -0.64; 95% CI=-0.82, -0.46; p < 0.001).
BMI-MTM is a feasible misuse intervention associated with superior satisfaction and outcomes than SMC. Future research should test BMI-MTM in a large-scale, fully-powered trial.
Patient navigation for pregnant individuals with opioid use disorder: Results of a randomized multi-site pilot trial
2024, Addiction
Masculinity and stigma among emerging adult military members and veterans: implications for encouraging help-seeking
2023, Current Psychology

View all citing articles on Scopus

: Disclosure: This work represents Report 06-24, supported by the Department of Defense, under work unit No. 60002. The views expressed in this article are those of the authors and do not reflect the official policy or position of the Department of the Navy, Department of the Army, Department of the Air Force, Department of Defense, Department of Veterans Affairs, or the US Government. This research has been conducted in compliance with all applicable federal regulations governing the protection of human subjects in research (Protocol NHRC.2000.007).

^∗: In addition to the authors, the Millennium Cohort Study Team includes Paul J. Amoroso, MD, MPH (Madigan Army Medical Center, Tacoma, WA); Edward J. Boyko, MD, MPH (Seattle Epidemiologic Research and Information Center, Department of Veterans Affairs Puget Sound Health Care System, Seattle, WA; Gary D. Gackstetter, PhD, DVM, MPH (Department of Preventive Medicine and Biometrics, Uniformed Services University of the Health Sciences, Bethesda, MD and Analytic Services, Inc. [ANSER], Arlington, VA; Gregory C. Gray, MD, MPH (College of Public Health, University of Iowa, Iowa City, IA; Tomoko I. Hooper, MD, MPH, Department of Preventive Medicine and Biometrics, Uniformed Services University of the Health Sciences, Bethesda, MD); James R. Riddle, DVM, MPH, and Timothy S. Wells, PhD, DVM, MPH. (both from Air Force Research Laboratory, Wright Patterson AFB, OH.).

View full text

Reliability of Standard Health Assessment Instruments in a Large, Population-Based Cohort Study

Purpose

Methods

Results

Conclusions

Introduction

Section snippets

Study Population

Results

Discussion

Behav Res Ther

J Clin Epidemiol

Am J Obstet Gynecol

J Clin Epidemiol

Evaluating patient-based outcome measures for use in clinical trials

Health Technol Assess

Validation and utility of a self-report version of PRIME-MD: the PHQ Primary Care Study. Primary care evaluation of mental disorders

JAMA

The Millennium Cohort Study: a 21-year prospective cohort study of 140,000 military personnel

Mil Med

SF-36 Health Survey: manual and interpretation guide

The MOS 36-Item Short-Form Health Survey (SF-36). I. Conceptual framework and item selection

Med Care

Utility of a new procedure for diagnosing mental disorders in primary care. The PRIME-MD 1000 Study

JAMA

Detecting alcoholism. The CAGE questionnaire

JAMA

Validity of the Patient Health Questionnaire-9 in assessing depression following traumatic brain injury

J Head Trauma Rehabil

An efficient method of identifying major depression and panic disorder in primary care

J Behav Med

Health status assessments using the Veterans SF-12 and SF-36: methods for evaluating outcomes in the Veterans Health Administration

J Ambul Care Manage

SF-36 Physical and Mental Health Summary Scales: A user's manual