You Are Here:
Release Date: July 30, 2013
Prepared by: Harry J. de Koning, MD, PhD; Rafael Meza, PhD; Sylvia K. Plevritis, PhD; Kevin ten Haaf, MSc; Vidit N. Munshi, MS; Jihyoun Jeon, PhD; S. Ayca Erdogan, PhD; Chung Yin Kong, PhD; Summer S. Han, PhD; Joost van Rosmalen, PhD; Sung Eun Choi, SM; Melicia C. Miller, MPH; Suresh Moolgavkar, MD, PhD; Paul F. Pinsky, PhD; Christine D. Berg, MD; Amy Berrington de Gonzalez, PhD; William C. Black, MD; C. Martin Tammemagi, PhD; William D. Hazelton, PhD; Eric J. Feuer, PhD; Pamela M. McMahon, PhD
This report is based on research conducted by the Cancer Intervention and Surveillance Modeling Network under contract to the Agency for Healthcare Research and Quality (AHRQ), Rockville, MD (Administrative Supplement to U01 CA152956). The investigators involved have declared no conflicts of interest with objectively conducting this research. The findings and conclusions in this document are those of the authors, who are responsible for its contents, and do not necessarily represent the views of AHRQ. No statement in this report should be construed as an official position of AHRQ or of the U.S. Department of Health and Human Services.
The information in this article is intended to help clinicians, employers, policymakers, and others make informed decisions about the provision of health care services. This article is intended as a reference and not as a substitute for clinical judgment.
This report may be used, in whole or in part, as the basis for the development of clinical practice guidelines and other quality enhancement tools, or as a basis for reimbursement and coverage policies. AHRQ or U.S. Department of Health and Human Services endorsement of such derivative products may not be stated or implied.
Background: The National Lung Screening Trial (NLST) demonstrated that three annual computed tomography (CT) screenings reduced lung cancer-specific mortality by 20% compared with annual chest radiography screenings in a volunteer population of current and former smokers ages 55 to 74 years with at least 30 pack-years of cigarette smoking history and no more than 15 years since quitting for former smokers. To inform the updated U.S. Preventive Services Task Force recommendations on lung cancer screening, we assessed the benefits and harms of CT screening programs that varied by age, pack-year, and years since quitting criteria, as well as the frequency of screening.
Methods: Five independent microsimulation models estimated the long-term harms and benefits of screening as experienced by the U.S. cohort born in 1950. The five models were calibrated to the NLST to predict lung cancer outcomes consistent with the trial's observations. These models were also then calibrated to the lung cancer screening portion of the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial. We evaluated 576 scenarios with annual or less frequent screening of individuals between the ages of 45 and 85 years, for a range of minimum smoking exposure (measured in pack-years) and maximum time since quitting. Screening benefits are expressed in terms of the percentage of cancers detected at an early stage (stages I or II), percentage and absolute number of lung cancer deaths prevented, and life-years gained compared with a reference scenario with no screening. Screening harms are expressed as the number of CT screenings required (and percentage of the cohort ever screened), number of followup imaging examinations, and number of overdiagnosed lung cancers and radiation-related lung cancer deaths. We identified consensus strategies that the models identified as efficient, preventing the greatest number of lung cancer deaths for the screening examinations required. Counts and percentages reported are calculated as averages of outcomes from the five models, following a 100,000 person cohort from ages 45 to 90 years.
Results: Five independent microsimulation models estimated the long-term harms and benefits of screening as experienced by the U.S. cohort born in 1950. The five models were calibrated to the NLST to predict lung cancer outcomes consistent with the trial's observations. These models were also then calibrated to the lung cancer screening portion of the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial. We evaluated 576 scenarios with annual or less frequent screening of individuals between the ages of 45 and 85 years, for a range of minimum smoking exposure (measured in pack-years) and maximum time since quitting. Screening benefits are expressed in terms of the percentage of cancers detected at an early stage (stages I or II), percentage and absolute number of lung cancer deaths prevented, and life-years gained compared with a reference scenario with no screening. Screening harms are expressed as the number of CT screenings required (and percentage of the cohort ever screened), number of followup imaging examinations, and number of overdiagnosed lung cancers and radiation-related lung cancer deaths. We identified consensus strategies that the models identified as efficient, preventing the greatest number of lung cancer deaths for the screening examinations required. Counts and percentages reported are calculated as averages of outcomes from the five models, following a 100,000 person cohort from ages 45 to 90 years.
Conclusion: Our findings support a range of possible lung cancer screening programs, including annual lung cancer screening of individuals with at least 30 pack-years of smoking who are between the ages of 55 and 80 years, but cannot determine which tradeoff of harms and benefits is “best.” Scenarios with an older starting age (60 years) but increased maximum years since quitting (from 15 to 25 years) offer different tradeoffs of benefits and harms (depending on the minimum pack-years). Extending eligibility to individuals with fewer pack-years—although still efficient—leads to additional benefits but more additional harms. Overdiagnosis remained limited for annual screening.
The National Lung Screening Trial (NLST) demonstrated that in a volunteer population of current and former smokers ages 55 to 74 years with at least 30 pack-years of cigarette smoking history and no more than 15 years since quitting for former smokers, three annual computed tomography (CT) screenings and subsequent treatment of early-stage lung cancer reduced lung cancer-specific mortality by 20% compared with three annual chest radiography screenings at 6.5-years followup (1). With an additional year of followup (to 7.5 years), the lung cancer-specific mortality reduction in the NLST was 16% (2). Albeit a significant effect, this trial does not directly address the effects of additional rounds of screening, the long-term benefits, or whether other screening policies, such as different intervals or risk groups, may result in substantial benefits. To understand the tradeoffs between benefits and important harms involved with screening, long-term outcomes must be quantified (3).
We used five microsimulation models that were calibrated to individual-level, de-identified data from the NLST and the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial (which included a wider range of smoking exposures than NLST and provides further information on the natural history of lung cancer) to estimate the long-term harms and benefits of a variety of CT screening programs (4). In this report, we briefly summarize model calibration to NLST data and estimated future harms and benefits (averaged outcomes from all five calibrated models) of a set of 27 screening policies.
Calibration of Models to De-Identified NLST Data
The five models used were developed independently by groups of investigators at five institutions: Erasmus MC in Rotterdam, the Netherlands (Model E), Fred Hutchinson Cancer Research Center in Seattle (Model F), Massachusetts General Hospital in Boston (Model M), Stanford University in Stanford, California (Model S), and the University of Michigan in Ann Arbor, Michigan (Model U). Earlier versions of the models can be found under the model profilers at www.cisnet.cancer.gov/profiles. All five models simulate the underlying natural history of lung cancer (separated by histology) in individuals and include dose-response modules that relate a detailed cigarette smoking history over time to lung cancer risk. Initially, all models were populated with de-identified individual trial participant histories from the NLST (and the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial) and set to mimic the design of both trials (e.g., the number of screenings and screening modality, ages at screening, smoking history and sex of enrollees, and screening intervals). Each model estimates screening effectiveness based on a different set of equations that are key in predicting the effects of earlier treatment, but each model employs different mathematical formalisms and model structure. Figure 1 gives a general picture of how earlier detection (followed by treatment) may have an effect in reducing serious consequences of the disease and/or increasing life expectancy. Although the five models differ substantially in their structure, they all account for the risk of lung cancer for an individual, the age and stage of lung cancer diagnosis, the corresponding lung cancer mortality, and the individual's life expectancy in the presence and absence of screening. More details of these processes will be summarized separately in a future report.
For this analysis, we prioritized the lung cancer mortality difference in the presence and absence of screening. We briefly describe how each model computes this measure. In Model E, screen-detected (and therefore earlier treated) cases experience a reduced risk of dying from lung cancer compared with the stage-specific survival if the same tumor had been diagnosed clinically, later in life. The improved prognosis in Model E is represented as a cure fraction specific to stage at detection, but if curative treatment fails, the survival of the patient will equal the survival in the case in which the tumor had been diagnosed clinically (obtained from Surveillance, Epidemiology, and End Results data). Model F estimates cure rates that depend on sex, tumor stage, and histology. Model M assumes that most patients with early-stage lung cancer would undergo resection and that (for patients without undetected distant metastases or additional primary lung cancers in another lobe) this resection is curative. For detected cancers in Model U, time to death from lung cancer is based on survival models that define cure by histology, stage, sex, and age at diagnosis. Mortality reduction due to screening is due to the earlier stage and younger age at detection. Model S estimates the probability of lethal metastases as a function of tumor size, histology, and sex. Advanced-stage lung cancer is, by definition, detected after the onset of lethal metastases, but some early-stage cases are detected before this occurs. With screening, patients are more likely to be detected at an early stage and cured of their disease following standard of care.
Figure 2 shows the cumulative lung cancer mortality ratio between the chest radiography arm and the CT arm by year of followup, indicating that the calibrated models agree with the observed lung cancer mortality reduction after 6 years of followup in the NLST.
Choosing Screening Programs and Expressing Harms and Benefits
A comprehensive set of 576 programs that varied lung cancer CT screening frequency, ages of starting and stopping screening, and eligibility based on smoking history were examined (Table 1) and compared with a reference scenario with no screening. To reduce the number of scenarios to consider, we first identified “efficient” programs according to each model; programs on the model's “efficient frontier” prevented the greatest number of lung cancer deaths for the same number of CT screenings. All scenarios were first run separately for men and women, and we looked for programs that were efficient for both sexes. The outcomes were then pooled to provide outcomes for both sexes combined. Modeling groups standardized input data on smoking history and nonlung cancer mortality to simulate life histories of the U.S. cohort born in 1950, using an updated version of the National Cancer Institute Smoking History Generator (5, 6). This cohort was chosen because in 2013 the individuals in this cohort will reach the same age as roughly the midrange of participants in the NLST. The cohort includes smokers and nonsmokers so that outcomes are at the population level.
Initially, all scenarios in the 90%–100% (y-axis) range of each model's efficient frontier were considered. Since there are outputs from five models for each scenario, scenarios that were evaluated as appropriately (90%–100%) efficient by at least three of the five models were considered “consensus efficient.” Additionally, model results were compared using a formal approach previously described in the literature (7), again using the consensus criterion of three or more models agreeing in their assessment. The two approaches identified similar consensus programs. In all simulated scenarios, a perfect screening adherence was assumed for individuals who met the screening eligibility criteria at any given age. A complete overview of the 576 programs and selection of consensus programs and their benefits is forthcoming.
Screening benefits are expressed as the percentage of cancers detected at an early (I or II) stage, percentage and absolute number of lung cancer deaths averted, and life-years gained (absolute and per lung cancer death prevented). Composite measures of the number of screenings per lung cancer averted and per life-year gained are also presented. Potential harms are expressed as the number of CT screenings per 100,000 persons (and percentage of the cohort undergoing at least one screening), the number of screenings plus followup imaging examinations, and the number of overdiagnosed lung cancers and radiation-related lung cancer deaths. All counts are cumulative from ages 45 to 90 years, per 100,000 persons in the cohort at age 45 years. Averages of results from all five models are presented unless otherwise noted.
A forthcoming publication based on this report will provide additional details regarding the modeling of followup examinations (the models employed varied approaches to extrapolate from NLST observations or from guidelines devised for incidentally-detected pulmonary nodules) and radiation-related lung cancers. Additional harms of screening, such as anxiety, complications, or longer periods of adverse effects of treatment, were not considered.
The screening programs are labeled as follows: frequency (annual [A], biennial [B], and triennial [T]), age start, age stop, minimum pack-years, and maximum years since quitting.
We compared 26 consensus scenarios that start screening at age 50, 55, or 60 years, stop screening at age 80 or 85 years, and that are very close to or on the efficient frontier and were identified as consensus efficient using both approaches described in the Methods, as well as a 27th program that is most similar to the NLST criteria (A55-75-30-15; not among the consensus efficient programs). Of the 27 programs, four are triennial, six biennial, and the rest annual. None have a starting age of 45 years. Table 2 shows the benefits of these scenarios and Table 3 shows the harms. Table 2 shows the estimated numbers of lung cancer deaths averted in this specific U.S. cohort. Without screening, 3,719 (per 100,000) persons would ultimately die from the disease. Triennial screening programs lead to rather limited lung cancer mortality reductions, on the order of 6% or less in this cohort (between 172 and 225 lung cancer deaths averted per 100,000 persons), also shown in Table 2. Biennial programs lead to 6.5% to 9.6% lung cancer mortality reductions. Annual programs lead to 11% to 21% lung cancer mortality reductions. When we simulated a scenario that looks most similar to the NLST design and inclusion criteria (A55-75-30-15), this cohort would experience a 12% lung cancer mortality reduction.
This 12% reduction is notably lower than the observed point estimate of 20% or the recently updated 16% in the NLST, for at least two major reasons: 1) in NLST, almost all enrolled persons in the CT arm were screened, whereas in this cohort analysis, only eligible persons (19%) in the cohort were screened (dilution effect); and 2) we assessed lifetime lung cancer mortality (compared with 6-year followup in NLST). Furthermore, in contrast to NLST, once a person's characteristics do not satisfy the screening criteria (such as passing the limit of years since smoking cessation), that person is not invited for future screenings in our analysis, and we only considered the 1950 birth cohort instead of all NLST subjects.
For all triennial and biennial efficient programs, the starting age is 60 years and the minimum pack-years is 40 (with one exception). The first two triennial scenarios clearly show the effect of stopping at age 80 or 85 years: about 6% more screenings when stopping at age 85 years, leading to 10% more lung cancer deaths averted (Table 2). The next two scenarios give an indication of extending the time frame of quitting: extending the possible (quit) time from 10 years to 15 years leads to a 6% increase in deaths averted (at the expense of 14% additional screenings), and increasing to 25 years an additional 12% (at the expense of 20% additional screenings). When comparing the same eligibility criteria with triennial or biennial screening, the additional percentage of lung cancer deaths averted is about 40%, at the expense of about 50% additional screenings. The biennial comparisons generally show the same differences as discussed before with the triennial comparisons. Biennial screening scenarios are more effective, leading to 241 to 358 lung cancer deaths averted per 100,000 persons, still comprising less than 130,000 screenings. By comparing B60-85-30-20 with B60-85-40-25, we see the effect of simultaneously including lighter smokers but limiting eligibility to fewer former smokers (32% more screenings and 15% more lung cancer deaths averted).
For the annual policies, the starting ages are 50, 55, or 60 years. Table 2 clearly shows that among the consensus efficient programs, the most intensive annual program may be substantially more effective than the most intensive biennial program (A50-85-20-25 leads to more than double the lung cancer deaths averted than B60-85-30-20). Further, the biennial or triennial strategies that emerged as consensus efficient had the strictest smoking history criteria we evaluated (40 pack-years and 10 years since quitting), leading to low numbers of screenings and less lung cancer deaths avoided. Efficient programs that screened individuals with lighter smoking histories were more likely to be annual programs. For these reasons, we focused on finding effective and efficient scenarios that screened eligible persons every year.
The scenario that resembles the original NLST criteria the most (A55-75-30-15) leads to less benefit (but more screenings) when compared with the next least intensive program (A60-80-30-25). The inclusion criteria used for the NLST are therefore not the most efficient ones for a population screening program. For example, expanding the original NLST criteria (A55-75-30-15) by 5 more years (A55-80-30-15) or beginning and stopping screening 5 years later but extending the risk group up to 25 years since quitting smoking (A60-80-30-25) are more effective and efficient; about the same number of screenings are needed, but these scenarios lead to more lung cancer deaths averted. Specifically, the NLST criteria required 577 screenings per lung cancer death averted compared with 550 and 511 screenings for the other two scenarios, respectively (Table 2).
When we focused on annual programs requiring between 200,000 to 300,000 CT screenings per 100,000 population, three scenarios stood out: A55-85-40-20 and two strategies with later starting ages but more inclusive cutoffs for years since quitting, A60-85-30-25 and A60-85-40-25. These scenarios lead to half (49% to 52%) of all lung cancers being detected at an early stage (compared with 37% in usual care), 12% to 15% of lung cancer deaths averted, and between 4,200 and 5,300 life-years gained (Table 2). Larger lung cancer mortality reductions could be reached but would require a substantial increase in the number of screenings. However, clinical concerns about the potential for increased operative mortality in older individuals with heavy smoking histories, as well as increased comorbidity and reduced eligibility for surgery with curative intent at these higher age limits (which the models did not address in detail in the comparative analyses), led us to focus on scenarios with stopping ages of 80 years.
The seven programs highlighted in Table 2 and Table 3 are the consensus efficient annual programs with a stopping age of 80 years and screening counts between 200,000 and 600,000, plus an eighth program (A60-80-40-25) with just under 200,000 screenings included as a reference program.
Focusing on annual scenarios stopping at age 80 years (the highlighted scenarios) in Table 2, Table 3, Figure 3, and Figure 4 shows the impact of expanding the smoking eligibility in the age range of 55–80 beyond the criteria similar to NLST; for example, to 25 years since quitting (A55-80-30-25 or even A55-80-20-25 or A55-80-10-25). Although these are still efficient scenarios per our definition (maximum lung cancer deaths averted given number of CT screenings performed), they require more CT screenings (both overall and per person) and are associated with more radiation-related lung cancer deaths, especially when expanding the eligibility criteria to less than 30 pack-years (Table 2).
In Figure 3, it is apparent that with more CT screenings, more lung cancer deaths may be averted, but there are diminishing returns, as indicated by the decrease of the slope of the line (efficient frontier) connecting the programs that yield the greatest reduction in lung cancer mortality for a given number of screenings. Figure 4 plots the life-years gained on the y-axis. The A60-80-20-25 scenario, which extends eligibility to individuals with fewer pack-years, is still efficient with respect to number of screenings and lung cancer deaths averted but represents a noticeable tradeoff between the measures of deaths averted and life-years gained (provides fewer life-years gained). Other indications of the tradeoffs inherent in the A60-80-20-25 scenario are that for the three consecutive (in Table 2 and Table 3) scenarios A55-80-30-15, A60-80-20-25, and A55-80-30-25, the number of screenings per lung cancer deaths averted keeps going up (550, 570, and 583, respectively), while the number of screenings per life-year gained is the highest (worst) for A60-80-20-25 (52, 57, and 54, respectively). Of the same three consecutive scenarios, the A60-80-20-25 scenario extends screening to the highest percentage of the cohort (19%, 25%, and 20%) but has the highest number needed to screen to prevent one lung cancer death (37, 43, and 35, respectively).
Of the efficient scenarios, annual screening in the age range of 55 to 80 years was found to have substantial benefits. The annual programs include a strategy similar to the NLST criteria: starting screening at age 55 years, ending at age 80 years for ever-smokers with at least 30 pack-years, and no more than 15 years since quitting for former smokers (A55-80-30-15). With this program, 19.3% of the cohort would be screened at least once, requiring about 287,000 CT screenings per 100,000 persons, leading to 50% of lung cancers being detected at an early stage and a 14% lung cancer mortality reduction (about 520 lung cancer deaths averted), resulting in about 5,500 life-years gained. The benefits accruing from the A55-80-30-15 program must be weighed against the following harms (Table 3): 330,000 CT examinations per 100,000 persons screenings and followup CT scans), an estimated 4% overdiagnosis rate (of all lung cancers in the cohort), and 0.8% of lung cancer deaths (24 per 100,000 persons) related to radiation exposure (based on two models).
We conducted this study to extrapolate findings from the NLST to compare screening programs that could potentially be adopted in the general U.S. population. Of the efficient scenarios, annual screening of individuals with at least 30 pack-years of smoking who are between the ages of 55 and 80 years offers substantial benefits. Comparable scenarios (A60-80-30-25 and A60-80-40-25) offer a different tradeoff of benefits and harms. Extending eligibility to individuals with fewer pack-years—although still efficient—leads to additional benefits along with additional harms. These models cannot determine which efficient scenario is best, but are valuable tools that project the results of the trials to different screening scenarios over the course of a lifetime and show which scenarios provide the greatest benefits for a specified level of harms.
We can compare the A55-80-30-15 scenario, which required 300,000 CT screenings and yielded a 14% mortality reduction (521 lung cancer deaths averted, based on results from five models (Table 2), and 690 lung cancer deaths averted, as estimated solely by Model E), with the U.S. Preventive Services Task Force recommendations for breast and colorectal cancer screening by considering the number of screenings needed for each site-specific test; the breast cancer recommendation would mean about 1.1 million screening mammographies (per 100,000 women), resulting in a 30% breast cancer mortality reduction (700 breast cancer deaths averted), and the colorectal cancer recommendations would mean 225,000 screening colonoscopies, resulting in a 77% colorectal cancer mortality reduction (1,910 colorectal cancer deaths averted). These breast and colorectal cancer estimates are solely from Model E (used in prior comparative analyses [8, 9]) in the 1960 birth cohort (breast) and the 1950 birth cohort (colorectal cancer), with counts per 100,000 persons followed from ages 45 to 90 years.
This comparative analysis did not quantify all potential harms from screening, including the number of false-positive results, the number of additional years a patient lives with the diagnosis of lung cancer and possible adverse effects of treatment, the possible risks of false reassurance (a false-negative result that could possibly postpone access to care), or the possibility of a behavioral (smoking) change after screening. All smokers, independent of eligibility for a screening program, should be counseled to quit and offered assistance.
1. National Lung Screening Trial Research Team; Aberle DR, Adams AM, Berg CD, Black
WC, Clapp JD, et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. N Engl J Med. 2011;365(5):395-409.
2. Pinsky P, Black B. Subset and histological analysis of screening efficacy in the National Lung Screening Trial. Paper presented at: 2nd Joint Meeting of the National Cancer Advisory Board and Board of Scientific Directors; June 24, 2013; Bethesda, MD.
3. Heijnsdijk EA, Wever EM, Auvinen A, Hugosson J, Ciatto S, Nelen V, et al. Quality-of-life effects of prostate-specific antigen screening. N Engl J Med. 2012;367(7):595-605.
4. Oken MM, Hocking WG, Kvale PA, Andriole GL, Buys SS, et al; PLCO Project Team. Screening by chest radiograph and lung cancer mortality: the Prostate, Lung, Colorectal, and Ovarian (PLCO) randomized trial. JAMA. 2011;306(17):1865-73.
5. Anderson C, Burns DM, Dodd KW, Feuer EJ. Chapter 2: Birth-cohort-specific estimates of smoking behaviors for the U.S. population. Risk Anal. 2012;32(Suppl 1):S14-24.
6. Rosenberg MA, Feuer EJ, Yu B, Sun J, Henley SJ, Shanks TG, et al. Chapter 3: Cohort life tables by smoking status, removing lung cancer as a cause of death. Risk Anal. 2012;32(Suppl 1):S25-38.
7. Charnes A, Cooper WW, Rhodes E. Measuring the efficiency of decision making units. Eur J Oper Res. 1978;2:429-44.
8. Mandelblatt JS, Cronin KA, Bailey S, Berry DA, de Koning HJ, et al; Breast Cancer Working Group of the Cancer Intervention and Surveillance Modeling Network. Effects of mammography screening under different screening schedules: model estimates of potential benefits and harms. Ann Intern Med. 2009;151(10):738-47.
9. Zauber AG, Lansdorp-Vogelaar I, Knudsen AB, Wilschut J, van Ballegooijen M, Kuntz KM. Evaluating test strategies for colorectal cancer screening: a decision analysis for the U.S. Preventive Services Task Force. Ann Intern Med. 2008;149(9):659-69.
AHRQ Publication No. 13-05196-EF-2
Current as of July 2013
de Koning HJ, Meza R, Plevritis SK, ten Haaf K, Munshi VN, Jeon J, Erdogan SA, Kong CY, Han SS, van Rosmalen J, Choi SE, Miller M, Moolgavkar S, Pinsky PF, Berg CD, Berrington de Gonzalez A, Black WC, Tammemagi CM, Hazelton WD, Feuer EJ, McMahon PM. Benefits and Harms of Computed Tomography Lung Cancer Screening Programs for High-Risk Populations. AHRQ Publication No. 13-05196-EF-2. July 2013. http://www.uspreventiveservicestaskforce.org/uspstf13/lungcan/lungcanmodeling.htm.