Skip to main content

Advertisement

Log in

Analysis of lung cancer incidence in the nurses’ health and the health professionals’ follow-up studies using a multistage carcinogenesis model

  • Original Paper
  • Published:
Cancer Causes & Control Aims and scope Submit manuscript

Abstract

We analyzed lung cancer incidence among non-smokers, continuing smokers, and ex-smokers in the Nurses Health Study (NHS) and the Health Professionals Follow-Up Study (HPFS) using the two-stage clonal expansion (TSCE) model. Age-specific lung cancer incidence rates among non-smokers are identical in the two cohorts. Within the framework of the model, the main effect of cigarette smoke is on the promotion of partially altered cells on the pathway to cancer. Smoking-related promotion is somewhat higher among women, whereas smoking-related malignant conversion is somewhat lower. In both cohorts the relative risk for a given daily level of smoking is strongly modified by duration. Among smokers, the incidence in NHS relative to that in HPFS depends both on smoking intensity and duration. The age-adjusted risk is somewhat larger in NHS, but not significantly so. After smokers quit, the risk decreases over a period of many years and the temporal pattern of the decline is similar to that reported in other recent studies. Among ex-smokers, the incidence in NHS relative to that in HPFS depends both on previous levels of smoking and on time since quitting. The age-adjusted risk among ex-smokers is somewhat higher in NHS, possibly due to differences in the age-distribution between the two cohorts.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. We also tested disjoint models for never and ever smokers. However, these models also lead to similar fits, but with a larger number of parameters.

  2. However, a smoking effect on initiation almost doubling the background rate is still consistent with the data.

  3. A gain of eight log-likelihood points with only one more parameter.

  4. Age-adjusted to the 1990 US total population. Please see Materials and Methods for details.

References

  1. Risch HA, Howe GR, Jain M, Burch JD, Holowaty EJ, Miller AB (1993) Are female smokers at higher risk for lung cancer than male smokers? A case–control analysis by histologic type. Am J Epidemiol 138(5):281–293

    PubMed  CAS  Google Scholar 

  2. Zang EA, Wynder EL (1996) Differences in lung cancer risk between men and women: examination of the evidence. J Natl Cancer Inst 88(3–4):183–192

    Article  PubMed  CAS  Google Scholar 

  3. Perneger TV (2001) Sex, smoking, and cancer: a reappraisal. J Natl Cancer Inst 93(21):1600–1602

    PubMed  CAS  Google Scholar 

  4. Bain C, Feskanich D, Speizer FE, Thun M, Hertzmark E, Rosner BA, et al (2004) Lung cancer rates in men and women with comparable histories of smoking. J Natl Cancer Inst 96(11):826–834

    PubMed  Google Scholar 

  5. Blot WJ, McLaughlin JK (2004) Are women more susceptible to lung cancer? J Natl Cancer Inst 96(11):812–813

    Article  PubMed  Google Scholar 

  6. Patel JD, Bach PB, Kris MG (2004) Lung cancer in US women: a contemporary epidemic. JAMA 291(14):1763–1768

    Article  PubMed  CAS  Google Scholar 

  7. Moolgavkar SH, Venzon DJ (1979) Two-event models for carcinogenesis: incidence curves for childhood and adult tumors. Math Biosci 47(1):55–77

    Article  Google Scholar 

  8. Moolgavkar SH, Knudson AG (1981) Mutation and cancer: a model for human carcinogenesis. J Natl Cancer Inst 66(6):1037–1052

    PubMed  CAS  Google Scholar 

  9. Hazelton WD, Luebeck EG, Heidenreich WF, Moolgavkar SH (2001) Analysis of a historical cohort of Chinese tin miners with arsenic, radon, cigarette smoke, and pipe smoke exposures using the biologically based two-stage clonal expansion model. Radiat Res 156(1):78–94

    Article  PubMed  CAS  Google Scholar 

  10. Hazelton WD, Clements MS, Moolgavkar SH (2005) Multistage carcinogenesis and lung cancer mortality in three cohorts. Cancer Epidemiol Biomarkers Prev 14(5):1171–1181

    Article  PubMed  CAS  Google Scholar 

  11. Heidenreich WF, Luebeck EG, Moolgavkar SH (1997) Some properties of the hazard function of the two-mutation clonal expansion model. Risk Anal 17(3):391–399

    Article  PubMed  CAS  Google Scholar 

  12. US Decennial Life Tables for 1989–91. vol 1. Hyattsville, Maryland: US Department of Health and Human Services, National Center for Health and Statistics; 1997

  13. Gilks WR, Richardson S, Spiegelhalter DJ (eds) (1996) Markov chain Monte Carlo in practice. Interdisciplinary Statistics. Chapman & Hall, London

    Google Scholar 

  14. Bain C, Feskanich D, Speizer FE, Thun M, Hertzmark E, Rosner BA, et al (2004) Erratum. J Natl Cancer Inst 96(23):1796–1797

    Google Scholar 

  15. Wakelee HA, Chang ET, Gomez SL, Keegan TH, Feskanich D, Clarke CA, et al (2007) Lung cancer incidence in never smokers. J Clin Oncol 25(5):472–478

    Article  PubMed  Google Scholar 

  16. Rachet B, Siemiatycki J, Abrahamowicz M, Leffondre K (2004) A flexible modeling approach to estimating the component effects of smoking behavior on lung cancer. J Clin Epidemiol 57(10):1076–1085

    Article  PubMed  CAS  Google Scholar 

  17. Luebeck EG, Heidenreich WF, Hazelton WD, Paretzke HG, Moolgavkar SH (1999) Biologically based analysis of the data for the Colorado uranium miners cohort: age, dose and dose-rate effects. Radiat Res 152(4):339–351

    Article  PubMed  CAS  Google Scholar 

  18. Heidenreich WF, Wellmann J, Jacob P, Wichmann HE (2002) Mechanistic modelling in large case–control studies of lung cancer risk from smoking. Stat Med 21(20):3055–3070

    Article  PubMed  CAS  Google Scholar 

  19. Schollnberger H, Manuguerra M, Bijwaard H, Boshuizen H, Altenburg HP, Rispens SM, et al (2006) Analysis of epidemiological cohort data on smoking effects and lung cancer with a multi-stage cancer model. Carcinogenesis 27(7):1432–1444

    Article  PubMed  CAS  Google Scholar 

  20. Burns DM, Shanks TG, Choi W, Thun MJ, Heath CW, Garfinkel L (1997) The American Cancer Society Cancer Prevention Study I: 12-year followup of 1 million men and women. In: Burns DM, Garfinkel L, Samet JM (eds) Changes in cigarette-related disease risks and their implications for prevention and control. Smoking and Tobacco Control, Monograph 8, NIH Publ No 97–4213, pp 113–304

  21. Shriver SP, Bourdeau HA, Gubish CT, Tirpak DL, Davis AL, Luketich JD, et al (2000) Sex-specific expression of gastrin-releasing peptide receptor: relationship to smoking history and risk of lung cancer. J Natl Cancer Inst 92(1):24–33

    Article  PubMed  CAS  Google Scholar 

  22. Siegfried JM, DeMichele MA, Hunt JD, Davis AG, Vohra KP, Pilewski JM (1997) Expression of mRNA for gastrin-releasing peptide receptor by human bronchial epithelial cells. Association with prolonged tobacco exposure and responsiveness to bombesin-like peptides. Am J Respir Crit Care Med 156(2 Pt 1):358–366

    PubMed  CAS  Google Scholar 

  23. Hauptmann M, Lubin JH, Rosenberg P, Wellmann J, Kreienbrock L (2000) The use of sliding time windows for the exploratory analysis of temporal effects of smoking histories on lung cancer risk. Stat Med 19(16):2185–2194

    Article  PubMed  CAS  Google Scholar 

  24. Leffondre K, Abrahamowicz M, Siemiatycki J, Rachet B (2002) Modeling smoking history: a comparison of different approaches. Am J Epidemiol 156(9):813–823

    Article  PubMed  Google Scholar 

  25. Peto R, Darby S, Deo H, Silcocks P, Whitley E, Doll R (2000) Smoking, smoking cessation, and lung cancer in the UK since 1950: combination of national statistics with two case–control studies. BMJ 321(7257):323–329

    Article  PubMed  CAS  Google Scholar 

  26. Bach PB, Kattan MW, Thornquist MD, Kris MG, Tate RC, Barnett MJ, et al (2003) Variations in lung cancer risk among smokers. J Natl Cancer Inst 95(6):470–478

    Article  PubMed  Google Scholar 

Download references

Acknowledgments

We thank the Cancer Intervention and Surveillance Modeling Network (CISNET) Group, Dr. Anup Dewanji and Dr. Jihyoun Jeon for useful suggestions. We acknowledge support from the NIH grants RO1 CA047658 and UO1 CA97415. Financial support: NIH grants RO1 CA047658 and UO1 CA97415.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Suresh H. Moolgavkar.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Two-stage clonal expansion model (10 KB)

Appendix

Appendix

In this section we show how to calculate explicitly the individual likelihoods (Eq. 2) as function of the model parameters and the individual’s smoking history. For simplicity, we assume a constant lag time between the appearance of the first malignant cell and diagnosis, so the survival at any age t, S(t), is equal to the TSCE model survival at age t − t lag , S 2(t − t lag ) (see Eq. 4). We use the expressions derived in Heidenreich et al. [11] for the TSCE model survival in case of piecewise-constant parameters. From Heidenreich et al. [11], we have that the survival function and its derivative at any age t are

$$ S(t)=S_2(t-t_{lag})=\exp\left\{\sum\limits_{j=1}^k\frac{\mu_{0,j}X} {\alpha_{j}}\ln\left(\frac{q_j-p_j} {f_j(t_{j-1},t_k)}\right)\right\}, $$
(5)
$$ \begin{aligned} S^\prime(t)= &S^\prime_2(t-t_{lag})\\ = &S_2(t-t_{lag})\times \sum\limits_{j=1}^k-\frac{\mu_{0,j}X} {\alpha_{j}}\frac{\partial}{\partial t_k}\ln{(f_j(t_{j-1},t_k)}),\\ \end{aligned} $$
(6)

where k is the number of age-periods with different smoking-dose before age t k t − t lag , [t j−1,t j ] denote the end-points of the j − th age-period, d j is the smoking-dose during the j − th age-period, μ0,j , α j , g j , μ1, j denote the parameter values during the j − th age-period and

$$ g_{j}=g(1+g_{c}d_j^{g_{e}}),\quad \alpha_{j}=\alpha(1+g_{c}d_j^{g_{e}}), $$
(7)
$$ \mu_{1,j}=\mu_{1}(1+\mu_{1c}d_j^{\mu_{1e}}),\quad \mu_{0,j}=\mu_{0}(1+\mu_{0c}d_j^{\mu_{0e}}), $$
(8)
$$ p_j,q_j=\frac{1}{2}\left(-g_{j} \mp \sqrt{g_{j}^2+4\alpha_{j}\mu_{1,j}}\right) $$
(9)
$$ \tilde{y}_k=0,\quad \tilde{y}_{j-1}=\frac{\alpha_{j-1}} {\alpha_{j}}\frac{(\tilde{y}_j-p_j)q_je^{q_j(t_{j-1}-t_j)}+(q_j-\tilde{y}_j)p_j e^{p_j(t_{j-1}-t_j)}} {f_j(t_{j-1},t_k)}, $$
(10)
$$ f_j(t_{j-1},t_k)=(\tilde{y_j}-p_j) \exp\left\{q_j(t_{j-1}-t_j)\right\}+ (q_j-\tilde{y_j})\exp\left\{p_j(t_{j-1}-t_j)\right\}, $$
(11)
$$ \frac{\partial}{\partial t_k}f_k(t_{k-1},t_k)=[\exp\left\{q_k(t_{k-1}-t_k)\right\} -\exp\left\{p_k(t_{k-1}-t_k)\right\}] p_kq_k, $$
(12)
$$ \frac{\partial}{\partial t_k}f_j(t_{j-1},t_k)=[\exp\left\{q_j(t_{j-1}-t_j)\right\} -\exp\left\{p_j(t_{j-1}-t_j)\right\}] \frac{\partial}{\partial t_k}\tilde{y}_{j}, $$
(13)
$$ \frac{\partial}{\partial t_k}\tilde{y}_{k-1}=\frac{\alpha_{k-1}} {\alpha_k}\frac{(q_k-p_k)^2e^{(p_k+q_k)(t_{k-1}-t_k)}} {(f_k(t_{k-1},t_k))^2}p_kq_k, $$
(14)
$$ \frac{\partial}{\partial t_k}\tilde{y}_{j-1}=\frac{\alpha_{j-1}} {\alpha_j}\frac{(q_j-p_j)^2e^{(p_j+q_j)(t_{j-1}-t_j)}} {(f_j(t_{j-1},t_k))^2}\frac{\partial}{\partial t_k}\tilde{y}_{j}. $$
(15)

To calculate the individual likelihood (Eq. 2), we only need to evaluate expressions (5) and (6) at the age at entry (t = a e ) and at the age at last (t = a l ) accordingly.

As an example, we calculate the likelihood contribution of the individual whose smoking history is illustrated in the top panel of Fig. 1 at the maximum likelihood estimates of the joint-model (Table 2). The individual entered the NHS study at age 43 (approx) and died at age 63 from other causes. Hence, we must calculate the TSCE model survival at ages 43 − t lag  = 38 and 63 − t lag  = 58 to obtain her individual likelihood (Eq. 2). This individual started smoking at age 20, at a constant rate of 30 cigarettes per day, and continued with this pattern until age 42. Hence, to calculate the TSCE model survival at age 38, we have to consider two different age-periods, namely [0, 20]≡[t 0, t 1] and [20, 38]≡[t 1, t 2]. From expression (5), we have that the survival at age 43 is

$$ S(43)=S_2(38)=\exp\left\{\sum\limits_{j=1}^2\frac{\mu_{0,j}X} {\alpha_{j}}\ln\frac{q_j-p_j}{f_j(t_{j-1},38)}\right\} $$
(16)
$$ =\exp\left\{\frac{\mu_{0,1}X} {\alpha_{1}}\ln\frac{q_1-p_1}{f_1(0,38)}+\frac{\mu_{0,2}X} {\alpha_{2}}\ln\frac{q_2-p_2}{f_2(20,38)}\right\} $$
(17)

In this particular example, the parameter values during the first age-period are equal to the background parameters (i.e., d 1 = 0). In contrast, during the second period, we have to multiply the promotion rate g and the malignant conversion rate μ1 by their corresponding dose–responses (with d 2 = 30). We also multiply the initiated cells’ division rate, α, by the dose–response of g in order to keep them consistent. In this model, there is no dose–response on the initiation rate, μ0, so it is constant between all age-periods. Using (Eqs. 5, 7–15) and the parameter estimates from Table 2, we obtain that

$$ \begin{aligned} \ln[S_2(38)]=&\frac{(8.14\times 10^{-8})(10^7)}{3} \ln\frac{2.55 \times10^{-6}-(-0.0960)}{0.0959}\\ &+\frac{(8.14\times 10^{-8})(10^7)}{5.54} \ln\frac{5.18\times 10^{-6}-(-0.1765)}{0.1766}\\ =&-0.00116. \\ \end{aligned} $$
(18)

So the survival at age 43 is S(43) = S 2(38) = 0.9988. To calculate the TSCE model survival at age 58, we need to consider the following age-periods: \([0, 20]\equiv[t_0,t_1], [20, 42]\equiv[t_1,t_2], [42,44]\equiv[t_2,t_3], [44,46]\equiv[t_3,t_4], [46,48]\equiv[t_4,t_5], [48,55]\equiv[t_5,t_6]\) and [55, 57]≡[t 6, t 7], [57, 58]≡[t 7, t 8]. At each age-period, we must adjust the model parameters by the corresponding dose–response. Similarly as before, we calculate the survival at age 63 using (Eqs. 5, 7–15)

$$ \begin{aligned} S(63)=S_2(58)=&\exp\left\{\sum\limits_{j=1}^8\frac{\mu_{0,j}X} {\alpha_{j}}\ln\frac{q_j-p_j}{f_j(t_{j-1},58)}\right\}\\ = &0.9768.\\ \end{aligned} $$
(19)

So the likelihood contribution of this individual is equal to

$$ L_i(63,43;\bar{\theta}(d_i))=\frac{S(63)}{S(43)}=\frac{0.9768} {0.9988}=0.9779. $$
(20)

As a final example, let’s assume that an individual with the same smoking history, is diagnosed with lung cancer at age 63. In this case, we must obtain S′(63) = S2(58) to calculate its individual likelihood (Eq. 2). Using (Eqs. 5–15) we have that

$$ \begin{aligned} S{^\prime}(63)= S{^\prime}_2(58)=&S_2(58)\times \sum\limits_{j=1}^8-\frac{\mu_{0, j}X}{\alpha_{j}}\frac{\partial} {\partial t_k}\ln{(f_j(t_{j-1},58))}\\ =&-3.1171\times 10^{-3}.\\ \end{aligned} $$
(21)

So the likelihood contribution of this individual is equal to

$$ L_i(63,43;\bar{\theta}(d_i))=-\frac{S^\prime(63)}{S(43)}= \frac{3.1171\times 10^{-3}}{0.9988}=3.1208\times 10^{-3}. $$
(22)

Fortran and R code to compute the TSCE hazard and survival functions in case of piecewise-constant parameters are available from the authors by request.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Meza, R., Hazelton, W.D., Colditz, G.A. et al. Analysis of lung cancer incidence in the nurses’ health and the health professionals’ follow-up studies using a multistage carcinogenesis model. Cancer Causes Control 19, 317–328 (2008). https://doi.org/10.1007/s10552-007-9094-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10552-007-9094-5

Keywords

Navigation