Introduction

The prevalence of obesity has dramatically increased for decades worldwide. Body mass index (BMI) is one of the most commonly used surrogate measurements of overweight (BMI 25.0–29.9 kg/m2) and obesity (BMI ≥ 30.0 kg/m2) [1]. The majority of studies supported the hypothesis that elevated BMI was associated with an increased risk of incidence of cancers of the colon [27], pancreas [3, 812] and kidney [6, 10, 1315], but with a decreased risk of incidence of lung cancer [6, 10, 1618]. No association was found between BMI and incidence of cancers of the prostate [1925] and rectum [4, 5, 2629]. But, the relationship between BMI and incidence of cancers of other sites is still inconsistent: a linear positive relationship [30, 31] or no [6] relationship with gallbladder cancer, an inverse relationship [6] or no [10, 32] relationship with stomach cancer, a non-significant positive association with the liver [3, 6, 33, 34], cervix [3, 10, 33] or all sites combined [3, 6, 10], and a linear positive relationship [35] or no relationship [3, 6, 10, 36] with the bladder. Recent evidence also suggests that obesity is associated with an increased risk of incidence of ovarian cancer [10, 3740]. It is generally accepted that elevated BMI is associated with a decreased risk of incidence of breast cancer for premenopausal women, but the relationship is reversed in postmenopausal women [3, 10, 4147].

We investigated the shape of the relationship between BMI and incidence of cancers in a large prospective Finnish cohort.

Methods

Study population

Population-based surveys on CVD and other non-communicable disease risk factors and their developing trends have been conducted every 5 years since 1972 in Finland (FINRISK study) [48]. Seven FINRISK cohorts of 1972, 1977, 1982, 1987, 1992, 1997 and 2002 are included in the current data analysis. All seven surveys included subjects who were 24–64 years of age, and the 1997 and 2002 survey also included subjects aged 65–74 years. The total number of persons in the seven surveys was 60,504, including 29,401 men and 31,103 women. The FINRISK studies were approved by the Ethics Committee of the National Public Health Institute, Helsinki, Finland.

Baseline measurements

The surveys included a self-administered questionnaire (mainly including questions on socioeconomic factors, medical history, health behaviour, and psychosocial factors), physical examinations and laboratory measures. Height and weight were measured on site by specially trained nurses with participants not wearing shoes and heavy clothing. BMI was calculated as body weight in kilograms divided by the square of height in meters. Smoking status was classified into three categories (never smokers, former smokers and current smokers). Leisure-time physical activity was graded in three categories (sedentary: reading, watching television, or other sedentary activity; mild: walking, bicycling or exercise otherwise at least 4 h per week; intensive: running, jogging, skiing, swimming, or heavy garden work, etc., at least 3 h per week, or participating in athletic competitions or regularly exercising several times a week). Education was classified into three categories (≤9, 10–12, >13 schooling years). Individuals with missing data on body weight or height, baseline smoking status, leisure-time physical activity or schooling years were excluded from the analyses (n = 4,606, 7.6 %).

Definition of cancer incidence

Information on incidence of cancers was obtained from the Finnish Cancer Registry (FCR) and the dates of deaths from the cause-of-death register of Statistics Finland by computer-based record linkage using the unique personal identity codes assigned to every resident of Finland. The FCR has collected data on all cancer cases diagnosed in Finland since 1953. All physicians, hospitals, and pathology, cytology and hematology laboratories in the country are obligated to send a notification to the FCR of all diagnosed and suspected cancer cases since 1961. In addition, Statistics Finland annually sends to the registry a computerized file of death certificates in which a malignant disease is mentioned. The data coverage in the FCR is virtually complete, 99 % for solid tumors, and the data accuracy is high as previously validated by different researchers [49].

The FCR uses International Classification of Diseases for Oncology, 3rd (ICD-O-3) in classification of the cancer cases. For the current study, cancers were categorised into following sites: any site (C000–C809), stomach (ICD-O-3 topography C16), colon (C18–C19), rectum (C20), liver (C220), gallbladder and extrahepatic bile ducts (C23–C24), pancreas (C25), lung (C34), breast (C50), cervix uteri (C53), ovary (C56), prostate (C61), kidney (C649) and bladder (C67). Only the first occurrence of cancer after the baseline examination was included in the analysis, subsequent cancers of the same site or not were excluded and subjects with a cancer diagnosed before the baseline survey were excluded from the cohort (n = 1,173, 1.9 %). Follow-up of each cohort member started from the date of baseline survey and continued until the date of first cancer diagnosis, date of death, or 31 December, 2008, whichever was the earliest.

Statistical analyses

The Cox proportional hazards model was used to estimate the hazard ratios (HR) and 95 % confidence intervals (CI) of BMI in relation to incidence of cancers, adjusted for baseline smoking status, leisure-time physical activity, education and area using attained age as the time scale. All analyses were performed separately for men and women. The shape of the relationship between BMI and incidence of cancer was explored using both parametric models (conventional linear or polynomial model) and nonparametric models (the linear spline or restricted cubic spline regression model). Akaike’s information criterion (AIC) was used to judge the model fitness between conventional linear model and polynomial models (including quadratic, cubic or fractional polynomial model), the lower the AIC value the better the model fitness is. The reduction of AIC is evaluated by the likelihood ratio test (LRT) or a deviance difference test [50, 51]. We subsequently fitted nonparametric smooth functions, using the restricted cubic spline or the linear spline corresponding to the best-fitting conventional polynomial or linear model [52]. The existence of threshold was estimated by a piecewise regression model using nonlinear least-squares estimation incorporating the “nl” function in Stata, with the lowest value of residual sum of squares, root-mean-square error and AIC, whereas the highest value of R-squared as the criteria for model selection of the optimal threshold or thresholds for BMI. In the Cox proportional hazards model, BMI was entered as a continuous variable, and factors including leisure-time physical activity, smoking status and education as categorical variables, along with a BMI-leisure-time physical activity, a BMI-smoking status or a BMI-education interaction term. In addition, HR for categorical BMI with incidence of cancer was also estimated by the Cox proportional hazards model using BMI category of 23.0–24.9 kg/m2 as the reference category. The proportional hazard assumption of each model was tested for specific variables and globally. The proportional hazards assumptions were met. Data analyses were performed in Stata Intercooled 11.2 (StataCorp, College Station, TX, USA).

Results

During the follow-up (mean 20.6 years), 8,429 (15.4 %) incident cancers were recorded (Table 1), 4,208 from men and 4,221 from women. The mean BMI was higher in older individuals, and persons with a low BMI tended to be more often to smoke (p < 0.05), to be physically active and more educated (p < 0.05 for trend test, Table 2).

Table 1 Baseline characteristics of the survey and the data of follow-up in FINRISK study
Table 2 Body mass index (BMI) by gender according to baseline age, smoking status and leisure-time physical activity

Table 3 shows that the best-fitting conventional model was conventional linear model for BMI in relation to incidence of cancers of the colon, liver and kidney in men and the gallbladder and breast in women (all p < 0.05 for LRT against their basic model), as well as the bladder and all sites combined in men and the stomach, colon, lung and ovary in women (all p > 0.05 for LRT against their basic model), and conventional polynomial model did not significantly improve the model fitness (p > 0.05 for LRT or for deviance difference test against their conventional linear model), which suggests a linear relationship. To the contrary, the conventional polynomial model significantly improved the model fitness of incidence of cancers of the lung in men (all p < 0.05 for LRT or deviance difference test of against the conventional linear model), prostate in men as well as all sites combined in women (p < 0.05 for LRT of quadratic polynomial model and p < 0.05 for deviance difference test of second-order fractional polynomial model against their basic model), which suggests a nonlinear relationship. In addition, model fitness for incidence of cancers of other sites was not improved with any term of BMI added (Table 3). HRs with 95 % CIs for incidence of cancer estimated based on the spline regression models were plotted against BMI in Fig. 1. The spline regression analysis showed that BMI had a non-threshold linear positive association with incidence of cancers of the colon, liver, kidney, bladder and all sites combined in men (Fig. 1c, g, s, u and w), and of cancers of the stomach, colon, gallbladder and ovary in women (Fig. 1b, d, j and q). BMI had an inverse association with incidence of cancers of the lung in men (Fig. 1m) and the lung and breast in women (Fig. 1n, o), whereas a J-shaped association with incidence of all cancers combined in women (Fig. 1x), which indicates there might be a threshold for incidence of cancers of the lung in men and the breast in women or two thresholds for incidence of all cancers combined in women existing. No association was observed for BMI and incidence of cancers of other sites (Fig. 1a, e, f, h, i, k, l, p, r, t and v). Threshold estimated for BMI by the non-linear least squares regression analysis based on the spline regression model was shown in Fig. 1 by the vertical lines and in Online Resource 1. Moreover, the relationship between BMI and incidence of cancer was confirmed by the results from analyses using the Cox proportional hazards model as presented as HR (95 % CI) for categorical BMI in Online Resource 2.

Table 3 Akaike’s information criterion (degrees of freedom) for the relationship between body mass index and hazard risk of cancer incidence in men and women
Fig. 1
figure 1figure 1figure 1

Hazard ratio and 95 % confidence intervals for body mass index (BMI) in relation to incidence of cancer among men and women. Dashed lines indicate hazard ratios and 95 % confidence intervals (95 % CIs) derived from the linear spline regression with one interior knot placed at the 50th percentiles of BMI (al, nq and sw), or from the restricted cubic spline regression with three interior knots placed at the 10th, 50th, and 90th percentiles of BMI (m, r and x), using age as the underlying time-scale in the Cox proportional hazards model adjusted for smoking status, leisure-time physical activity, education and area, and mean of BMI was set as the reference value (figure not shown for certain cancers with BMI > 45 kg/m2 due to their extreme high HRs). Vertical dot line indicates the location which represents threshold (with 95 % CIs denoted in subtitle) derived from the piecewise regression model followed by the spline regression model. Solid lines indicate slopes before and after thresholds

Since the interaction between linear BMI and smoking status was significant for incidence of all cancers combined in women (p = 0.01), an analysis was performed separately for smokers and never smokers. High BMI in women was associated with an increased overall cancer risk in never smokers but a reduced risk in smokers (Online Resource 3). The results were not altered much after excluding the first five years of follow-up (Online Resource 4).

Discussion

BMI had a non-threshold linear positive association with incidence of cancers of the colon, liver, kidney, bladder and all sites combined in men, and of cancers of the stomach, colon, gallbladder and ovary in women, an inverse association with incidence of cancers of the lung in men and the lung and breast in women, a J-shaped association with incidence of all cancers combined in women. In women, high BMI was associated with an increased overall cancer risk in never smokers but a reduced risk in smokers.

BMI was positively associated with incidence of colon cancer, but had no association with incidence of rectal cancer, which is in line with previous studies [27, 2629]. In spite of the fact that liver cancer and gallbladder cancer are relatively rare diseases, we still found BMI had a positive association with incidence of liver cancer in men and gallbladder cancer in women, which is in line with previous studies [3, 6, 30, 31, 33, 34]. Obese patients tend to have non-alcoholic fatty liver disease or gallstone disease, which might mediate the carcinogenesis [34, 5356]. In line with previous studies [6, 10, 1315], we found that BMI was positively associated with incidence of kidney cancer in men, which may be due to promoting kidney damage through oxidative stress [57], diabetes or hypertension [58, 59], or altered circulating concentrations of hormones [60, 61].

In addition, a non-significant linear positive relationship was observed between BMI and incidence of cancers of other sites including the bladder in men and the stomach and ovary in women, no association was observed between BMI and incidence of pancreatic cancer and cervical cancer. In recent studies, elevated BMI has been linked with an increased risk of incidence of cancers of the pancreas [3, 812] and ovary [10, 3740], but it is still controversial about the relationship between BMI and incidence of stomach cancer [6, 10, 32], bladder cancer [3, 6, 10, 35, 36] or cervical cancer [3, 10, 33]. Taken together, excess body fat is associated with elevated production of insulin, leading to the increase of insulin-like growth factor I (IGF-I), or secretes a variety of cytokines and sex steroids, which subsequently stimulates cell proliferation and suppresses apoptosis, or chronic inflammation, and thus has been suggested to play a role in the carcinogenesis [6265]. Elevated BMI was inversely associated with incidence of lung cancer, which is consistent with previous studies [6, 10, 1618]. Further, no significant interaction was detected between BMI and smoking status, which still needs to be confirmed, for the possible metabolic effects of smoking on body weight [66, 67]. A recent epigenetic study reported that one allele of the fat mass and obesity-associated gene that had been linked with elevated BMI, was associated with a decreased risk of incidence of lung cancer, independent of smoking or weight loss due to the preclinical disease [68].

We found that there was a significant inverse association between BMI and incidence of breast cancer in women. A declining risk association among premenopausal women but an increasing risk association among postmenopausal women has been reported in prvious studies [3, 10, 41, 4347]. Possible differences in these associations could be related to loss of normal ovarian function with reduced ovarian oestrogens production as the mechanism by which obesity could reduce the tumor promoting effects in premenopausal women, but enhanced oestrogen synthesis by adipose tissue that could contribute to an increased risk of breast cancer among postmenopausal women [69, 70].

Moreover, the possible explanation for the no association of BMI with incidence of prostate cancer observed in our study is that although with lower levels of sex hormone-binding globulin (SHBG) detected [3, 1925, 45], which is a risk factor for prostate cancer, obese men also possess lower levels of testosterone and IGF-I levels, which may exert a protective effect against the adverse effect of lower levels of SHBG [20]. Furthermore, we did not have information on stage of prostate cancer, since some studies revealed that elevated BMI was associated with an increased risk of incidence of high-graded or aggressive prostate cancer [22, 24, 25], but with a decreased risk of incidence of localized or low-graded prostate cancer [2325], the mutual coupling effect might be compensated.

We found that in women, high BMI in women was associated with an increased overall cancer risk in never smokers but a reduced risk in smokers, with significant interaction detected between BMI and smoking status. But no significant interaction was detected between BMI and leisure-time physical activity, or between BMI and education in relation to incidence of cancer. Physically active individuals tend to be leaner, physical activity has been reported to attenuate or eliminate the relation between BMI and the risk of incidence of cancers of the colon, rectum and pancreas [5, 8], perhaps through improving insulin resistance and increasing adiponectin levels [7173]. High educated individuals tend to be leaner, education may have an impact on energy balance by influencing obesity-related behaviours such as diet and physical activity [74, 75].

The strengths of the study are its large cohort size, population-based setting, large number of certain cancer cases accumulated during the follow-up, and completeness of the follow-up due to the well-organized registration system in Finland. But we do not have precise information on weight changes before the baseline and during the follow-up, nor on menopausal status and hormone therapies that would have been potentially affected the development of breast cancer [41, 43, 44, 46, 47]. Because BMI does not accurately differentiate fat mass from muscles, as an endocrine organ, visceral adipose tissue has been suggested to play a more important role in the carcinogenesis than the subcutaneous compartment [63, 64], it may be sometimes misleading to use BMI as an index for obesity in relation to cancer incidence [70, 76]. Thus, investigations on body composition or fat distribution in relation to cancer incidence are needed but the feasibility to conduct such an observational study is low. In addition, we do not have information on several lifestyles or behaviour factors, for instance, dietary factors or alcohol consumption, which might contribute to obesity, and is an independent etiologic factor for several cancers, especially for stomach cancer and liver cancer [33, 77]. Moreover, we have relatively few cases among never smokers and hence cannot rule out the possibility of interactions between smoking and BMI in relation to incidence of lung cancer.

Elevated BMI was associated with an increased risk of incidence of cancers of certain sites. Given that the prevalence of obesity is increasing globally, there is an urgent need to understand the mechanisms linking obesity and cancer, and to develop strategy to prevent cancers.