Introduction

Standard treatment of high-risk non-muscle-invasive bladder cancer (NMIBC) is complete transurethral resection (TURB) followed by adjuvant intravesical immunotherapy with bacillus Calmette–Guérin (BCG) [1, 2]. Despite adequate treatment, around 40% of patients with T1 urinary bladder cancer (UBC) will experience disease recurrence and around 20% disease progression, respectively [2,3,4]. Identifying the patients who are most likely to recur or progress during or after adjuvant BCG would be highly valuable in the clinical decision-making process and patient counselling. These patients could be offered alternative treatment strategies such as early/immediate radical cystectomy [3]. To address this unmet need, an effort has been put into the development of prognostic models and risk stratification tools such as those from the European Organization for Research and Treatment of Cancer (EORTC) and the Club Urológico Español de Tratamiento Oncológico (CUETO) [5, 6]. However, the performance of these models is still under debate, in part because they do not take in account the heterogeneous genomic landscape and mutational burden of the disease [7], or other adverse pathologic features such as lympho-vascular invasion and variant histology [8,9,10,11,12]. Moreover, sex-based differences in biology, epidemiology, and outcomes of UBC are well documented in muscle-invasive UBC [13, 14]. However, the association of patients’ sex with treatment outcome to BCG therapy is still controversial [5, 7, 15, 16] and has so far been implemented only in the CUETO risk tables [5].

To fill this gap in knowledge, we investigated the impact of sex on clinical outcomes of patients treated with TURB and BCG for T1 G3/high-grade (HG) UBC.

Methods

Study population

We analyzed the data of 2635 patients treated with adjuvant intravesical BCG for T1 UBC between 1984 and 2019 within a multicenter collaboration including 25 centers across Europe and the United States of America.

TURB and adjuvant BCG instillations

A second look TURB was performed at the physician’s discretion based on pathologic and intraoperative findings. All surgical specimens were processed according to standard pathologic procedures and staged according to the TNM classification. Tumor grade was assigned according to the 1973 and/or 2004 World Health Organization system.

Metastatic disease and concomitant upper urinary-tract urothelial carcinoma were excluded using radiological imaging.

Due to the retrospective nature of the study, the indication and duration for adjuvant BCG therapy were given at the physician's discretion. An adequate BCG treatment was defined as the administration of at least five of six doses of an initial induction course plus at least two of three doses of maintenance therapy or the administration of at least five of six doses of an initial induction course plus at least two of six doses of a second induction course [17].

Follow-up

Due to the retrospective nature of the study, follow-up was not standardized but rather performed according to institutional standards and based on guidelines at the time and at the physician’s discretion.

Outcome measurement

The primary endpoint was the association of patients’ sex with disease recurrence, during BCG therapy or after completion of BCG therapy. The secondary endpoint was the association of patients’ sex with disease progression to MIBC.

The “time to event” was calculated as the time between first pT1G3/HG diagnosis and a histopathological confirmed recurrence or progression with TURB. Patients who did not develop a recurrence during the follow-up were censored at their last follow-up visit.

Statistical analyses

Statistical analyses were performed in different steps. First, to account for missing baseline data that were assumed to be missing at random, we performed multiple imputations by using chained equations. Thirty imputed data sets were generated using predictive mean matching for numeric variables, logistic regression for binary variables, and Bayesian polytomous regression for factor variables. Second, we used inverse probability weighting (IPW) to reduce the bias of unweighted estimators and adjust for covariates imbalance between males and females. The variables used to estimate the inverse probability weights were age, smoking status, exposure to chemical compounds, previous low-grade UBC, previous intravesical therapy, presence of detrusor muscle in the TURB specimen, G3 grade, multifocal T1 UBC, tumor size > 3 cm, concomitant CIS, second look TURB, number of BCG induction cycles and number of BCG maintenance cycles. Post-weighting balance in covariates was evaluated by using standardized mean differences (supplementary figure S1). Third, unweighted and IPW-adjusted univariable Cox proportional hazard regression analyses were used to estimate the hazard ratios (HR) with their 95% confidence intervals (CI) for the association of patients’ sex with recurrence and disease progression. Fourth, we estimated recurrence and disease progression rates between groups using cumulative incidence curves.

Finally, we performed subgroups analyses in the weighted population using IPW-adjusted Cox proportional hazard regression analyses investigating the association of sex with disease recurrence, HG-recurrence, and disease progression in patients adequately treated with BCG. Outcome estimates were graphically visualized using cumulative incidence curves.

Statistical significance was considered at p < 0.05. All tests were performed with R (R Foundation for Statistical Computing, v3.5.1).

Results

A total of 2170 (82%) males and 465 (18%) females were available for analysis. Unweighted and weighted clinicopathologic features of the patients, stratified by sex, are shown in Table 1. Standardized differences of unweighted comparisons showed that both groups differed significantly with respect to age, smoking status, exposure to chemical compounds and detrusor muscle in the TURB specimen. Overall, 64% of patients without muscle at first TURB underwent a second look TURB and 37% of patients with muscle at first TURB underwent a second look TURB.

Table 1 Baseline characteristics of 2635 patients treated with transurethral resection of the bladder (TURB) and adjuvant intravesical bacillus Calmette–Guérin (BCG) for pT1G3/HG urinary bladder cancer (UBC) in unweighted and weighted study population after multiple imputation

After IPW adjustment, all relevant standardized differences were less than 10%, which indicated that clinico-pathologic features between groups were subsequently comparable.

The median follow-up for alive patients in the cohort was 50 months (IQR 26–88). Within this period, 1090 (50%) males and 244 (52%) females experienced a recurrence, and 391 (18%) males and 104 (22%) females experienced a disease progression. On unweighted univariable Cox regression analyses, female sex was associated with disease progression (crude HR 1.24, 95%CI 1.001–1.54, p = 0.04) but not with recurrence (crude HR 1.07, 95%CI 0.93–1.23, p = 0.34). On IPW-adjusted univariable Cox regression analyses, female sex was also associated with disease progression (HR 1.25, 95%CI 1.01–1.56, p = 0.04) but not with recurrence (HR 1.06, 95%CI 0.92–1.22, p = 0.41) (Fig. 1).

Fig. 1
figure 1

Cumulative incidence curves for the association of patients’ sex with time to disease recurrence (a) and progression in 2635 patients treated with transurethral resection of the bladder (TURB) and adjuvant intravesical bacillus Calmette–Guérin (BCG) for T1G3/HG urinary bladder cancer

Overall, 544 patients, 436 (20%) males and 108 (23%) females, were treated with radical cystectomy for BCG failure. Data on pathologic T-stage were available for 506 patients. Within this group, 146 (36%) males and 34 (34%) females had non-organ confined disease (pT3/pT4 and/or positive nodal stage, p = 0.65).

A total of 1056 patients, 871/2170 males (40%) and 185/465 females (40%), were treated with adequate BCG. Of these, 52% (454) of the males and 54% (99) of the females had recurrences, 38% (328) of the males and 40% (74) of the females had HG-recurrence and 17% (151) of the males and 20% (37) of the females had disease progression. On unweighted univariable Cox regression analyses, patients’ sex was not associated with disease recurrence (crude HR 1.01, 95%CI 0.81–1.26, p = 0.91), HG-recurrence (crude HR 1.02, 95%CI 0.79–1.31, p = 0.87) or disease progression (crude HR 1.10, 95%CI 0.77–1.58, p = 0.60). On IPW-adjusted Cox regression analyses, patients’ sex was not associated with disease recurrence (HR 0.99, 95%CI 0.80–1.24, p = 0.96), HG-recurrence (HR 1.00, 95%CI 0.78–1.29, p = 0.99) or disease progression (HR 1.12, 95%CI 0.78–1.60, p = 0.55) (Fig. 2).

Fig. 2
figure 2

Cumulative incidence curves for the association of patients’ sex with time to disease recurrence (a) high-grade (HG) recurrence (b) and disease progression (c) in 1056 patients treated with transurethral resection of the bladder (TURB) and adequate adjuvant intravesical bacillus Calmette–Guérin (BCG) for T1G3/HG urinary bladder cancer

Discussion

In a retrospective analysis of a large multicenter international dataset of patients treated with TURB and adjuvant intravesical BCG for T1G3/HG UBC, we found an association of female sex with disease progression. However, this difference disappeared in patients adequately treated with BCG.

The difference in incidence and oncologic outcomes of UBC has been widely reported in the literature14. These dissimilarities have been partially explained by the activity of the sex steroid hormone pathways, differences in the quality of treatment, and sex-specific differences in immunity [18,19,20,21]. However, results across studies are controversial and are mainly related to muscle-invasive UBC.

In NMIBC, the bulk of the evidence presented by the current literature questions the prognostic value and differential response to BCG between males and females.

The CUETO and the 2016 updated EORTC risk tables are the two most used prognostic tools in clinical decision making for patients with UCB treated with BCG [5, 6]. Because patients’ sex was found to be associated with disease recurrence in the CUETO data, sex has been implemented in their model. Neither the CUETO nor the EORTC studies found an association between patient’s sex and disease progression.

The association of patients’ sex with oncologic outcomes of T1 UBC has been investigated in several other retrospectives series, with similar results.

In a retrospective series of 146 patients and in a subgroup analysis of 234 out of 916 patients treated with TURB and adjuvant intravesical BCG, female sex was not found to be an independent predictor of disease recurrence and progression [15, 16].

There are several factors that should be considered when contextualizing our study with these results. In the study by Palou et al. [15], patients were treated with TURB, which was clinically judged complete by the surgeon, and did not undergo a second look TURB. Also in the study by Kluth et al. [16], data on second look TURB were not available. Despite the presence of detrusor muscle in all the specimens, which is considered a surrogate marker for the quality of the resection [22, 23], the residual disease could not be ruled out. Complete resection is essential for the optimal outcomes as it impacts prognosis and adjuvant therapy response in patients with NMIBC [24].

Both series could be biased by the relatively small number of patients and events. In a larger single-center retrospective series of 1021 patients treated with TURB and induction BCG, authors did not find an association of patients’ sex with disease recurrence or progression [25]. However, in this series, only 40% of the patients included had a T1 UBC.

We expanded upon these limitations by investigating the association of patients’ sex in a large multicenter dataset of T1 UBC. To the best of our knowledge, this is the largest series representing real-world clinical data. Moreover, our analyses adjusted relevant prognostic variables, which were not reported in other studies.

Another major limitation of the previously mentioned retrospective series is the administration of intravesical BCG therapy restricted to the induction course. Indeed, the efficacy of the treatment, particularly in patients with high-risk, is dependent on the maintenance schedule [26, 27]. We added detail on these shortcomings by investigating the association of patients’ sex with oncologic outcomes in patients adequately treated with BCG [17]. In this subgroup, the association of patients’ sex with disease progression could no longer be observed. This generates the hypothesis that an adequate BCG treatment might equalize the effect of patients’ sex on therapy response. However, the number of patients in this subgroup might have been too small to detect a statistically significant difference.

Our study is not devoid of limitations, which are mainly inherent to its retrospective design. During the large time span of study treatment modalities such as endoscopic image enhancement [28], changes in local clinical practices at each site and imaging modalities used for surveillance may have changed significantly. We had no information on the BCG strain used. However, there is no evidence on the differential effectiveness of one strain compared to another [29, 30], passaging and sub-culturing over the years may have changed the virulence of mycobacteria and immunological response in the host [31]. We had histopathology information at disease recurrence only for the first recurrence. Therefore, we could not assess whether the patient developed an HG-recurrence after a first low grade recurrence. Finally, we acknowledge the limitation of a missing central pathology review and the lack of information on histologic variants and lymphovascular invasion. Moreover, histopathological examination was performed according to institutional standards at each center, which could have led to heterogeneous results.

Despite all these limitations, our study has relevant clinical implications. Given the current world-wide BCG shortage, accurate selection of patients who are more likely to respond to the therapy is of paramount importance to avoid overtreatment, reduce complications, and drug wastage. We analyzed a cohort of NMIBC patients treated with BCG with the highest risk of disease recurrence and progression and found an association of female sex with disease progression. This evidence may help physicians during patient counselling regarding adjuvant therapies or early cystectomy and follow-up scheduling.

Conclusion

Our analysis generates the hypothesis of differential oncologic outcomes in female compared to male patients if not adequately treated with BCG for T1G3/HG UBC. Further studies should focus on sex-based differences in innate and adaptive immune system and their association with BCG response. Moreover, these differences should be considered during trial planning for novel immunotherapy agents in NMIBC.