Perihilar cholangiocarcinoma (PHC) is a rare disease with a dismal prognosis [1, 2]. Radical surgery, consisting of a combined extrahepatic bile duct and partial liver resection, is the only curative treatment [3]. Despite various imaging techniques used for preoperative staging including state-of-the-art computed tomography (CT) or magnetic resonance imaging (MRI) scans, up to 47 % of patients have locally advanced or metastatic disease at surgical exploration [4, 5]. Staging laparoscopy (SL) prior to exploration may detect small liver metastases or peritoneal metastases that are frequently undetectable on routine CT or MRI scans. Additional SL may therefore prevent unnecessary laparotomy and associated postoperative morbidity or even mortality. However, the diagnostic yield of SL for PHC and its accuracy to detect unresectable disease remain unclear with varying results reported in the literature [68]. A recent study even found a decreasing diagnostic accuracy of SL over a period of 17 years in a single academic institution, possibly as a result of improved imaging techniques during the past decade [6]. As the place of routine SL in the preoperative staging of PHC is under debate, the aim of the present study was to define its current role by conducting a systematic review and meta-analysis of all available literature.

Materials and methods

A systematic review was conducted by two independent authors (R.J.S.C. and A.T.R.) according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement [9]. A study protocol was followed which defined the study objectives, eligibility criteria, outcome measures, search strategy and methodology of analysis (Appendix 1, ESM).

Eligibility criteria

Both retrospective and prospective studies providing data on the diagnostic accuracy of SL (with or without additional laparoscopic ultrasound) in patients with PHC were considered for inclusion. Case reports, reviews, studies with less than 10 patients and patients with gallbladder carcinoma or intrahepatic cholangiocarcinoma were excluded. Findings at exploratory laparotomy and pathological examination were considered as reference standard for staging, except when laparoscopy detected biopsy-proven metastatic lesions, locally advanced tumors or benign disease.

Outcome measures

Primary outcome measures were the overall yield of SL and diagnostic accuracy in terms of sensitivity to detect unresectable disease. The yield represents the number of patients (expressed as a percentage of all patients that undergo SL) that are withheld from an unnecessary laparotomy. The yield thereby reflects the proportion of patients that benefit from the SL procedure. Specificity was not considered as an endpoint as the specificity of SL is always 100 % since there are no false positives; laparoscopy and the reference standard are the same if histological examination during SL is positive.

As secondary analysis, the yield and sensitivity were calculated for more recent studies (after 2010) and studies with at least 100 patients. Also, sensitivity was separately investigated for combined liver and peritoneal metastases, liver metastases only, peritoneal metastases only, lymph node metastases and locally advanced disease (tumor invading vascular structures or surrounding organs). The additional diagnostic value of intraoperative ultrasound (IOUS) during SL was also investigated.

Search strategy

A literature search was conducted in PubMed and EMBASE using MeSH and free text words with the aid of a clinical librarian. No language or time period restrictions were applied. Two reviewers (R.J.S.C. and A.T.R.) independently screened for relevance in titles and abstracts retrieved from the search. Selected articles were then assessed in full length by both reviewers to check the eligibility criteria. Disagreements during the search and selection process were resolved by discussion, and when needed, a third author (T.M.v.G.) was asked. The reference lists of eligible articles were checked for additional fitting papers. The search was updated until August 1, 2015.

Search in PubMed and EMBASE:

PubMed: (“Cholangiocarcinoma”[MeSH] OR “Bile Duct Neoplasms”[Mesh] OR “Klatskin’s Tumor”[Mesh] OR cholangiocarcinoma*[tiab] OR klatskin tumor*[tiab] OR klatskin tumour*[tiab] OR HCCA[tiab] OR proximal bile duct tumor* OR extrahepatic bile duct tumor* OR proximal bile duct cancer* OR extrahepatic bile duct cancer* OR proximal bile duct tumour* OR extrahepatic bile duct tumour* OR proximal biliary cancer*) AND (“Laparoscopy”[MeSH Terms] OR laparoscop*[tiab]) AND (“Neoplasm Staging”[Mesh] OR staging[tiab]).

EMBASE:

(exp bile duct carcinoma/or exp bile duct tumor/or Klatskin tumor/or (cholangiocarcinoma* or klatskin tumor* or klatskin tumour* or HCCA or proximal bile duct tumor* or extrahepatic bile duct tumor* or proximal bile duct cancer* or extrahepatic bile duct cancer* or proximal bile duct tumour* or extrahepatic bile duct tumour* or proximal biliary cancer* OR bile tract carcinoma* OR biliary carcinoma* OR biliary duct carcinoma* OR biliary tract carcinoma*).ti,ab,kw.) AND (exp laparoscopy/or laparoscop*.ti,ab,kw.) AND (cancer staging/or staging/or staging.ti,ab,kw.).

Data collection

Data were independently extracted by the two reviewers using a spreadsheet. Data were collected taking into account author and institution, publication date, study period, study design, number of patients undergoing laparoscopy, number of completed procedures, number of patients undergoing laparoscopic intraoperative ultrasound (IOUS), number of avoided laparotomies, total number of patients with unresectable disease, number of liver, peritoneal and lymph node metastases, locally advanced disease, other reasons for unresectability, true positives, true negatives, false positives, false negatives, complications following SL and time interval between SL and laparotomy. Previously reported smaller series by the same institution in the same study period were considered as duplicates and excluded.

The yield was calculated by dividing the total number of avoided laparotomies by the total number of laparoscopies. Sensitivity was calculated by dividing the total number of avoided laparotomies by all patients with unresectable disease. Likewise, for calculating the sensitivity of SL for detecting specific lesions (e.g., peritoneal metastases) the total number of those specific lesions detected by SL was divided by all of those specific lesions at both SL and laparotomy.

Quality assessment

Methodological quality of included studies was independently assessed by the two reviewers using the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) tool, which is specifically developed for systematic reviews of diagnostic accuracy studies [10]. Risk of bias and applicability concerns were assessed for each included study using this tool, which is integrated in the RevMan software (Review Manager version 5.3. Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration, 2014).

Statistical analysis

Yield and diagnostic accuracy results from individual studies were graphically presented by plotting the sensitivity estimates [with 95 % confidence intervals (CIs)] using StatsDirect version 2.8.0 (StatsDirect statistical software. http://www.statsdirect.com. England: StatsDirect Ltd. 2013) and Meta-DiSc version 1.4 (XI Cochrane Colloquium, Barcelona, 2006), respectively. Heterogeneity among studies was tested using Cochran’s Q-test, and the amount of variation by heterogeneity was reflected by the inconsistency index value (I 2). A I 2 value above 75 % was considered as substantial heterogeneity. Results of individual studies were pooled, and summary estimates were calculated using a random-effect model (DerSimonian–Laird) in all cases. Individual studies lacking data to obtain sensitivity rates were excluded from meta-analysis.

Results

Search results

One hundred and seventy-three records, excluding duplicates, were retrieved from the electronic database search. A total of 12 studies were included in the analysis. Figure 1 shows a flowchart of the search and selection process.

Fig. 1
figure 1

Flowchart of study selection process

Study characteristics

Eleven retrospective studies and one prospective study were identified providing an all-Western population of 832 PHC patients who had undergone SL between 1992 and 2014 (Table 1). The individual study population ranged from 10 patients in the smallest series to 175 in the largest and one of the more recent series. Except for one study in which 6 patients were included that appeared to have unresectable gallbladder carcinoma (3 at SL and 3 at laparotomy) [11], all studies provided specific data from a PHC-only study cohort. Almost all SL procedures (98 %) were completed (Table 1). Four studies reported morbidity rates (0–3 %) following SL, and the exact time interval between SL and laparotomy was reported in only three studies (range 6.5–37 days). One study described that almost all patients had undergone laparotomy immediately following SL [7]. Laparoscopic IOUS, whether performed in all patients or not, was reported in 9 studies (Table 2). Table 3 provides an overview of preoperative imaging performed in each study.

Table 1 Characteristics of included studies, listed according to time period
Table 2 Overview of yield and sensitivity of SL among included studies
Table 3 Overview of preoperative imaging among included studies

Methodological quality of included studies

Results of the quality assessment of included studies using the QUADAS-2 tool are presented in Figs. 2 and 3. Two studies were judged as low risk of bias and low applicability concern in all domains [7, 12]. The SL procedure was well described in most studies. Two studies were judged as high risk of bias regarding flow and timing as the interval between index test (SL) and reference test (explorative laparotomy) was relatively long compared to other studies, although most studies did not report this time interval [6, 11].

Fig. 2
figure 2

Risk of bias and applicability concerns for each included study

Fig. 3
figure 3

Risk of bias and applicability concerns presented as percentages across the included studies

Synthesis of outcome

The yield of SL in patients with potentially resectable PHC varied from 6.4 to 45.0 % among the 11 studies (Table 2). Pooled yield was 24.4 % (95 % CI 16.4–33.4) with a high level of heterogeneity (I 2 = 87.3 %; Fig. 4). Subgroup analysis revealed that the yield was lower for studies published beyond 2010 (18.6, 95 % CI 8.1–32.2, I 2 = 91.9 %) and slightly higher for studies with more than 100 patients (28.2, 95 % CI 14.0–45.1, I 2 = 93.5; Table 4).

Fig. 4
figure 4

Meta-analysis of yield of SL in PHC among included studies

Table 4 Subgroup analysis

The diagnostic accuracy of SL in terms of overall sensitivity to detect unresectable disease could be retrieved from 11 of 12 studies and varied from 31.6 to 75.0 % (Table 2). The pooled sensitivity was 52.2 % (95 % CI 47.1–57.2) with a moderate level of heterogeneity (I 2 = 69.8 %; Fig. 5A). Data on liver and peritoneal metastases were provided by 7 studies, whereas 8 studies specified the number of nodal metastases and locally advanced disease (Table 5). Staging laparoscopy had the highest sensitivity to detect peritoneal metastases (80.7 %, 95 % CI 70.9–88.3, I 2 = 59.0 %), and the sensitivity for liver metastases was 59.0 % (95 % CI 42.1–74.4, I 2 = 40.3 %; Fig. 5C, D). The pooled sensitivity for both liver and peritoneal metastases among 8 of 12 studies was 76.7 % (95 % CI 69.4–82.9, I 2 = 49.4 %; Fig. 5B). The overall sensitivity to detect unresectable disease in studies beyond 2010 tended to be lower, whereas a less clear difference was seen for studies with more than 100 patients. Results of all subgroup analysis are shown in Table 4.

Fig. 5
figure 5

Meta-analysis of sensitivity of SL to detect unresectable disease and specific lesions in PHC among included studies

Table 5 Reasons for unresectability at SL and laparotomy among included studies

The additional value of IOUS during SL could be evaluated from 6 studies. In the remaining 3 studies, insufficient data were available to investigate the additional value. In only two of the 6 studies, all patients had undergone laparoscopic IOUS [13, 14]. Two studies showed beneficial value of IOUS. In one of the studies where all patients were subjected to IOUS during SL, the yield increased with 17 % with the use of IOUS, which was mainly due to identification of locally advanced disease [13]. Only one patient in this study had deep liver metastases detected by IOUS. In another study, 50 % (2/4) of unresectable lesions were identified by IOUS and these were locally advanced tumors [15]. In the remaining studies that provided data on IOUS, no additional value was found as only 1 patient from these studies was detected with unresectable disease who did not also have unresectable disease on laparoscopic inspection alone [7, 11, 14, 16]. Furthermore, in two of these studies, all locally advanced tumors were missed at laparoscopic ultrasonography assessment [14, 16]. Due to lack of most data, pooling of IOUS results could not be performed.

Discussion

From 12 studies reporting on the additional value of SL in 832 patients with PHC, we demonstrated that in 1 out of 4 patients undergoing SL, an unnecessary laparotomy was avoided. The overall sensitivity of SL to detect unresectability was relatively low (52 %), but subgroup analysis revealed that sensitivity for liver and peritoneal metastases was reasonable (77 %). The latter could be explained by a good sensitivity of SL for detecting peritoneal metastases only (81 %).

Despite extensive preoperative staging, still almost half of patients appear to have locally advanced tumors or metastases (liver, peritoneal or nodal) upon exploratory laparotomy. Liver and peritoneal metastases are readily detectable at SL, as was shown in our meta-analysis, but they are the reason for unresectability in only up to a third of all PHC cases [4]. Most studies in this systematic review showed poor accuracy to detect locally advanced disease or nodal metastases. These numbers explain why only a subset of patients actually benefits from this additional procedure. Furthermore, preoperative imaging with CT and MRI has drastically improved in the last decade making it easier to detect (extra)hepatic metastases, lymph node metastases and locally advanced disease [17]. Only smaller lesions may remain that are also easily missed at SL. Accordingly, our analysis showed that the yield and sensitivity in studies published after 2010 tended to be lower. Preoperative imaging was more extensive in these studies. So far, only two studies have shown any factors (i.e., Blumgart T-stage) that increase the diagnostic yield in these patients [7, 13], whereas several other predictors (e.g., CA19-9, tumor size) of unresectable disease at SL have been identified and validated for pancreatic cancer patients [1823].

The value of IOUS during SL seems questionable as only the minority of studies reported any additional value in detecting local unresectability and metastatic disease. Laparoscopic ultrasound may identify less superficial liver metastases or tumor involvement of branches of the portal vein or hepatic artery, which would be impossible without exploration of the hilum, but the studies in this review provided insufficient data supporting its added value.

Ideally, SL and laparotomy are performed in a single session, to avoid two hospital admissions and two surgical procedures. In many centers, however, SL is performed separately from subsequent laparotomy due to logistical reasons related to anesthetic and operating room time planning. Furthermore, it is convenient to perform SL in an early phase as rapid detection of unresectability allows timely start of palliative care (metal biliary stents and chemotherapy). Detection of unresectability at laparotomy would otherwise require time for adequate biliary drainage and/or time for hypertrophy to occur after portal vein embolization. Two studies in our systematic review, however, reported a median interval time between SL and laparotomy of more than 5 weeks in which time small undetected lesions potentially have become large enough to be detected at laparotomy. This may partly explain a low sensitivity in one of these studies [6]. Another reason to perform SL prior to laparotomy in separate sessions is that assessment of biopsies may not be conclusive on frozen-section examination and require time for definitive histopathological diagnosis.

A previous systematic review on staging laparoscopy in proximal bile duct tumors also included a substantial number of gallbladder cancer patients from these series and even studies with gallbladder carcinomas only [8]. As gallbladder carcinomas are considered as a distinct entity that more frequently metastasizes to the liver or peritoneum, adding these patients to the current meta-analysis would lead to overestimating of the diagnostic value of SL in PHC [24, 25]. Moreover, our updated analysis on the topic includes five large recent studies comprising more than half of the total review cohort. We did include one large study in a PHC cohort in which three patients appeared to have an unresectable gallbladder carcinoma at SL and three patients at laparotomy [11]. As it was not possible to exclude only these six patients from analysis, we chose not to exclude the whole study as these patients had been diagnosed on preoperative imaging as having a PHC, which reflects clinical practice. In another study included in our analysis, the performance of a pancreaticoduodenectomy in some cases may suggest distal cholangiocarcinomas that were preoperatively diagnosed as Bismuth type 1 or 2 perihilar tumors [15].

Several limitations may apply to our meta-analysis. Firstly, the number of studies that could be included in the review was low. Relatively few centers worldwide have extensive experience with the surgical management of PHC and not every center routinely performs SL for these tumors. Remarkably, we analyzed an all-Western study population and no studies were retrieved from Asia where the highest incidence of PHC is found. Secondly and most importantly, our meta-analysis showed a significant amount of heterogeneity. The variation in outcomes between the studies was probably caused by differences in cohort size, time interval between SL and laparotomy and time period (only 5 study cohorts included patients after 2010). Nonsignificant, moderate heterogeneity was observed for pooled estimates of sensitivity to detect combined liver and peritoneal metastases and liver metastases only. Significant heterogeneity for most other outcomes persisted even after performing subgroup analyses in studies with more than 100 patients or studies published after 2010. Unfortunately, some studies could not be included in the sensitivity subgroup analysis as some data were unavailable in order to allow for profound analysis. This may have biased the sensitivity rates to detect liver and peritoneal metastases. Apart from possible flaws in flow and timing of the index test (SL) among studies, as mentioned previously, the overall methodological quality of studies assessed with the QUADAS-2 tool was reasonable.

In conclusion, results from this systematic review suggest that 1 in 4 patients with PHC benefits from SL with the highest sensitivity particularly for detecting peritoneal metastases. However, due to considerable heterogeneity among available studies, pooled estimates should be carefully interpreted. As the yield and sensitivity of SL may decrease over years with further improvement of preoperative imaging techniques, the utility of this additional staging modality may further diminish, thereby discouraging its routine use. Large studies that identify predictors of unresectable disease at SL, that can be used to select PHC patients who may benefit most from this procedure, are warranted.