Introduction

Cytokines such as tumour necrosis factor (TNF)-α, interleukin (IL)-1 and IL-6 play a fundamental role in the pathogenesis of rheumatoid arthritis (RA) [1]. Biological disease-modifying anti-rheumatic drugs (bDMARDs) targeting these cytokines have resulted in improved outcomes among patients with RA resistant to traditional synthetic DMARD (sDMARDs). Drugs which inhibit TNF (tumour necrosis factor inhibitors (TNFi)) were the first available biologic therapies [2], but the choice has expanded, and since 2010, tocilizumab (TCZ), a humanised monoclonal antibody against the IL-6 receptor, has been available for use in RA. In the UK, the drug is approved by the National Institute for Health and Care Excellence (NICE) for use both as a first-line biologic therapy after failure of two sDMARDs and as a subsequent-line biologic therapy after failure of TNFi in patients with severe RA (DAS28 >5.1) [35].

With an expanding choice of biologic therapies for use in patients resistant to sDMARDs, there is little information on how these therapies are being selected in clinical practice, as well as a lack of data regarding comparative effectiveness between therapies to help clinicians guide choice of therapies. There also has been little evidence from routine care regarding the factors associated with the optimal effectiveness of TCZ. For example, the impact of prior bDMARD exposure history and concomitant use of methotrexate (MTX) on the effectiveness of TCZ have not been assessed sufficiently.

Whilst a randomised controlled trial (RCT) (ADACTA study) suggested the superiority of TCZ monotherapy in reducing DAS28 compared to anti-TNF adalimumab (ADA) monotherapy among bDMARD-naïve patients with inadequate response to MTX [6], there have been no other RCTs comparing the efficacy of TCZ with TNFi directly. Three cohort studies (two retrospective and one prospective) have compared the effectiveness of TCZ and TNFi, which also suggested the superiority of TCZ [79]; however, there is a lack of large comparative effectiveness studies in routine clinical settings.

There is also limited or inconsistent information regarding factors associated with better response to TCZ. Recent prospective studies of patients who started TCZ have suggested that TCZ may be more effective for patients when used as a first biologic compared to those patients receiving the drug after failure of other biologics [1012]. While evidence has suggested that TNFi may be more effective when used in combination with MTX, this is less clear for TCZ. Two clinical trials (the ACT-RAY study and the SURPRISE study) and three cohort studies suggested that concomitant MTX may lead to favourable treatment outcomes [10, 1316]; however, a retrospective study, a European collaborative study and ACT-LIFE study from routine clinical settings suggested no difference in effectiveness of TCZ when combined with MTX [11, 13, 17, 18]). Limited evidence from routine use is currently available.

Further information regarding the “real-world” effectiveness of TCZ when used at different points in the RA treatment pathway will provide better evidence on which to frame healthcare decisions. The aims of this analysis are therefore to (1) describe characteristics of patients with RA selected for TCZ in the UK, (2) compare the real-world effectiveness of TCZ to TNFi over the short term (first 6 months) when used as a first biologic and (3) assess the influence of concurrent MTX therapy and past biologic exposure on short-term treatment outcomes following TCZ.

Methods

Patients

The British Society for Rheumatology Biologics Register for RA (BSRBR-RA) is a national prospective cohort study collecting information on patients aged 16 or older with a physician’s diagnosis of RA starting biologic therapies. This analysis included all either patients registered with the BSRBR-RA starting TCZ (either as their first biologic (first-line TCZ cohort) or those who had already received biologic therapies and were switching to TCZ (subsequent-line TCZ cohort) or starting TNFi therapy (etanercept—Enbrel (ETN), infliximab—Remicade (IFX), adalimumab—Humira (ADA) or certolizumab pegol—Cimzia (CZP)) as their first biologic between 01 January 2010 and 30 November 2015.

Baseline data

At the start of bDMARD therapy, baseline data were collected from the hospital including patient demographics; smoking status (current smoker/ex- or never smoker); height (cm) and weight (kg) to calculate body mass index (BMI) (kg/m2); disease duration (years); past and current anti-rheumatic therapies; rheumatoid factor (RF) status; extra-articular manifestations of RA (EARA) including pulmonary fibrosis, sicca syndrome, serosal involvement (pleuritic/pericarditis), eye involvement, systemic vasculitis and other specified systemic features based on physicians diagnosis; 28 swollen and tender joint counts (SJC, TJC); patients global assessment visual analogue scale (VAS) (0–100 mm) erythrocyte sedimentation rate (ESR) (mm/h); and/or C-reactive protein (CRP) (mg/dl). Ever existence of comorbidities (hypertension, angina, myocardial infarction, stroke, epilepsy, asthma, chronic bronchitis/emphysema, peptic ulcer, liver disease, renal disease, tuberculosis, demyelination, diabetes, hyperthyroidism, depression or cancer) was captured using a tick list. A list of current medications was also obtained, and statin use was used as a surrogate for baseline hyperlipidaemia. In addition, a comprehensive history of prior cancer was obtained through linkage to the UK national cancer register. Patients complete the Health Assessment Questionnaire (HAQ) adapted for UK use [19].

Follow-up data

Follow-up data were captured from the hospital and patient every 6 months for 3 years and then annually from the hospital only. Hospital data include the most recent DAS28 and changes to anti-rheumatic therapy including stop reasons and adverse events. Patients complete a HAQ.

Data analysis

This study compared the baseline characteristics and treatment effectiveness at 6 months between the (1) first-line TCZ cohort versus first-line TNFi cohort, (2) first-line TCZ cohort versus subsequent-line TCZ cohort and (3) first-line/subsequent-line TCZ users with and without concomitant MTX. All data available up to 31 May 2016 were included.

Treatment effectiveness at 6 months was described using the following measures: change in DAS28 and its components, European League Against Rheumatism (EULAR) response [20], DAS28 remission [21], change in HAQ score, the proportion of patients who achieved the minimal clinically important difference (MCID) in HAQ, defined as a ≥0.22 decrease in score [22] and the drug retention rates up to year 1. DAS28 was calculated using the ESR where available. However, if only a CRP measurement was provided (27% of patients), the DAS28-CRP [23] was calculated instead (online Supplementary Table 1). Non-parametric descriptive statistics were used to compare baseline characteristics between cohorts. Regression models were used to compare EULAR response (ordinal models) and DAS28 remission (logistic regression) between treatments. Each regression model was adjusted by (1) age and sex and (2) the estimated propensity score using inverse probability of treatment weighting (IPTW) without trimming. [24] The propensity score was determined separately for each comparison. To estimate each propensity score, logistic regression models including factors showing significant differences between cohorts at baseline and the factors that may predict changes in outcome were constructed. Drug survival up to year 1 was described using Kaplan-Meier curves and compared using the log-rank test.

Missing data were accounted for in regression models using multiple imputation, using the mi package in Stata with 20 iterations. Information regarding missing data is included in online Supplementary Table 1. When it was not possible to calculate the DAS28 due to one or more missing components, the DAS28 score itself was imputed. All analyses were performed using Stata (StataCorp 2013, Stata Statistical Software: Release 13, College Station, TX: StataCorp LP).

Results

From 01 January 2010 until 30 November 2015, 2636 patients registered with the BSRBR-RA starting their first biologic (217 TCZ; 2419 TNFi: 884 ETN, 51 IFX, 542 ADA, 942 CZP), and 777 patients registered starting TCZ as a subsequent biologic therapy.

Comparison of baseline characteristics and treatment effectiveness between patients starting first-line TCZ or first-line TNFi following inadequate response to sDMARDs

In general, there were few differences between patients starting TCZ or TNFi as first-line biologic therapy (Table 1). Noted differences largely reflected background comorbidity: a history of pulmonary fibrosis was higher in the TCZ group compared to TNFi (5 vs 2%, p = 0.001), as was prior cancer (9 vs 5%, p = 0.001). The proportion of MTX use at baseline was similar between the two cohorts, but the TCZ cohort had a slightly lower percentage of previous MTX use (94 vs 96%, p = 0.05). The TCZ cohort had lower VAS score (72 vs 78 mm, p = 0.002), but the disease activity according to DAS28 was similar to the TNFi cohort (6.2 vs 6.0, p = 0.18).

Table 1 Baseline characteristics of patients initiating first-line TNFi, first-line TCZ and subsequent-line TCZ cohort

Compared with the first-line TNFi cohort, the first-line TCZ cohort showed significantly lower DAS28 at 6 months (3.0 vs 3.5, p = 0.003) (Table 2) and higher absolute change in DAS28 (2.9 vs 2.4, p < 0.001). The percentage of good responders was slightly higher in the TCZ cohort than in the TNFi cohort (49 vs 42%); however, after adjustment using IPTW by propensity score, there was no difference in odds of achieving a higher EULAR response category. The percentage of patients who achieved DAS28 remission was higher in the first-line TCZ cohort compared to the TNFi cohort (42 vs 28%, p < 0.001), which remained significant after adjustment (adjusted odds ratio (OR) 1.86 (1.25–2.78)). However, an analysis of the individual components of DAS28 showed that the reduction in ESR was greater with TCZ (median ESR change 16.0 vs 7.0 mm/h, p < 0.001), but changes were similar in the other components of the DAS28 (online Supplementary Table 2). HAQ score and the proportion with MCID achievement at 6 months were also similar between the two cohorts (Table 2). Drug survival curves are shown in Fig. 1. The estimated 1-year survival was 0.67 (95% CI 0.65–0.69) for first-line TNFi users and was 0.77 (95% CI 0.70–0.82) for the first-line TCZ users. This difference was not statistically significant (p = 0.20).

Table 2 Changes in disease activity, disability and treatment response at 6 months following start of therapy
Fig. 1
figure 1

Drug survival curves of the first-line TNFi cohort, the first-line cohort and the subsequent-line TCZ cohort

Comparison of baseline characteristics and treatment effectiveness between patients starting TCZ as a first-line versus subsequent-line biologic

Compared with the first-line cohort, the subsequent-line cohort had a significantly longer disease duration (12 vs 4 years, p < 0.001), slightly better disease activity (median DAS28 5.7 vs 6.2, p < 0.001) and lower functional capacity (median HAQ 2.0 vs 1.6, p < 0.001) (Table 1). There were higher proportions of any extra-articular manifestations (30 vs 15%, p < 0.001), any comorbidities (73 vs 62%, P = 0.003), reported depression (29 vs 20%, p = 0.001), hypertension (37 vs 25%, p = 0.003) and joint replacement surgery history (30 vs 15%, P < 0.001), but the proportions of pulmonary fibrosis were similar. Among subsequent-line TCZ cohort, most patients had received two or more prior bDMARDs (41% had two, 31% had three or more). Ninety-three percent had received at least one prior TNFi, and 55% had also received rituximab.

Compared to the first-line TCZ cohort, a lower proportion of the subsequent-line cohort achieved good EULAR responses and DAS28 remission (Table 2). However, after adjusting using IPTW by propensity score, there was no significant difference in the odds of achieving either better EULAR response or DAS28 remission between groups. In the subsequent-line cohort, change in HAQ score at 6 months was significantly smaller than in the first-line cohort (median delta HAQ 0.0 vs 0.3, p = 0.002) and a lower percentage of patients who achieved MCID (34 vs 49%, p < 0.001). Up to year 1, the drug retention rate for the subsequent-line TCZ users was estimated to be 0.71 (95% CI 0.67–0.74), which was similar to the first-line TCZ users (p = 0.23) (Fig. 1).

Association between concurrent MTX use and response to TCZ

Within patients starting TCZ either as first or subsequent-line therapy, those who receive concomitant MTX therapy were younger and had fewer extra-articular manifestations and pulmonary fibrosis (online Supplementary Table 3).

After adjustment using propensity score, no significant difference in the proportion who met the EULAR response criteria or DAS28 remission was observed between MTX users and non-users, both in the first-line and subsequent-line cohorts, although low numbers in the first-line cohort resulted in wide confidence intervals (Table 3). At 6 months, functional activities were also similar between the cohorts. The drug survival curves were also similar between patients receiving TCZ with and without concomitant MTX therapy (Fig. 2a, b). The retention rates were estimated to be 0.77 (95% CI 0.68–0.84) for the first-line TCZ users with MTX and 0.76 (95% CI 0.62–0.85) for TCZ monotherapy (p = 0.10), 0.70 (95% CI 0.66–0.74) for the subsequent-line TCZ users with MTX and 0.70 (95% CI 0.64–0.76) for TCZ monotherapy (p = 0.50), respectively.

Table 3 Treatment response comparison between the MTX users versus non-users
Fig. 2
figure 2

Drug survival curve comparison between the MTX users versus non-users. a Comparison among the first-line TCZ users. b Comparison among the subsequent-line TCZ users

Discussion

This is the first study both to describe the clinical characteristics of patients selected to receive TCZ as a first-line therapy in routine clinical use and to examine its effectiveness as a first-line and subsequent-line therapy. A majority of patients (78% of TCZ cohort) were starting this treatment as a subsequent-line biologic in the UK. In part, this may reflect the timing of UK approvals for use, which did not extend to first-line use until 2012, although even after this date, the majority of patients recruited were not starting TCZ as a first-line therapy.

Although similar with respect to disease history and disease activity, compared to patients starting TNFi as their first biologic, there were apparent differences in the comorbidity profile of patients starting TCZ, which may have influenced the choice of therapy. This was most noted with histories of pulmonary fibrosis and cancer. Although recent studies with long follow-up periods suggest that patients using TNFi are not at increased risk of interstitial pneumonia [25] or cancer development [26], several prior studies had indicated a possible link between TNFi usage and increasing risk of developing interstitial pneumonia and/or cancer [2729] and both are included as relative contraindications to TNFi in current UK guidelines [29]. Despite the possible association between TCZ use and increases in fasting lipid levels found during clinical trials [30], we did not see any difference in baseline history of hyperlipidaemia or cardiovascular disease.

This analysis has suggested a superior effectiveness of TCZ compared to TNFi as a first bDMARD, particular in relation to the proportion of patients who achieved DAS28 remission. These results are similar to the ADACTA study (40% remission in the first-line TCZ users vs 11% in first-line ADA users) [6] and a retrospective study from Germany (44% in first-line TCZ users vs 30% in first-line TNFi users) [7]. A national registry in Portugal showed higher percentage of DAS remission in a first-line TCZ cohort (71% in first-line TCZ users vs 24% in first-line TNFi users, p < 0.001) [9]. However, it is possible that the significantly higher percentage of DAS28 remission achievement of TCZ users is explained through differences in the effects of each therapy in lowering the ESR and/or CRP, as when we compared the change in each DAS component between the first-line TCZ users and TNFi users, the improvements on the other components of DAS28 (SJC, TJC and patient VAS) were similar. A single-centre registry in Japan reported similar results in that patients who achieved DAS28 remission with TCZ showed lower mean ESR but, in fact, higher swollen joint counts [32]. Further studies focusing on other measures of inflammation besides the DAS28, such as improvements in synovitis as measured by ultrasound, are warranted. Whether this greater improvement in ESR is clinically important in terms of long-term clinical and radiographic outcomes, independent from improvements in swollen joint count, is not known. As another measure of effectiveness, we compared drug survivals of first-line TNFi and TCZ. Although there was a trend towards better survival with TCZ, this did not reach statistical significance. In addition, the analysis was not adjusted for baseline differences between patients because of the small quantity of data, and therefore, no conclusions can be drawn about the superiority of one drug over another in the longer term.

The majority of the patients in this study starting TCZ did so after one or more other biologic therapies, with many having failed >1. This will in part reflect the timing of introduction of different biologics into clinical practice. The ACT-LIFE study in routine clinical setting suggested that TCZ works better when used as a first-line therapy than as a subsequent line [11]. Our data also showed that the absolute proportion of patients achieving a good EULAR response among subsequent-line TCZ users was less than that seen among first-line users. However, after adjusting for differences in disease status at the start of the therapy, including disease duration and functional ability, these differences were no longer significant, perhaps indicating a group of patients with more resistant disease overall. These results are supported by those of a Japanese post-marketing surveillance which suggested that longer than 10-year disease duration and advanced disease (defined using Steinbrocker stage and class) were factors influencing lower treatment response at 28 weeks [10]. A further observational study also suggested that previous bDMARD exposure was not a significant predictor of treatment response of TCZ at month 6 [12], although there still remains a possibility that this difference did not reach statistical significance because of the small number of the first-line TCZ users.

In the UK, TCZ is also approved for use as monotherapy without MTX [5]. We did not see any difference in the influence of concurrent use of MTX on treatment response among the TCZ users, either as a first-line or subsequent-line biologic, which differs from that reported for TNFi [33]. While the evidence has suggested that TNFi may be more effective when used in combination with MTX, the observed benefits of concurrent MTX with TNFi are not well explained. In part, it may relate to factors associated with general patient drug tolerance and/or adherence. Studies also support a possible role for MTX in preventing anti-drug antibodies [34], although this area has been less studied with TCZ. In the current study, the sample size of the first-line TCZ users was very small, which may have prevented detection of smaller differences between MTX users and non-users. Whether the lack of benefit of MTX cotherapy with TCZ will still be apparent within larger datasets or over the longer term requires further study.

This study has a number of limitations. Firstly, the UK has restricted access to bDMARD therapy, demanding high disease activity (DAS28 >5.2). This may inflate the baseline assessments and affect clinical response, which may reduce the generalisability of the findings to other countries. However, the observed channelling of some patients towards treatments based on their comorbidity profile is less likely to differ between populations, and therefore, the treatment effectiveness data of this study could apply to patients with high disease activity in other countries.

Secondly, DAS28 was used as a primary treatment effectiveness measure. The DAS28 score might overestimate treatment effectiveness of TCZ due to the suppression of acute phase reactants, which may explain in part the greater lowering of ESR among patients receiving TCZ compared to TNFi. TCZ blocks IL-6 binding to its receptor, thereby preventing IL-6-mediated signal transduction, which alters the normalising of inflammatory markers (ESR and CRP) [35]. Several studies have reported that there is a discrepancy between DAS28 remission and Clinical Disease Activity Index (CDAI) remission (which does not include an acute-phase reactant component) [3537]. Kawashiri et al. recommended that CDAI should be used as a disease activity measure for TCZ users [36]. However, the BSRBR-RA did not collect physician global assessment, a component of CDAI. Additionally, although DAS28-CRP tends to be lower than DAS-ESR and lower remission criteria are suggested [38], we were unable to distinguish DAS28-ESR and DAS28-CRP in the BSRBR-RA cohort, as the data were not always captured. However, in this study, other than DAS28, the drug survival curves and functional activity were also assessed as a measure of treatment effectiveness and did not identify any difference between treatments. Thirdly, this study might be affected by incomplete data. In approximately 20% of patients, the 6-month disease activity and HAQ data were missing, which is common in large observational longitudinal studies. To minimise response bias and attrition bias, multiple imputation model was used in this study. However, the effects of residual or unmeasured confounding factors could not be avoided. Although we compared the changes in disease activity and disability between therapies and therapy regimens, we did not compare the rates of adverse events. The ACT-RAY study did suggest a non-significant increase in the occurrence of raised liver enzymes among patients receiving TCZ in combination with MTX [14]. This study focused on short-term outcomes, but as further follow-up is accrued, future studies should look at the longer-term tolerability and maintenance of effect, including the risk of adverse events. It would also be interesting to assess the effectiveness of subsequent bDMARDs after primary failure with TCZ therapy. Although a recent retrospective study suggested that TNFi could be superior than abatacept for the patients showing inadequate response to TCZ [39], there has been sparse evidence in the bDMARD selection after TCZ. Our data suggest that any study of comparative drug safety or long-term tolerability must also take into account differences in the proportions of baseline comorbidities, particularly pulmonary disease.

In conclusion, this national cohort study of real-world TCZ use has shown that with an expanding choice of therapies for patients who are sDMARD inadequate responders, similar improvements in disease activity and disability are seen between patients starting either TCZ or TNFi, with the exception of a greater improvement in ESR among TCZ users. The proportion of responders among patients who start TCZ following an inadequate response to one or more previous biologic therapies is lower compared to first-line use but may largely be explained by differences in disease severity. The benefit of MTX cotherapy with TCZ was not apparent in this study; however, this may be underestimated because of the very small sample size of the first-line TCZ cohort.