Abstract
Objective. To evaluate the relative risk (RR) of pulmonary disease among patients with rheumatoid arthritis (RA) treated with leflunomide (LEF).
Methods. We searched PubMed, Embase, and the Cochrane Central Register of Controlled Trials to April 15, 2014. We included double-blind randomized controlled trials (RCT) of LEF versus placebo or active comparator agents in adults with RA. Studies with fewer than 50 subjects or shorter than 12 weeks were excluded. Two investigators independently searched both databases. All authors reviewed selected studies. We compared RR differences using the Mantel-Haenszel random-effects method to assess total respiratory adverse events, infectious respiratory adverse events, noninfectious respiratory adverse events, interstitial lung disease, and death.
Results. Our literature search returned 5673 results. A total of 8 studies, 4 with placebo comparators, met our inclusion criteria. There were 708 respiratory adverse events documented in 4579 participants. Six cases of pneumonitis occurred, all in the comparator group. Four pulmonary deaths were reported, none in the LEF group. LEF was not associated with an increased risk of total adverse respiratory events (RR 0.99, 95% CI 0.56–1.78) or infectious respiratory adverse events (RR 1.02, 95% CI 0.58–1.82). LEF was associated with a decreased risk of noninfectious respiratory adverse events (RR 0.64, 95% CI 0.41–0.97).
Conclusion. Our study found no evidence of increased respiratory adverse events in RCT of LEF treatment.
Rheumatoid arthritis (RA) is a chronic debilitating inflammatory joint disease affecting up to 1% of the population in developed countries1. It is associated with significant symptoms, functional limitations, and an increased risk of mortality because of cardiovascular and other causes2,3,4. Adequate treatment results in symptom improvement, normalization of physical and social functioning, and improvements in mortality5,6,7. A variety of effective treatments exist for RA; however, patient and physician concerns regarding possible adverse events may limit optimal use in clinical practice8.
A concern over the risk of pulmonary toxicity, in particular interstitial lung disease (ILD), has been raised for many of the drugs used to treat RA. Historically, methotrexate (MTX) was most commonly attributed as a causative agent in ILD9. The introduction of leflunomide (LEF) to clinical practice was swiftly followed by reports of ILD associated with its use. Initial reports from Japan were subsequently supplemented by cases in Europe and the United States10,11,12. Cases of ILD have also been attributed to the majority of biologic agents used to treat RA13. ILD is an intrinsic part of the disease process in RA14. Pulmonary complications in RA may also occur because of infections and unrelated pulmonary disease processes15. Therefore, cases of ILD should be expected in patients with RA and it is important to compare any suspected drug-related risk with an appropriate control population.
Two systematic literature reviews of case reports and observational studies reported an increase in the risk of ILD in patients treated with LEF13,16. Similar to any form of scientific work, observational studies have limitations, the most relevant of which are a susceptibility to ascertainment and channeling biases. Double-blind randomized controlled trials (RCT) afford a setting that should limit these risks. A reported metaanalysis of double-blind RCT in patients with RA treated with MTX found an increased risk of infectious but not noninfectious pulmonary disease with MTX use, casting doubt on widely held beliefs on the drug’s pulmonary effects17.
The aim of our study was to evaluate the risk of ILD associated with LEF use by performing a metaanalysis of double-blind RCT in patients with RA.
MATERIALS AND METHODS
Data sources and searches
A systematic literature search was performed with no date limits using PubMed, Embase, and the Cochrane Central Register of Controlled Trials databases. The search was performed to April 15, 2014. We also searched for previously published metaanalyses and systematic literature reviews. The reference lists of relevant articles were reviewed. Full details of the search terms used are given in the supplementary appendix (available from the authors on request).
Study selection
The literature search was performed independently by 2 authors (CL and RC); discrepancies were resolved by consensus.
The inclusion criteria for study selection were (1) double-blind RCT; (2) human studies; (3) patients with RA; (4) studies in English; (5) studies consisting of a minimum of 2 arms, at least 1 receiving LEF and at least 1 not receiving LEF; (6) studies including only adults (> 18 yrs); (7) trials of ≥ 12 weeks’ duration; (8) studies of ≥ 50 patients; and (9) studies reporting respiratory side effects for LEF and comparator groups separately. In the case of multiple publications of 1 RCT, we included the publication most relevant to our inclusion criteria involving detailed reporting of respiratory side effects. If the results of a study were reported at multiple timepoints, we included the publication of greatest duration, provided it remained a double-blind RCT and fully reported respiratory adverse events. If required, we reviewed previous publications of the same trial to fully assess the trial protocol and risk of bias. This approach was taken to avoid including study subjects more than once in the metaanalysis.
Relevant articles were selected using a 2-step approach. First, titles and abstracts of identified references were screened to exclude articles that did not deal with the topic of interest. Second, the full text of relevant articles was reviewed.
Data extraction and quality assessment
For each included study, data were extracted by 2 of the authors independently (CL and RC). Any discrepancies were resolved by discussion.
The data were entered into a database (RC) and checked by the remaining authors.
The following variables were extracted: authors, year of publication, population studied, number of patients, mean age and range, sex, disease duration, percentage of LEF-naive, previous immunosuppressive drug use, steroid use at baseline, study design and duration, comparator drug, and adverse events. Adverse events were extracted as both total respiratory adverse events and individual adverse events in each category reported in any individual trial. There was minor variation in the terminology used to describe respiratory adverse events in the included studies; adverse events were therefore divided into 2 subgroups: infectious adverse events and noninfectious adverse events.
Data synthesis and analysis
Data were metaanalyzed using the RevMan Version 5.1 software18 and expressed as relative risk (RR) for dichotomous variables. Random-effects metaanalysis using the Mantel-Haenszel method was used throughout because the I2 statistic revealed the presence of between-study heterogeneity. Results were expressed as RR with 95% CI.
Assessment of bias
We used the criteria described in the Cochrane Handbook of Systematic Reviews 5.1.019 to assess for trial-level risk of bias in included studies. Two authors (CL and RC) independently assessed the studies for risk of bias. Any discrepancies were resolved by discussion and consensus. A risk-of-bias graph and summary were generated. Funnel plots were generated to assess for publication bias.
Sensitivity analysis
Sensitivity analysis was performed to assess the effect of (1) trial size (trials of < 400 participants vs trials of ≥ 400 participants), (2) exclusion of studies introducing significant heterogeneity to the results, (3) exclusion of studies using methods other than intention-to-treat for reporting of safety data, and (4) comparator drug (placebo vs active comparator). The sensitivity analysis for trial size was repeated at various cutoffs with no differences seen; the final decision on 400 participants was made because this bisected the dataset.
RESULTS
Literature search
The literature search produced 5673 citations. Of these citations, 4619 were from the Embase search, 882 from the PubMed search, and 172 from Cochrane Central. Of the 5673 initially returned citations, 5665 were excluded after the review of the abstract and/or full text of the article for the reasons shown in Figure 1. Eight articles met the inclusion criteria and were included in our metaanalysis.
Study characteristics
Characteristics of the 8 included studies are shown in Table 120,21,22,23,24,25,26,27. The mean study duration was 55 weeks and the number of patients ranged from 85 to 1784. The 8 articles reported on a total of 4579 patients, 2274 who received LEF and 2305 who received comparator treatments. Three studies involved synthetic disease-modifying antirheumatic drug (DMARD) comparators alone, 2 studies placebo comparators only, 2 placebo and synthetic DMARD comparator groups, and 1 paeoniflorin plus cervus and cucumis polypeptide injection20,21,22,23,24,25,26,27. There was no difference in dropout rates between the LEF (median 28.68%, range 4.33%–56.32%) and comparator groups (29.17%, 12.75%–70.44%).
Findings
There were 708 documented respiratory adverse events. LEF was not associated with an increased risk of total adverse respiratory events relative to comparator agents (RR 0.99, 95% CI 0.56–1.78, I2 = 88%; Figure 220,21,22,23,24,25,26,27). LEF was not associated with an increased risk of infectious adverse respiratory events (RR 1.02, 95% CI 0.58–1.82, I2 = 88%; Supplementary Figure 1 is available from the authors on request). LEF was associated with a decreased risk of noninfectious respiratory adverse events (RR 0.64, 95% CI 0.41–0.97, I2 = 0%; Supplementary Figure 2 is available from the authors on request). There were 6 reported cases of pneumonitis, all in patients treated with MTX in the comparator group (RR 0.20, 95% CI 0.02–1.75); no definitive diagnostic features or additional information were reported in these cases. There were 4 pulmonary deaths, all in patients treated with MTX in the comparator group (RR 0.26, 95% CI 0.03–2.20).
Risk of bias in included studies
In general, the data suggested a low risk of bias in the included studies (Supplementary Figure 3 and Supplementary Figure 4 are available from the authors on request). Inadequate information to assess the risk of selection bias was the most common potential risk of bias identified with 3 of the studies providing inadequate information to assess both the risk of bias because of random sequence generation and the risk of bias because of allocation concealment. Attrition bias was the most serious risk of bias identified with 2 of the studies reporting safety data on study completers only.
The funnel plot for total respiratory events showed no evidence of publication bias (Supplementary Figure 5 is available from the authors on request).
Sensitivity analysis
No significant effect of study size on overall results was found (< 400 patients, RR 0.79, 95% CI 0.58–1.09; ≥ 400 patients, RR 2.16, 95% CI 0.49–9.49). No difference in total respiratory adverse events was seen between studies with placebo (RR 0.93, 95% CI 0.60–1.44) compared with all active comparators (RR 1.10, 95% CI 0.50–2.45) or MTX (RR 0.91, 95% CI 0.82–1.02). While overall LEF was associated with a decreased risk of non-infectious respiratory adverse events, analysis by comparator agent did not reveal any differences (placebo, RR 1.03, 95% CI 0.04–25.00; all active comparators, RR 0.41, 95% CI 0.08–2.21; MTX, RR 0.41, 95% CI 0.08–2.21).
Analysis of study heterogeneity revealed that significant heterogeneity was introduced by the inclusion of the study by Chen, et al; we therefore repeated the analysis excluding the study20. This significantly reduced the I2, but did not materially change the overall results. In our analysis, LEF was not associated with an increase in total respiratory adverse events (RR 0.90, 95% CI 0.81–1.01, I2 = 0%) or infectious respiratory adverse events (RR 0.96, 95% CI 0.84–1.10, I2 = 0%), and was associated with a decrease in noninfectious respiratory adverse events (RR 0.64, 95% CI 0.41–0.97, I2 = 0%).
Exclusion of the studies reporting safety data on study completers only did not alter the overall results (RR for total respiratory adverse events 0.90, 95% CI 0.81–1.00, I2 = 0%; RR for infectious respiratory adverse events 0.93, 95% CI 0.78–1.11, I2 = 14%; RR for noninfectious respiratory adverse events 0.64, 95% CI 0.41–0.97, I2 = 0%).
DISCUSSION
There was no increase in lung disease in patients with RA treated with LEF in our study. In prespecified subgroup analyses, LEF had a neutral effect on infectious pulmonary adverse events and decreased the risk of noninfectious pulmonary adverse events. The majority of studies used MTX as a comparator agent; previous work from our group demonstrated no association between MTX use and an increased risk of noninfectious pulmonary adverse events compared with other biologic and synthetic DMARD17. In our study, 6 reported cases of pneumonitis occurred in the MTX group compared with none in the LEF group; this did not reach statistical significance. Potential explanations include chance, a protective effect of LEF against RA-related ILD, a detrimental effect of MTX compared with LEF but not to other DMARD, or a small increased risk with MTX use, which falls within the CI of the previous studies17,28.
LEF was originally introduced in the United States and Europe at the end of the 1990s with no reports of a significant increase in pulmonary disease. However, soon after the introduction of LEF to Japan, case reports of ILD appeared10,11. Subsequently, cases were also reported from the United States and Europe12. Two systematic literature reviews of observational studies and case reports reported on LEF-related ILD13,16. Roubille and Haraoui reported on 34 cases of LEF-induced or -exacerbated ILD, and while acknowledging the limitations of the data, concluded that LEF can be associated with rapid onset, potentially fatal ILD13. Raj and Nugent reported on 138 cases of LEF-related ILD and concluded that LEF can cause ILD most commonly within the first 3 months of starting treatment16. In contrast to these studies, a nested case-control study performed by Suissa, et al concluded that the 2-fold increase in ILD with LEF use was likely the result of channeling bias12. The disparity in results and conclusions present in the literature represents the ongoing uncertainty regarding LEF’s involvement in lung disease in clinical practice. We believe our study is concordant with the findings of Suissa, et al’s and can help ameliorate physician concern over LEF usage. All study designs have inherent strengths and weaknesses; the design of double-blind RCT intends to remove both overt and covert bias as confounding factors in study results. A significant limitation of the majority of randomized trials is that the study population is of relatively small size, reducing the power to detect small but clinically significant effects. Metaanalytical techniques help alleviate this limitation because the combination of studies increases the overall power, in addition to equally distributing inherent bias between the groups.
Similar to any study, ours also has some important limitations. Our study used previously performed clinical trials, therefore we were dependent on accurate recording and reporting of adverse events by the investigators involved in the original studies. As with previous work in this area, analysis was restricted to the adverse events detailed in the respective studies and we did not have access to the unpublished patient-level data. There was significant variation between studies in the frequency and terminology used to report adverse events. This is likely reflective of the adverse event definitions and reporting requirements of the original randomized trials. Included studies were of relatively short duration with a mean duration of 55 weeks; however, previous studies have reported that the majority of cases of purported LEF-related ILD occurred within the first 20 weeks of treatment13,16. Participants in clinical trials are by design a more homogeneous and healthier population with fewer comorbidities; the results obtained from an evaluation of these trials may lack external validity. In particular, patients with preexisting lung disease are generally excluded from participation in clinical trials, limiting the generalizability of these findings to that particular population. A final limitation may be considered the relatively low number of studies included in our metaanalysis. We elected to focus on large-scale high-quality studies, which included full reporting of the relevant data to minimize the effect of data collection errors on our results.
Our study has important clinical implications. The lack of evidence for an effect of LEF on lung disease suggests that previous reports may represent an association rather than a direct causative effect. Many factors including channeling bias and a genuine desire to report potential safety concerns may have contributed to this. While not completely excluding an effect of LEF on pulmonary disease, our data suggest that the extreme caution afforded this agent in patients with pulmonary disease may not be justified. Coupled with previous data on MTX, our data support increased use of existing effective disease-modifying agents in the treatment of patients with RA, including those with coexistent or new-onset pulmonary disease17. More severe RA is a risk factor for RA-related ILD, and the attainment of low disease activity with DMARD, including LEF, may contribute to a protective pulmonary effect14.
The results of our metaanalysis demonstrate no increase in respiratory adverse events in patients with RA treated with LEF in double-blind RCT. Studies of pulmonary adverse events in patients treated with LEF and related agents for other diseases may provide further valuable information.
Acknowledgment
The PRISMA Checklist and other supplementary material are available from the authors on request.
- Accepted for publication January 22, 2016.