Metaanalyses, Network Metaanalyses, and Systematic Reviews: The Perpetual Motion Machine All Over Again

YUSUF YAZICI

doi:10.3899/jrheum.190900

The term metaanalysis was first used in the mid-1970s to describe methods designed to characterize and combine the findings of prior studies in order to increase statistical power, provide quantitative summary estimates, and identify data gaps and biases¹. (In this editorial I will use the term metaanalysis to encompass not only metaanalyses but also systematic reviews and network metaanalyses, because the issues I raise apply to all of them and their variations.) When applied to studies conducted with similar populations and methods, metaanalyses can be useful. However, this is not the case with many metaanalyses where the findings of studies that differ in important ways have been combined, prompting the comment that “they have mixed apples and oranges”, and sometimes “apples, lice, and killer whales”, yielding meaningless conclusions¹,².

Combining the results of individual studies potentially increases the total number of participants, and this should mean increased statistical power, yet differences in participant demographics and study methods may actually lead to decreased power owing to variability in the patient characteristics¹. This then leads to more difficulty in ascertaining the real effects.
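To make the point about variability concrete, here is a minimal sketch of a standard DerSimonian-Laird random-effects pooling, written for illustration only. The helper name `pool_random_effects` and both sets of effect sizes are hypothetical and are not taken from any study cited here. The two sets of five invented trials have the same average effect and the same within-trial variance; only the spread between trials differs.

```python
import math


def pool_random_effects(effects, variances):
    """DerSimonian-Laird random-effects pooling (illustrative helper, not from the editorial)."""
    # Inverse-variance (fixed-effect) weights and pooled estimate.
    w = [1.0 / v for v in variances]
    sw = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sw

    # Cochran's Q measures how much the trials disagree with the pooled value.
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))
    df = len(effects) - 1
    c = sw - sum(wi ** 2 for wi in w) / sw

    # Between-trial variance (tau^2), truncated at zero.
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0

    # Random-effects weights add tau^2 to every trial's variance.
    w_star = [1.0 / (v + tau2) for v in variances]
    pooled = sum(wi * yi for wi, yi in zip(w_star, effects)) / sum(w_star)
    se = math.sqrt(1.0 / sum(w_star))
    return pooled, se, tau2


# Hypothetical data: five trials, identical within-trial variance.
variances = [0.04] * 5
similar = [0.30, 0.28, 0.32, 0.31, 0.29]   # comparable populations and methods
mixed = [0.90, -0.30, 0.60, -0.10, 0.40]   # "apples and oranges"

for label, effects in (("similar", similar), ("mixed", mixed)):
    pooled, se, tau2 = pool_random_effects(effects, variances)
    print(f"{label}: pooled={pooled:.2f} se={se:.3f} tau2={tau2:.3f} z={pooled / se:.2f}")
```

Under these assumed numbers, both sets pool to the same estimate, but the heterogeneous set carries a much larger standard error, so the same number of participants yields a weaker test of the effect.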

Add to this the issue of unpublished research, which can skew the conclusions because positive findings get published more often than negative results, starting with the decision to submit them in the first place³. It has been reported that falsified data also make it into metaanalyses⁴. In one example, the authors showed that 46% of all metaanalysis publications had their conclusions changed by publications with falsified data and 32% of all the analyses had a considerable change in the outcome⁵.

There has also been a surge in the number of metaanalyses published over the years. The rate of growth was significantly greater for metaanalysis at 4676% compared to randomized clinical trials (RCT) at 138% during the same time period⁶. Metaanalyses may help to synthesize and update the literature using valuable methods of evidence-based medicine; however, only an estimated 3% of them are methodologically sound, nonredundant, and provide useful clinical information⁷. Although the optimal metaanalysis/RCT ratio has yet to be determined, an ever-increasing proportion of this literature may provide minimal value, which should precipitate a reappraisal of the foundations, production, and reporting of metaanalyses⁶,⁸.

Many potential reasons for this trend of an exploding number of metaanalyses have been proposed, ranging from an actual need for updating accumulated evidence and hence the need for summarized data, to padding of resumes and journal citation statistics⁹,¹⁰. Others have also suggested that metaanalyses may serve as “easily publishable units or marketing tools”¹¹,¹². Even what is considered the gold standard for metaanalyses, Cochrane Reviews, has been shown to not meet its own standards in its reports, and the scandal around the firing of one of its founders should give pause to anyone who cares about the sanctity of scientific rigor¹³,¹⁴. These recent trends have led to questions about the purpose, quality, and credibility of most reviews, to calls to abandon metaanalyses altogether, and to the view that part of the responsibility falls on journal editors and reviewers to make sure only good-quality work gets published¹⁵.

A potential problem for rheumatology, and specifically for rheumatoid arthritis (RA) studies, is how the changed classification criteria for RA will affect future recommendations when a metaanalysis is done looking at treatment options for RA. The main issue is that the new criteria published in 2010¹⁶ have been shown, by us and others, to have decreased specificity, which of course leads to patients who do not have RA, and whose condition is explained by other diagnoses, being classified as having RA¹⁷,¹⁸,¹⁹. There have also been data suggesting that patients with RA classified using the 2010 criteria have less severe disease, respond better to treatments, and have improved remission rates²⁰. Some have suggested that RA itself is changing. A far more plausible explanation is that the new criteria, by the way they were developed, select for patients with milder disease, and even patients who do not have RA, to be enrolled into clinical trials, hence skewing the results. Imagine the confusion and likely incorrect conclusions that would be reached after a metaanalysis with these skewed results.

We seem to believe that more data from many patients, regardless of how the data were collected, analyzed, and reported, will answer many questions that a single, well-done trial would not. I disagree. A well-done study would probably tell us a lot more than 20 studies combined if many of them have methodological flaws and, very likely, studied somewhat different types of patients, as mentioned before. It is much more straightforward to dissect a single study to really understand what the question asked was, how it was studied, and what the conclusions were than to try to interpret a metaanalysis where you do not know how the many potential issues listed above have affected the conclusions.

The reason for doing an RCT is not that it can someday be part of a metaanalysis. Maybe we should be more focused on the misplaced desire to keep pooling trials that probably should not be pooled to draw conclusions that should not be drawn. Each trial’s only goal, in the case of drugs being tested, is to show if something works, yes or no, plus or minus, 1 or 0. All the other derivative conclusions are nice to have and can lead to further hypothesis development for the next study. RCT, however, are very good tools for saying that a certain medication works, and you should potentially offer it to a specific patient to see how that patient would do. Nothing more, nothing less.

I think the time has come to limit RCT and their conclusions to what was measured in that trial. The attempt to draw more than what these individual RCT can provide is the problem. We are constantly looking for the shortcuts that are not there. This is no different from all the attempts at personalized medicine for complex conditions, such as hypertension, diabetes, or RA, where it has been very robustly demonstrated that predicting outcomes at a single-patient level will very likely never be achieved²¹. As Roberts, et al stated, “Thus, our results suggest that genetic testing, at its best, will not be the dominant determinant of patient care and will not be a substitute for preventative medicine strategies incorporating routine checkups and risk management based on the history, physical status and life style of the patient … Recognition of these merits and limits … can minimize unrealistic expectations and foster fruitful investigations²¹.” This love of trying to draw simple conclusions that would be applicable to all patients seems similar to 17th-century attempts to develop a perpetual motion machine. Everybody really loved the idea and wanted it to be possible (similar to the enthusiasm for individualized genetic testing or personalized medicine attempts), but you cannot break the first law of thermodynamics. Hence, there will never be a perpetual motion machine.

The latest incarnation of this kind of wishful thinking is related to artificial intelligence and the “era of big data,” which can be thought of as the next step in the metaanalyses movement²². I remember the days when all would be solved if we only could sequence the whole human genome. We did, and learned a lot about diseases, but we found no insights into predicting diseases, best treatments, or outcomes in an individual patient with a common disease, which is what most people have and what most doctors try to treat. I would respectfully suggest that while we still can, we should try going back to what I will call “small” data, where only a few, well-done studies, with the aim of answering a hypothesis-driven question, are taken seriously and used in making treatment recommendations and decisions, because I do not know of more serious work for a doctor than taking care of an individual patient.

Benjamin Franklin said when he was a young man, “Lose no time; be always employ’d in something useful; cut off all unnecessary actions.” Maybe it is time we applied this to our approach to most metaanalyses.

Footnotes

See Placebo response in RA trials, page 28

REFERENCES

1. Barnard ND, Willett WC, Ding EL. The misuse of meta-analysis in nutrition research. JAMA 2017;318:1435–6.
2. Eysenck H. Meta-analysis squared: does it make sense? Am Psychol 1995;50:110–1.
3. Ioannidis JP. Effectiveness of antidepressants: an evidence myth constructed from a thousand randomized trials? Philos Ethics Humanit Med 2008;3:14.
4. Garmendia CA, Bhansali N, Madhivanan P. Research misconduct in FDA-regulated clinical trials: a cross-sectional analysis of warning letters and disqualification proceedings. Ther Innov Regul Sci 2018;52:592–605.
5. Garmendia CA, Nassar Gorra L, Rodriguez AL, Trepka MJ, Veledar E, Madhivanan P. Evaluation of the inclusion of studies identified by the FDA as having falsified data in the results of meta-analyses: the example of the apixaban trials. JAMA Intern Med 2019;179:582–4.
6. Niforatos JD, Weaver M, Johansen ME. Assessment of publication trends of systematic reviews and randomized clinical trials, 1995 to 2017. JAMA Intern Med 2019 Jul 29 (E-pub ahead of print).
7. Ioannidis JP. The mass production of redundant, misleading, and conflicted systematic reviews and meta-analyses. Milbank Q 2016;94:485–514.
8. Møller MH, Ioannidis JPA, Darmon M. Are systematic reviews and meta-analyses still useful research? We are not sure. Intensive Care Med 2018;44:518–20.
9. Patsopoulos NA, Analatos AA, Ioannidis JP. Relative citation impact of various study designs in the health sciences. JAMA 2005;293:2362–6.
10. Qadir XV, Clyne M, Lam TK, Khoury MJ, Schully SD. Trends in published meta-analyses in cancer research, 2008–2013. Cancer Causes Control 2017;28:5–12.
11. Bastian H, Glasziou P, Chalmers I. Seventy-five trials and eleven systematic reviews a day: how will we ever keep up? PLoS Med 2010;7:e1000326.
12. Siontis KC, Ioannidis JPA. Replication, duplication, and waste in a quarter million systematic reviews and meta-analyses. Circ Cardiovasc Qual Outcomes 2018;11:e005212.
13. Franco JVA, Garrote VL, Escobar Liquitay CM, Vietto V. Identification of problems in search strategies in Cochrane Reviews. Res Syn Methods 2018;9:408–16.
14. Gøtzsche PC. Cochrane — no longer a collaboration. [Internet. Accessed August 15, 2019.] Available from: blogs.bmj.com/bmj/2018/11/08/peter-c-gotzsche-cochrane-no-longer-a-collaboration
15. Wallach JD. Meta-analysis metastasis. JAMA Intern Med 2019 Jul 29 (E-pub ahead of print).
16. Aletaha D, Neogi T, Silman AJ, Funovits J, Felson DT, Bingham CO 3rd, et al. 2010 rheumatoid arthritis classification criteria: an American College of Rheumatology/European League against Rheumatism collaborative initiative. Ann Rheum Dis 2010;69:1580–8.
17. Kennish L, Labitigan M, Budoff S, Filopoulos MT, McCracken WA, Swearingen CJ, et al. Utility of the new rheumatoid arthritis 2010 ACR/EULAR classification criteria in routine clinical care. BMJ Open 2012;2:e001117.
18. Kaneko Y, Kuwana M, Kameda H, Takeuchi T. Sensitivity and specificity of 2010 rheumatoid arthritis classification criteria. Rheumatology 2011;50:1268–74.
19. van der Linden MP, Knevel R, Huizinga TW, van der Helm-van Mil AH. Classification of rheumatoid arthritis: comparison of the 1987 American College of Rheumatology criteria and the 2010 American College of Rheumatology/European League Against Rheumatism criteria. Arthritis Rheum 2011;63:37–42.
20. Burgers LE, van Nies JA, Ho LY, de Rooy DP, Huizinga TW, van der Helm-van Mil AH. Long-term outcome of rheumatoid arthritis defined according to the 2010-classification criteria. Ann Rheum Dis 2014;73:428–32.
21. Roberts NJ, Vogelstein JT, Parmigiani G, Kinzler KW, Vogelstein B, Velculescu VE. The predictive capacity of personal genome sequencing. Sci Transl Med 2012;4:133ra58.
22. Emanuel EJ, Wachter RM. Artificial intelligence in health care: will the value match the hype? JAMA 2019;321:2281–2.
