- This study assessed the quality of evidence for interventions in common mental disorders within Cochrane reviews.
- Researchers analyzed 93 Cochrane reviews, encompassing 207 interventions for various mental health conditions.
- Only 47 (22.7%) of interventions showed moderate or high-quality evidence and significant superiority over controls.
- The authors concluded that many common mental disorder interventions lack robust, convincing evidence.
- Clinicians should recognize that most interventions for common mental disorders lack strong evidence of efficacy.
Navigating the Evidence Base for Common Mental Disorders
Common mental disorders, such as depressive and anxiety disorders, represent a significant public health challenge, affecting millions globally and imposing considerable personal and societal burdens [1, 2]. These conditions frequently lead to impaired functioning, reduced quality of life, and substantial healthcare costs [3, 4]. While numerous pharmacological and psychological interventions are available and routinely employed in clinical practice, the strength of the evidence supporting their widespread use can vary [5, 6]. Guidelines and clinical recommendations are typically built upon systematic evaluations of efficacy and safety, emphasizing the importance of robust data to inform treatment selection [7, 8]. A comprehensive understanding of the quality of evidence for these interventions is therefore essential for clinicians striving to deliver effective, evidence-based care.
Re-evaluating the Evidence Landscape
A prior analysis, which examined a random sample of 2,428 (35%) Cochrane reviews published between January 2008 and March 2021, highlighted concerns regarding the evidence base for common mental disorder interventions. In that earlier work, only two of 48 interventions (4.2%) were found to demonstrate a high quality of evidence, show significant superiority to a comparator, and be regarded as effective by Cochrane authors. This finding prompted a re-evaluation, as the high-quality GRADE standard (Grading of Recommendations Assessment, Development and Evaluation, a system used to rate the quality of evidence and strength of recommendations) might be overly demanding. Specifically, the assessment of risk of bias, a component of the GRADE methodology that evaluates the likelihood of systematic errors in a study, was considered potentially too stringent, leading to a downgrade of evidence quality.
To address these concerns and provide a more comprehensive picture, the present study included all systematic reviews of interventions for common mental disorders published by the Common Mental Disorder Cochrane group between January 1, 2008, and June 24, 2025. The researchers focused on interventions demonstrating either a high or a moderate quality of evidence. Interventions were included if they were compared to placebo, no treatment, or treatment as usual in randomized or quasi-randomized trials, and had received GRADE ratings for their primary outcome. The study adhered to rigorous methodological standards, carried out in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses Statement (PRISMA), and its protocol was registered at PROSPERO (CRD42024547340).
The data extraction process was meticulous: for each Cochrane report, the GRADE rating of the first listed primary outcome was extracted. If this initial rating was not high, the GRADE ratings of up to three further primary or secondary outcomes were inspected, with only secondary outcomes addressing target symptoms being included. For the outcome with the highest GRADE rating, additional detailed data were extracted, including the number of trials, number of patients, effect size, and confidence interval. As an addition to the original protocol, information on whether the Cochrane authors recommended the intervention as effective was also extracted. To ensure accuracy and minimize bias, identified Cochrane reports were randomly allocated to five pairs of raters. For each report, two reviewers independently extracted data and compared the results, with discrepancies resolved by consensus or, if necessary, by involving a third rater.
Scope of Interventions Examined
The comprehensive analysis began by identifying 150 reviews of interventions for common mental disorders published in the Cochrane Database of Systematic Reviews between January 1, 2008, and June 24, 2025. From this initial pool, 57 reviews were excluded due to various criteria. Specifically, 42 reviews lacked a GRADE rating for their primary outcome, 20 reviews included only comparisons with active treatments rather than placebo, no treatment, or treatment as usual, and one review was superseded by an updated version. After these exclusions, 93 Cochrane reviews met the inclusion criteria, providing the foundation for evaluating 207 eligible interventions.
These 93 included reviews covered diverse patient populations: 74 reports focused on adults, 13 reports examined interventions in children and adolescents, and 9 reports included mixed samples of children, adolescents, and adults. The 207 eligible interventions addressed a broad spectrum of common mental disorders. The largest categories included 50 treatments for depressive disorders, 46 for anxiety disorders (which encompassed obsessive-compulsive disorder), 27 for post-traumatic stress disorder, and 46 for self-harm. Other conditions covered were 6 interventions for pathological gambling, 5 for bipolar disorder or mania, 4 for somatoform disorders, 4 for eating disorders, and 3 for insomnia. The remaining interventions addressed several other conditions not specifically enumerated.
Quality of Evidence and Efficacy Findings
The analysis of 207 eligible interventions revealed a varied landscape regarding the quality of evidence supporting their use for common mental disorders. The Cochrane authors rated 11 (5.3%) of these interventions as showing a high quality of evidence, while 60 (29.0%) demonstrated a moderate quality of evidence. A substantial portion of interventions, however, were supported by lower quality evidence: 69 (33.0%) showed a low quality of evidence, and 66 (31.9%) were rated as having a very low quality of evidence. When considering efficacy, the researchers found that of the interventions with high quality evidence, 8 (3.9%) were significantly superior to the control condition. For those with moderate quality evidence, 39 (18.8%) showed significant superiority. Even among interventions with lower quality evidence, 20 (9.7%) with low quality and 20 (9.7%) with very low quality were significantly superior to the control condition. Critically, the study determined that only 47 (22.7%) of all interventions were supported by a high or moderate quality of evidence and simultaneously demonstrated significant superiority to the control condition, indicating a substantial gap in robust evidence for many commonly used treatments.
Beyond efficacy, the reporting of harms associated with these interventions was also examined. Harms were reported for 62 of the 207 interventions (30.0%), with 12 (5.8%) yielding significantly more harms than the control conditions. A notable limitation impacting the strength of the evidence was the small number of underlying studies for many comparisons: 50.2% of all comparisons were based on only one (75 of 207) or two (29 of 207) studies, which inherently limits the generalizability and robustness of findings. When comparing treatment modalities, pharmacotherapy, encompassing 99 interventions, showed that 29.3% had high or moderate quality of evidence and statistically significant results. However, the majority of pharmacological interventions, 63.6% (63 of 99), showed low or very low quality of evidence, and 52.5% (52 of 99) did not have statistically significant results. A similar pattern emerged for psychotherapy, which included 75 interventions: 20.0% demonstrated high or moderate quality of evidence and statistically significant results. Yet, 66.7% (50 of 75) of psychotherapy interventions had low or very low quality of evidence, and 54.7% (41 of 75) did not yield statistically significant results. For all other intervention types, only 6.5% met the combined criteria of high or moderate quality of evidence and statistically significant results.
Interventions with Robust Support
Despite the broad scope of interventions examined, the study identified a very limited number that met the most stringent criteria for robust evidence. Specifically, in only 6 of the 207 comparisons analyzed, the intervention demonstrated both a high or moderate quality of evidence and statistically significant superiority over the control condition, and was also explicitly considered effective by the Cochrane authors. These six interventions, representing a small fraction of the total, included valproate for acute mania in mixed samples of inpatients and outpatients, eszopiclone for insomnia, cognitive behavior therapy (CBT) for adult self-harm, CBT for child and adolescent anxiety disorders, sildenafil for sexual dysfunction caused by antidepressants, and amitriptyline for depressive disorders. This highlights a critical need for more comprehensive and high-quality research to establish a stronger evidence base for many commonly used treatments in mental health.
Further analysis revealed varying levels of evidence strength across different common mental disorders. For interventions targeting depressive disorders, 32.0% showed high or moderate quality of evidence and statistically significant superiority over control conditions. Similarly, for anxiety disorders, 28.3% of interventions met these criteria. However, the proportion of interventions with robust support was considerably lower for other conditions. For post-traumatic stress disorder, 14.8% of interventions demonstrated high or moderate quality of evidence and statistically significant superiority, while for self-harm, this figure dropped to 10.9%. These findings underscore that even within specific diagnostic categories, the evidence supporting interventions varies substantially, urging clinicians to critically evaluate the strength of evidence for each treatment option.
Clinical Implications and Limitations
The comprehensive analysis underscores a critical challenge in the evidence base for common mental disorders. In sum, of all interventions evaluated for these conditions, less than a quarter (22.7%) were supported by high or moderate quality evidence and were statistically significantly superior to the comparator. This finding suggests that many interventions currently studied and applied in clinical practice for common mental disorders still lack robust, convincing evidence. For practicing physicians, this highlights the importance of critically appraising the evidence supporting treatment recommendations and engaging in shared decision-making with patients, particularly when considering interventions with less established efficacy.
The researchers acknowledged that their evaluation was based on the existing GRADE ratings provided by the Cochrane report authors, which may be considered a limitation of the study. While the possibility of errors from Cochrane assessors cannot be entirely excluded, the authors noted that such ratings are probably quite accurate, given that Cochrane reviews have consistently demonstrated high quality in previous assessments. This methodological transparency ensures that while the findings are significant, the context of their derivation, including potential limitations, is clearly understood, reinforcing the need for ongoing rigorous research to strengthen the evidence base in mental health.
References
1. Cipriani A, Furukawa TA, Salanti G, et al. Comparative Efficacy and Acceptability of 21 Antidepressant Drugs for the Acute Treatment of Adults With Major Depressive Disorder: A Systematic Review and Network Meta-Analysis. The Lancet. 2018. doi:10.1016/s0140-6736(17)32802-7
2. Norman R, Byambaa M, De R, Butchart A, Scott JG, Vos T. The Long-Term Health Consequences of Child Physical Abuse, Emotional Abuse, and Neglect: A Systematic Review and Meta-Analysis. PLoS Medicine. 2012. doi:10.1371/journal.pmed.1001349
3. Finnes A, Enebrink P, Ghaderi A, Dahl J, Nager A, Öst L. Psychological treatments for return to work in individuals on sickness absence due to common mental disorders or musculoskeletal disorders: a systematic review and meta-analysis of randomized-controlled trials.. International archives of occupational and environmental health. 2019. doi:10.1007/s00420-018-1380-x
4. Nigatu YT, Liu Y, Uppal M, et al. Interventions for enhancing return to work in individuals with a common mental illness: systematic review and meta-analysis of randomized controlled trials.. Psychological medicine. 2016. doi:10.1017/S0033291716002269
5. Boniface S, Malet-Lambert I, Coleman R, et al. The Effect of Brief Interventions for Alcohol Among People with Comorbid Mental Health Conditions: A Systematic Review of Randomized Trials and Narrative Synthesis.. Alcohol and alcoholism (Oxford, Oxfordshire). 2018. doi:10.1093/alcalc/agx111
6. Tareke M, Yirdaw BA, Gebeyehu A, Gelaye B, Azale T. Effectiveness of school-based psychological interventions for the treatment of depression, anxiety and post-traumatic stress disorder among adolescents in sub-Saharan Africa: A systematic review of randomized controlled trials. PLoS ONE. 2023. doi:10.1371/journal.pone.0293988
7. Page MJ, Moher D, Bossuyt PM, et al. PRISMA 2020 explanation and elaboration: updated guidance and exemplars for reporting systematic reviews. BMJ. 2021. doi:10.1136/bmj.n160
8. Michie S, Richardson M, Johnston M, et al. The Behavior Change Technique Taxonomy (v1) of 93 Hierarchically Clustered Techniques: Building an International Consensus for the Reporting of Behavior Change Interventions. Annals of Behavioral Medicine. 2013. doi:10.1007/s12160-013-9486-6