Published on in Vol 7 (2024)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/54810, first published .
Identifying Depression Through Machine Learning Analysis of Omics Data: Scoping Review

Identifying Depression Through Machine Learning Analysis of Omics Data: Scoping Review

Identifying Depression Through Machine Learning Analysis of Omics Data: Scoping Review

Review

1School of Nursing, Columbia University, New York, NY, United States

2Brookdale Department of Geriatrics and Palliative Care, Icahn School of Medicine, Mount Sinai Health System, New York, NY, United States

3School of Nursing, University of Pennsylvania, Philadelphia, PA, United States

Corresponding Author:

Brittany Taylor, AAS, BA, BS, MPhil

School of Nursing

Columbia University

560 W 168th St

New York, NY, 10032

United States

Phone: 1 2123424172

Email: bt2542@cumc.columbia.edu


Background: Depression is one of the most common mental disorders that affects >300 million people worldwide. There is a shortage of providers trained in the provision of mental health care, and the nursing workforce is essential in filling this gap. The diagnosis of depression relies heavily on self-reported symptoms and clinical interviews, which are subject to implicit biases. The omics methods, including genomics, transcriptomics, epigenomics, and microbiomics, are novel methods for identifying the biological underpinnings of depression. Machine learning is used to analyze genomic data that includes large, heterogeneous, and multidimensional data sets.

Objective: This scoping review aims to review the existing literature on machine learning methods for omics data analysis to identify individuals with depression, with the goal of providing insight into alternative objective and driven insights into the diagnostic process for depression.

Methods: This scoping review was reported following the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews) guidelines. Searches were conducted in 3 databases to identify relevant publications. A total of 3 independent researchers performed screening, and discrepancies were resolved by consensus. Critical appraisal was performed using the Joanna Briggs Institute Critical Appraisal Checklist for Analytical Cross-Sectional Studies.

Results: The screening process identified 15 relevant papers. The omics methods included genomics, transcriptomics, epigenomics, multiomics, and microbiomics, and machine learning methods included random forest, support vector machine, k-nearest neighbor, and artificial neural network.

Conclusions: The findings of this scoping review indicate that the omics methods had similar performance in identifying omics variants associated with depression. All machine learning methods performed well based on their performance metrics. When variants in omics data are associated with an increased risk of depression, the important next step is for clinicians, especially nurses, to assess individuals for symptoms of depression and provide a diagnosis and any necessary treatment.

JMIR Nursing 2024;7:e54810

doi:10.2196/54810

Keywords



Significance of Depression

Depression is one of the most common mood disorders, with a prevalence of approximately 20% in adults in the United States [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1,Tomasik J, Han SY, Barton-Owen G, Mirea D, Martin-Key NA, Rustogi N, et al. A machine learning algorithm to differentiate bipolar disorder from major depressive disorder using an online mental health questionnaire and blood biomarker data. Transl Psychiatry. Jan 12, 2021;11(1):41. [FREE Full text] [CrossRef] [Medline]2]. Among people with diagnosed depression, nearly half experience severe depression, and 40% experience moderate depression [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1]. Between 2010 and 2018, the number of adults in the United States diagnosed with depression increased by 13%, and the associated health care costs also increased, including medical and pharmaceutical costs, workplace absenteeism, and suicide-related costs [Greenberg PE, Fournier AA, Sisitsky T, Simes M, Berman R, Koenigsberg SH, et al. The economic burden of adults with major depressive disorder in the United States (2010 and 2018). Pharmacoeconomics. Jun 05, 2021;39(6):653-665. [FREE Full text] [CrossRef] [Medline]3]. Despite a greater investment in mental health, approximately half of the people experiencing depression have been diagnosed and treated [Di Y, Wang J, Liu X, Zhu T. Combining polygenic risk score and voice features to detect major depressive disorders. Front Genet. Dec 20, 2021;12:761141. [FREE Full text] [CrossRef] [Medline]4]. There have been limited improvements in the mental health care of depression during the past decade, primarily owing to the challenges in accurately diagnosing this complex illness [Walther A, Cannistraci CV, Simons K, Durán C, Gerl MJ, Wehrli S, et al. Lipidomics in major depressive disorder. Front Psychiatry. Oct 15, 2018;9:459. [FREE Full text] [CrossRef] [Medline]5]. Consequently, there is an urgent imperative to explore and establish more objective diagnostic approaches that can better identify individuals with depression and pave the way for more effective interventions and personalized treatment strategies.

Diagnostic Methods for Depression

The gold standard for depression diagnosis involves a structured psychiatric interview [Tomasik J, Han SY, Barton-Owen G, Mirea D, Martin-Key NA, Rustogi N, et al. A machine learning algorithm to differentiate bipolar disorder from major depressive disorder using an online mental health questionnaire and blood biomarker data. Transl Psychiatry. Jan 12, 2021;11(1):41. [FREE Full text] [CrossRef] [Medline]2] that includes validated depression scales such as the Center for Epidemiologic Studies–Depression Scale, Hamilton Rating Scale for Depression-17, Montgomery-Asberg Depression Rating Scale, and the Beck Depression Inventory [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. While these validated scales can be administered by a trained interviewer, a licensed mental health provider is required to make a formal diagnosis [Tomasik J, Han SY, Barton-Owen G, Mirea D, Martin-Key NA, Rustogi N, et al. A machine learning algorithm to differentiate bipolar disorder from major depressive disorder using an online mental health questionnaire and blood biomarker data. Transl Psychiatry. Jan 12, 2021;11(1):41. [FREE Full text] [CrossRef] [Medline]2]. This method, while routinely used, is subjective to the clinician conducting the interview, leading to potential variations in diagnosis.

There are several other barriers to the diagnosis of depression, which include limited access to health care services and societal stigma toward mental health diagnoses. The Diagnostic and Statistical Manual of Mental Disorders defines depression as a heterogenous disorder that is diagnosed based on the core symptoms of depressed mood or anhedonia and at least 4 of the 9 other symptoms, including appetite changes, sleep changes, fatigue, difficulty in concentrating, feeling worthless, and suicidal ideation; depression is present if these symptoms last for at least 2 weeks [Walther A, Cannistraci CV, Simons K, Durán C, Gerl MJ, Wehrli S, et al. Lipidomics in major depressive disorder. Front Psychiatry. Oct 15, 2018;9:459. [FREE Full text] [CrossRef] [Medline]5]. Furthermore, the heterogeneity of symptoms in depression makes diagnosis difficult [Squarcina L, Villa FM, Nobile M, Grisan E, Brambilla P. Deep learning for the prediction of treatment response in depression. J Affect Disord. Feb 15, 2021;281:618-622. [CrossRef] [Medline]7], and it is described differently across cultures [Kalibatseva Z, Leong FT. Cultural factors, depressive and somatic symptoms among Chinese American and European American college students. J Cross Cult Psychol. Sep 29, 2018;49(10):1556-1572. [CrossRef]8]. In addition, there is social stigma and perceived conflict with normative social roles that prevent many patients from being honest about their thoughts and feelings [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6].

Nursing Care for Depression

Second to social work, nursing is the largest profession in the mental health workforce [Phoenix BJ, Hurd M, Chapman SA. Experience of psychiatric mental health nurse practitioners in public mental health. Nurs Adm Q. 2016;40(3):212-224. [CrossRef] [Medline]9]. In 2013, it was estimated that 4% of the total registered nursing workforce provided mental health care, and in 2015, the number was estimated by the National Nursing Workforce Survey to be 134,000 registered nurses [Phoenix BJ, Hurd M, Chapman SA. Experience of psychiatric mental health nurse practitioners in public mental health. Nurs Adm Q. 2016;40(3):212-224. [CrossRef] [Medline]9]. Advanced practice registered nurses are a vital part of the mental health workforce, especially in rural areas where there are few licensed mental health professionals with prescribing capabilities [Phoenix BJ, Hurd M, Chapman SA. Experience of psychiatric mental health nurse practitioners in public mental health. Nurs Adm Q. 2016;40(3):212-224. [CrossRef] [Medline]9].

Genomics of Depression

Owing to multilevel biases around diagnoses of depression, including implicit bias of providers, social desirability bias of patients, and bias introduced by data processing, alternative methods for an objective biologically informed diagnosis are being explored [Chao YS, Lin KF, Wu CJ, Wu HC, Hsu HT, Tsao LC, et al. Simulation study to demonstrate biases created by diagnostic criteria of mental illnesses: major depressive episodes, dysthymia, and manic episodes. BMJ Open. Nov 10, 2020;10(11):e037022. [FREE Full text] [CrossRef] [Medline]10,Zhao S, Bao Z, Zhao X, Xu M, Li MD, Yang Z. Identification of diagnostic markers for major depressive disorder using machine learning methods. Front Neurosci. Jun 18, 2021;15:645998. [FREE Full text] [CrossRef] [Medline]11]. Currently, biomarkers, such as single nucleotide polymorphisms (SNPs), messenger RNA (mRNA), microRNA, proteins, and methylated DNA, are being sequenced and combined with scores on standardized depression instruments to evaluate whether they can improve the sensitivity and specificity of a depression diagnosis. Ideally, biomarker profiling would be performed on brain tissue, as it offers valuable insights into the underlying neurobiological mechanisms [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. However, brain biopsies are dangerously invasive, so peripheral blood or saliva is often used as an alternative sample type [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. Importantly, recent studies have shown a high correlation in gene expression and methylation patterns between blood and saliva samples and brain tissue, supporting the utility of peripheral samples as valuable surrogates for understanding the molecular mechanisms underlying depression [Braun PR, Han S, Hing B, Nagahama Y, Gaul LN, Heinzman JT, et al. Genome-wide DNA methylation comparison between live human brain and peripheral tissues within individuals. Transl Psychiatry. Jan 31, 2019;9(1):47. [FREE Full text] [CrossRef] [Medline]12-Nishitani S, Isozaki M, Yao A, Higashino Y, Yamauchi T, Kidoguchi M, et al. Cross-tissue correlations of genome-wide DNA methylation in Japanese live human brain and blood, saliva, and buccal epithelial tissues. Transl Psychiatry. Feb 27, 2023;13(1):72. [FREE Full text] [CrossRef] [Medline]14]. Therefore, this study focuses on studies that use blood or saliva sample types for the diagnosis of depression.

The heritability of depression is estimated to be 40%, and many studies have been performed to identify genetic variants or SNPs that are associated with depression [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15,Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]16]. Genomic analysis can be performed through genome-wide association studies (GWASs). The 2 types of GWAS are classical and functional. Classical GWAS identifies SNPs that are associated with specific traits or diseases [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15]. Functional GWAS determines how SNPs overlap with regulatory elements such as enhancers and promotors and predicts how these SNPs function [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15]. A GWAS of samples in the Taiwan Biobank identified SNPs in 17 different genes that were significantly associated with depression [Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]16]. Results from GWAS analyses suggest that depression is a polygenic disorder, meaning many SNPs can affect the hereditary influence [Di Y, Wang J, Liu X, Zhu T. Combining polygenic risk score and voice features to detect major depressive disorders. Front Genet. Dec 20, 2021;12:761141. [FREE Full text] [CrossRef] [Medline]4]. SNPs identified through GWASs can be used to compute polygenic risk scores [Di Y, Wang J, Liu X, Zhu T. Combining polygenic risk score and voice features to detect major depressive disorders. Front Genet. Dec 20, 2021;12:761141. [FREE Full text] [CrossRef] [Medline]4]. Polygenic risk scores combine the effects of genetic variants into an overall score that reflects an individual’s propensity for a disease [Schultebraucks K, Choi KW, Galatzer-Levy IR, Bonanno GA. Discriminating heterogeneous trajectories of resilience and depression after major life stressors using polygenic scores. JAMA Psychiatry. Jul 01, 2021;78(7):744-752. [FREE Full text] [CrossRef] [Medline]17].

Transcriptomics of Depression

The transcriptome is all of the body’s mRNA and contains coding instructions for protein synthesis [Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]18,Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19]. Transcriptome analysis is useful for measuring gene expression. Recently developed sequencing techniques allow the expression levels of thousands of transcripts to be measured simultaneously [Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19]. Differentially expressed genes (DEGs) in patients with depression and healthy controls have been identified in both peripheral blood samples and brain tissues [Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]18].

Epigenomics of Depression

Epigenetics leads to heritable changes in gene expression without affecting the underlying genetic sequences [Yao Q, Chen Y, Zhou X. The roles of microRNAs in epigenetic regulation. Curr Opin Chem Biol. Aug 2019;51:11-17. [CrossRef] [Medline]20]. Studies have shown that epigenetics may be as influential as genetic variants in the development of depression [Chen D, Meng L, Pei F, Zheng Y, Leng J. A review of DNA methylation in depression. J Clin Neurosci. Sep 2017;43:39-46. [CrossRef] [Medline]21]. Two types of epigenetic modifiers are DNA methylation (DNAm) and microRNA. DNAm occurs at sites in the genetic sequence where the nucleotides cytosine and guanine are bound together in clusters known as cytosine-phosphodiester bond-guanine (CpG) islands [Chen D, Meng L, Pei F, Zheng Y, Leng J. A review of DNA methylation in depression. J Clin Neurosci. Sep 2017;43:39-46. [CrossRef] [Medline]21]. DNAm is responsive to environmental stimuli and can affect gene expression by inhibiting the transcription of affected genes [Chen D, Meng L, Pei F, Zheng Y, Leng J. A review of DNA methylation in depression. J Clin Neurosci. Sep 2017;43:39-46. [CrossRef] [Medline]21]. MicroRNAs are small, noncoding RNAs up to 25 nucleotides in length [Yao Q, Chen Y, Zhou X. The roles of microRNAs in epigenetic regulation. Curr Opin Chem Biol. Aug 2019;51:11-17. [CrossRef] [Medline]20]. Unlike mRNA, they are not translated into protein. Instead, they bind to mRNA to suppress protein translation, leading to decreased gene expression [Yao Q, Chen Y, Zhou X. The roles of microRNAs in epigenetic regulation. Curr Opin Chem Biol. Aug 2019;51:11-17. [CrossRef] [Medline]20]. The effects of several microRNAs have been found to be upregulated or downregulated in individuals with depression [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1].

In some studies, >1 sequencing method is used on the samples to produce different types of omics data. In the multiomics study by Bhak et al [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6], blood samples were sequenced using Methyl-Seq to produce epigenomic data and RNA-Seq to produce transcriptomic data. Using these data, the authors were able to distinguish between people with depression who have attempted suicide, people with depression who have not attempted suicide, and healthy controls [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. Combining >1 omics data type can improve prediction accuracy [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6].

Microbiomics of Depression

The diversity of microbiota in the gut is influenced by genetics, development, and environment [Limbana T, Khan F, Eskander N. Gut microbiome and depression: how microbes affect the way we think. Cureus. Aug 23, 2020;12(8):e9966. [FREE Full text] [CrossRef] [Medline]22]. In the gut microbiome, the gut microbiota transmit signals to the brain through pathways associated with neural transmission and control of behaviors [Limbana T, Khan F, Eskander N. Gut microbiome and depression: how microbes affect the way we think. Cureus. Aug 23, 2020;12(8):e9966. [FREE Full text] [CrossRef] [Medline]22]. Depression has been associated with gut dysbiosis, an imbalance of the gut microbiota that is associated with adverse health outcomes [Martinez JE, Kahana DD, Ghuman S, Wilson HP, Wilson J, Kim SC, et al. Unhealthy lifestyle and gut dysbiosis: a better understanding of the effects of poor diet and nicotine on the intestinal microbiome. Front Endocrinol (Lausanne). 2021;12:667066. [FREE Full text] [CrossRef] [Medline]23,Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24]. Some strains of bacteria have been associated with depression in multiple studies, including Eggerthella, Subdoligranulum, Coprococcus, and Ruminococcaceae [Radjabzadeh D, Bosch JA, Uitterlinden AG, Zwinderman AH, Ikram MA, van Meurs JBJ, et al. Gut microbiome-wide association study of depressive symptoms. Nat Commun. Dec 06, 2022;13(1):7128. [FREE Full text] [CrossRef] [Medline]25]. Furthermore, studies have found differences in metabolic pathways between individuals with depression and healthy controls [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24].

Machine Learning Methods to Identify Individuals With Depression From Omics Data

Omics data are inherently complex and often too large for manual evaluation [Sekaran K, Sudha M. Prediction of lipopolysaccharides simulation responsiveness on gene expression profiles of major depression disorder affected cases using machine learning. Int J Sci Technol Res. 2019;8(11):21-24. [FREE Full text]26]. Machine learning, a form of artificial intelligence, is useful for detecting subtle patterns in large data sets, allowing it to predict multifactorial diseases [Zhao S, Bao Z, Zhao X, Xu M, Li MD, Yang Z. Identification of diagnostic markers for major depressive disorder using machine learning methods. Front Neurosci. Jun 18, 2021;15:645998. [FREE Full text] [CrossRef] [Medline]11,Fan R, Hua T, Shen T, Jiao Z, Yue Q, Chen B, et al. Identifying patients with major depressive disorder based on tryptophan hydroxylase-2 methylation using machine learning algorithms. Psychiatry Res. Dec 2021;306:114258. [CrossRef] [Medline]27]. By training algorithms on data, machine learning models identify patterns and make predictions that may be beyond human capabilities [Musolf AM, Holzinger ER, Malley JD, Bailey-Wilson JE. What makes a good prediction? Feature importance and beginning to open the black box of machine learning in genetics. Hum Genet. Sep 04, 2022;141(9):1515-1528. [FREE Full text] [CrossRef] [Medline]28]. Machine learning algorithms can be supervised, where the algorithm learns from labeled training data to make predictions in unlabeled testing data, or unsupervised, where there is no labeling, and the algorithm categorizes the data into groups or finds complex patterns [Shatte AB, Hutchinson DM, Teague SJ. Machine learning in mental health: a scoping review of methods and applications. Psychol Med. Jul 2019;49(9):1426-1448. [CrossRef] [Medline]29].

Machine learning models are being investigated to aid in the development of predictive algorithms to help understand how genetic variation can affect disease status [Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]16]. A key aspect of machine learning is feature selection, which helps determine the importance of each feature and its contribution to the model’s performance during training; in omics data, features can encompass various entities, such as SNPs, DEGs, or DNAm sites [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. Machine learning can be useful for analyzing transcriptomic data because traditional statistical methods may not fully capture molecular interactions between genes [Ciobanu LG, Sachdev PS, Trollor JN, Reppermund S, Thalamuthu A, Mather KA, et al. Downregulated transferrin receptor in the blood predicts recurrent MDD in the elderly cohort: a fuzzy forests approach. J Affect Disord. Apr 15, 2020;267:42-48. [CrossRef] [Medline]30].

Through machine learning, researchers can not only identify genes associated with a specific disease but also explore linear and nonlinear gene interactions [Ciobanu LG, Sachdev PS, Trollor JN, Reppermund S, Thalamuthu A, Mather KA, et al. Downregulated transferrin receptor in the blood predicts recurrent MDD in the elderly cohort: a fuzzy forests approach. J Affect Disord. Apr 15, 2020;267:42-48. [CrossRef] [Medline]30]. While there is great potential in using machine learning to advance omics knowledge on depression, no prior studies have summarized the machine learning methods used to analyze omics data for depression. Therefore, this scoping review aims to provide an overview of the existing literature on using machine learning methods to analyze omics data to identify individuals with depression.


This scoping review was reported following the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews) guidelines [Tricco AC, Lillie E, Zarin W, O'Brien KK, Colquhoun H, Levac D, et al. PRISMA extension for scoping reviews (PRISMA-ScR): checklist and explanation. Ann Intern Med. Oct 02, 2018;169(7):467-473. [FREE Full text] [CrossRef] [Medline]31].

Search Strategies

Searches were conducted in 3 databases between November and December 2022: PubMed, CINAHL, and Scopus. The search strategy used terms representing machine learning; depression; and different types of omics, including genomics, transcriptomics, and epigenomics (

Multimedia Appendix 1

Search strategy and keywords.

DOCX File , 23 KBMultimedia Appendix 1). Keywords were combined using Boolean operators.

Selection Criteria

After deduplication, 3 independent reviewers (BT, MH, and SN) conducted pairwise screening of titles and abstracts with specific inclusion and exclusion criteria using Covidence (Veritas Health Innovation) systematic review web software. This resulted in a set of papers for full-text review that were also reviewed pairwise, with disagreements resolved by consensus. Specific inclusion criteria consisted of studies published in peer-reviewed journals, English, and the past 5 years (ie, between January 1, 2017, and December 31, 2022). Publication dates were limited to the past 5 years because genetic sequencing is constantly evolving, and older studies may have used outdated methods [Koch L, Potenski C, Trenkmann M. Sequencing moves to the twenty-first century. Nature. 2021. URL: https://www.nature.com/articles/d42859-020-00100-w [accessed 2024-04-29] 32]. Furthermore, all studies had to include (1) an omics method involving the sequencing of genetic material to identify depression and (2) an approach that used machine learning or deep learning to analyze the omics data. Papers were excluded if they focused on omics methods that did not involve sequencing of genetic material, such as metabolomics and lipidomics. In addition, review papers; deep learning studies of medical images; and studies focusing on other disorders, such as bipolar disorder, anxiety disorder, posttraumatic stress disorder, and schizophrenia, were excluded.

Any disagreements between screeners were discussed and resolved through consensus. After the initial screening, full texts of the remaining papers were reviewed. Reference lists were also screened to identify any additional papers meeting the inclusion criteria. Covidence software was used throughout the screening process. Data charting was completed for the eligible studies using Word (Microsoft Corp).

Data Extraction

Items extracted included author, year, study design, and sample size. Data extracted included the omics type, machine learning method, sample type, and depression screening instrument used. Charted data were synthesized by grouping studies according to their omics method (eg, genomics and transcriptomics).

Critical appraisal was performed using the Joanna Briggs Institute Critical Appraisal Checklist for Analytical Cross-Sectional Studies [Checklist for analytical cross sectional studies. The Joanna Briggs Institute. 2020. URL: http://joannabriggs.org/research/critical-appraisal-tools.html [accessed 2022-07-20] 33]. This checklist was chosen because the genomic data in the studies included in this review were analyzed at a single point in time [Wang X, Cheng Z. Cross-sectional studies: strengths, weaknesses, and recommendations. Chest. Jul 2020;158(1S):S65-S71. [CrossRef] [Medline]34]. The checklist appraises inclusion criteria, measurement of exposure and outcomes, confounding, and statistical analysis. Questions are answered as yes, no, unclear, or not applicable [Checklist for analytical cross sectional studies. The Joanna Briggs Institute. 2020. URL: http://joannabriggs.org/research/critical-appraisal-tools.html [accessed 2022-07-20] 33].


Search Summary

The initial database search yielded 964 papers; 266 (27.6%) papers were removed as duplicates. Of the 964 papers, the titles and abstracts of 698 (72.4%) papers were screened for eligibility. A priori exclusion criteria were applied throughout the title and abstract screening of the 698 papers, and 668 (95.7%) papers were excluded. Of the 698 papers, 30 (4.3%) met the criteria for full-text review and were assessed for eligibility, of which 15 (50%) were included in this scoping review. This screening process is visualized in a PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) flow diagram (Figure 1).

Figure 1. PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) flow diagram.

Summary of Study Characteristics

The included studies were published between 2017 and 2022. The studies were conducted in 8 countries: Germany (1/15, 7%), South Korea (1/15, 7%), Australia (1/15, 7%), China (1/15, 7%), Taiwan (1/15, 7%), Canada (2/15, 13%), United States (6/15, 40%), Japan (1/15, 7%), and India (1/15, 7%). All the studies were cross-sectional design studies. The studies addressed genomics (5/15, 33%), transcriptomics (5/15, 33%), epigenomics (3/15, 20%), multiomics (1/15, 7%), and microbiomics (1/15, 7%). Machine learning methods included random forest, support vector machine, k-nearest neighbor, artificial neural network, and deep learning. Study characteristics are further described in Table 1.

Table 1. Study characteristics.
Type of omics and studyCountrySample size, nAge rangeDepression diagnosisScreening instrument
Genomics

Arabnejad et al [Arabnejad M, Dawkins BA, Bush WS, White BC, Harkness AR, McKinney BA. Transition-transversion encoding and genetic relationship metric in ReliefF feature selection improves pathway enrichment in GWAS. BioData Min. Nov 3, 2018;11(1):23. [FREE Full text] [CrossRef] [Medline]35], 2018United States922 (463 cases and 459 controls)Not givenScreening
  • Composite International Diagnostic Interview–Short Form
  • Structured Clinical Interview for DSM-IVa
  • Patient Health Questionnaire-9

Arloth et al [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15], 2020Germany3514 (1476 cases and 2038 controls)Not givenNot given
  • Not given

Lin et al [Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]16], 2021Taiwan9828 (2457 cases and 7371 controls)Mean 51.2 (SD 10.4) yearsPsychiatrist
  • Patient Health Questionnaire

Sekaran and Sudha [Sekaran K, Sudha M. Prediction of lipopolysaccharides simulation responsiveness on gene expression profiles of major depression disorder affected cases using machine learning. Int J Sci Technol Res. 2019;8(11):21-24. [FREE Full text]26], 2019United States100 (66 cases and 34 controls)Not givenNot given
  • Not given

Takahashi et al [Takahashi Y, Ueki M, Tamiya G, Ogishima S, Kinoshita K, Hozawa A, et al. Machine learning for effectively avoiding overfitting is a crucial strategy for the genetic prediction of polygenic psychiatric phenotypes. Transl Psychiatry. Aug 17, 2020;10(1):294. [FREE Full text] [CrossRef] [Medline]36], 2020Japan6733 (185 cases and 6548 controls)Mean 60 (SD 11) yearsNot given
  • Center for Epidemiological Studies–Depression Scale
Transcriptomics

Ciobanu et al [Ciobanu LG, Sachdev PS, Trollor JN, Reppermund S, Thalamuthu A, Mather KA, et al. Downregulated transferrin receptor in the blood predicts recurrent MDD in the elderly cohort: a fuzzy forests approach. J Affect Disord. Apr 15, 2020;267:42-48. [CrossRef] [Medline]30], 2020Australia521 (27 cases and 494 controls)70 to 90 yearsScreening
  • Geriatric Depression Scale
  • Patient Health Questionnaire
  • Neuropsychiatric Inventory

Le et al [Le TT, Fu W, Moore JH. Scaling tree-based automated machine learning to biomedical big data with a feature set selector. Bioinformatics. Jan 01, 2020;36(1):250-256. [FREE Full text] [CrossRef] [Medline]37], 2020United States157 (78 cases and 79 controls)Not givenPsychiatrist
  • Montgomery-Asberg Depression Rating Scale

Parvandeh et al [Parvandeh S, Yeh HW, Paulus MP, McKinney BA. Consensus features nested cross-validation. Bioinformatics. May 01, 2020;36(10):3093-3098. [FREE Full text] [CrossRef] [Medline]38], 2020United States915 (463 cases and 452 controls)Not givenScreening
  • Composite International Diagnostic Interview–Short Form
  • Structured Clinical Interview for DSM-IV
  • Patient Health Questionnaire-9

Qi et al [Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]18], 2021Canada2295 (1765 cases and 530 controls)>18 yearsNot given
  • Not given

Verma and Shakya [Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19], 2022India59 (30 cases and 29 controls)Not givenNot given
  • Not given
Epigenomics

Fan et al [Fan R, Hua T, Shen T, Jiao Z, Yue Q, Chen B, et al. Identifying patients with major depressive disorder based on tryptophan hydroxylase-2 methylation using machine learning algorithms. Psychiatry Res. Dec 2021;306:114258. [CrossRef] [Medline]27], 2021China391 (291 cases and 100 controls)18 to 65 yearsPsychiatrist
  • Hamilton Rating Scale for Depression-17

Payne et al [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39], 2020United States267 (54 cases and 213 controls)Not givenScreening
  • Edinburgh Postnatal Depression Scale

Qi et al [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1], 2020Canada168 (140 cases and 28 controls)Not givenPsychiatrist
  • Montgomery-Asberg Depression Rating Scale
Microbiomics

Stevens et al [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24], 2021United States40 (20 cases and 20 controls)Not givenPsychiatrist
  • None
Multiomics

Bhak et al [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6], 2019South Korea182 (95 cases and 87 controls)19 to 46 yearsPsychiatrist
  • Hamilton Rating Scale for Depression-17

aDSM-IV: Diagnostic and Statistical Manual of Mental Disorders (Fourth Edition).

Genomics

One study combined classical and functional GWASs and annotated SNPs based on their regulatory potential and combination with a functional unit (FU) [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15]. This method is called a multivariate FU-wide association study (DeepWAS) [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15]. A DeepWAS can identify SNPs associated with a disease (dSNPs) [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15]. A DeepWAS successfully identified 61 dSNPs in 237 FUs that were associated with depression; 60 (25.3%) of these dSNPs were significant (Table 2) [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15]. To validate these results, the dSNPs were compared to SNPs identified by other GWASs [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15]. A total of 4 dSNPs overlapped with a large GWAS by the UK Biobank: the LARP6-LRRC49 gene, 2 intergenic regions near the WNT2 and ASZ1 genes, the ATG9B and ABCB8 genes on chromosome 7, and a site near the C1orf220 and MIR4424 genes on chromosome 1 [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15]. In addition, the DeepWAS identified an SNP on the transcription factor binding site of MEF2C on chromosome 8 as a regulator for depression [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15]. The GWAS using data collected from 2 prefectures in Japan included 102 SNPs in the model with the highest prediction accuracy [Takahashi Y, Ueki M, Tamiya G, Ogishima S, Kinoshita K, Hozawa A, et al. Machine learning for effectively avoiding overfitting is a crucial strategy for the genetic prediction of polygenic psychiatric phenotypes. Transl Psychiatry. Aug 17, 2020;10(1):294. [FREE Full text] [CrossRef] [Medline]36]. However, none of these variants were significant at the 5.0×10–8 level, and the top 11 variants only explained 0.0036% of the variance in the validation data set, which is a very small effect size [Takahashi Y, Ueki M, Tamiya G, Ogishima S, Kinoshita K, Hozawa A, et al. Machine learning for effectively avoiding overfitting is a crucial strategy for the genetic prediction of polygenic psychiatric phenotypes. Transl Psychiatry. Aug 17, 2020;10(1):294. [FREE Full text] [CrossRef] [Medline]36].

Using data from the Taiwan Biobank, a novel SNP, rs192922209, located in the intron region of the FBN1 gene on chromosome 15, was associated with depression [Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]16]. In addition, a novel SNP was associated with depression in female individuals: rs114542799 in the intron region of the ALDH1L1 gene on chromosome 3 [Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]16]. Furthermore, this study identified 17 SNPs with potential roles as expression quantitative trait loci [Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]16]. Arabnejad et al [Arabnejad M, Dawkins BA, Bush WS, White BC, Harkness AR, McKinney BA. Transition-transversion encoding and genetic relationship metric in ReliefF feature selection improves pathway enrichment in GWAS. BioData Min. Nov 3, 2018;11(1):23. [FREE Full text] [CrossRef] [Medline]35] used GWAS data to identify significant SNPs and their associated genes to test for pathways that overlap with depression. They identified the top 500 SNPs using different feature selection methods and compared the number of genes detected to the biological pathways [Arabnejad M, Dawkins BA, Bush WS, White BC, Harkness AR, McKinney BA. Transition-transversion encoding and genetic relationship metric in ReliefF feature selection improves pathway enrichment in GWAS. BioData Min. Nov 3, 2018;11(1):23. [FREE Full text] [CrossRef] [Medline]35]. Pathways that previous studies have associated with depression were reported: axon guidance pathway, neuronal system pathway, and pathways related to G protein–coupled receptors, which affect neurotransmitter signaling [Arabnejad M, Dawkins BA, Bush WS, White BC, Harkness AR, McKinney BA. Transition-transversion encoding and genetic relationship metric in ReliefF feature selection improves pathway enrichment in GWAS. BioData Min. Nov 3, 2018;11(1):23. [FREE Full text] [CrossRef] [Medline]35].

Sekaran and Sudha [Sekaran K, Sudha M. Prediction of lipopolysaccharides simulation responsiveness on gene expression profiles of major depression disorder affected cases using machine learning. Int J Sci Technol Res. 2019;8(11):21-24. [FREE Full text]26] aimed to identify genetic variants related to depression by using DNA microarrays. Sample participants were classified into 3 categories: patients with depression with lipopolysaccharide treatment, patients with depression without lipopolysaccharide treatment, and healthy controls [Sekaran K, Sudha M. Prediction of lipopolysaccharides simulation responsiveness on gene expression profiles of major depression disorder affected cases using machine learning. Int J Sci Technol Res. 2019;8(11):21-24. [FREE Full text]26]. A total of 27 genetic biomarkers associated with depression were identified; the biomarker A_23_P109436, was able to classify the data with the highest precision [Sekaran K, Sudha M. Prediction of lipopolysaccharides simulation responsiveness on gene expression profiles of major depression disorder affected cases using machine learning. Int J Sci Technol Res. 2019;8(11):21-24. [FREE Full text]26].

Table 2. Study findings.
Type of omics and studySample typeKey findings
Genomics

Arabnejad et al [Arabnejad M, Dawkins BA, Bush WS, White BC, Harkness AR, McKinney BA. Transition-transversion encoding and genetic relationship metric in ReliefF feature selection improves pathway enrichment in GWAS. BioData Min. Nov 3, 2018;11(1):23. [FREE Full text] [CrossRef] [Medline]35], 2018Blood
  • Detected pathways associated with depression, including axon guidance, neuronal system, and G protein–coupled receptor signaling

Arloth et al [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15], 2020Not given
  • Identified 61 dSNPsa in 237 FUsb; 60 of the dSNPs were significant
  • A total of 4 dSNPs were also found in a GWASc by the UK Biobank
  • A SNPd on the MEF2C gene was identified as a regulator for depression

Lin et al [Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]16], 2021Blood
  • This study identified a novel SNP on the FBN1 gene associated with depression
  • A novel SNP on the ALDH1L1 was associated with depression in female individuals
  • A total of 17 SNPs with potential roles as expression quantitative trait loci were pinpointed

Sekaran and Sudha [Sekaran K, Sudha M. Prediction of lipopolysaccharides simulation responsiveness on gene expression profiles of major depression disorder affected cases using machine learning. Int J Sci Technol Res. 2019;8(11):21-24. [FREE Full text]26], 2019Not given
  • Identified 27 genetic biomarkers associated with depression
  • A biomarker, A_23_P109436, classified the data with the highest precision

Takahashi et al [Takahashi Y, Ueki M, Tamiya G, Ogishima S, Kinoshita K, Hozawa A, et al. Machine learning for effectively avoiding overfitting is a crucial strategy for the genetic prediction of polygenic psychiatric phenotypes. Transl Psychiatry. Aug 17, 2020;10(1):294. [FREE Full text] [CrossRef] [Medline]36], 2020Blood
  • The model with the highest prediction accuracy included 102 SNPs
  • None of these SNPs were significant at the 5.0×10–8 level
Transcriptomics

Ciobanu et al [Ciobanu LG, Sachdev PS, Trollor JN, Reppermund S, Thalamuthu A, Mather KA, et al. Downregulated transferrin receptor in the blood predicts recurrent MDD in the elderly cohort: a fuzzy forests approach. J Affect Disord. Apr 15, 2020;267:42-48. [CrossRef] [Medline]30], 2020Blood
  • Downregulation of the transferrin receptor gene is associated with depression

Le et al [Le TT, Fu W, Moore JH. Scaling tree-based automated machine learning to biomedical big data with a feature set selector. Bioinformatics. Jan 01, 2020;36(1):250-256. [FREE Full text] [CrossRef] [Medline]37], 2020Blood
  • Identified 23 depression gene modules

Parvandeh et al [Parvandeh S, Yeh HW, Paulus MP, McKinney BA. Consensus features nested cross-validation. Bioinformatics. May 01, 2020;36(10):3093-3098. [FREE Full text] [CrossRef] [Medline]38], 2020Blood
  • The best performing model had a significant overlap of 959 genes with the initial 7616 genes (P<.001)

Qi et al [Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]18], 2021Brain and blood
  • Analysis of brain mRNAe revealed 62 DEGsf used to distinguish cases from controls
  • Analysis of blood mRNA found 1376 DEGs

Verma and Shakya [Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19], 2022Blood
  • A total of 624 transcripts correlated with the classification of patients with depression who died by suicide, those who did not die by suicide, and healthy controls
Epigenomics

Fan et al [Fan R, Hua T, Shen T, Jiao Z, Yue Q, Chen B, et al. Identifying patients with major depressive disorder based on tryptophan hydroxylase-2 methylation using machine learning algorithms. Psychiatry Res. Dec 2021;306:114258. [CrossRef] [Medline]27], 2021Blood
  • Identified 9 differentially methylated sites on the tryptophan hydroxylase-2 gene

Payne et al [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39], 2020Blood
  • Found that DNAmg in the first trimester could accurately predict depression in the third trimester
  • Third-trimester DNAm predicted postpartum depression

Qi et al [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1], 2020Blood
  • A total of 4 microRNAs differed significantly, but these differences were not significant
Microbiomics

Stevens et al [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24], 2021Stool
  • Found decreased amounts of Faecalibacterium, Ruminococcus, Lachnospiraceae, and Bacterioides species in the microbiomes of the individuals in the group with depressive symptoms
Multiomics

Bhak et al [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6], 2019Blood
  • Identified 48 DEGs and 810 differentially methylated sites that significantly correlated with depression scores

adSNPs: single nucleotide polymorphisms associated with a disease.

bFU: functional unit.

cGWAS: genome-wide association study.

dSNP: single nucleotide polymorphism.

emRNA: messenger RNA.

fDEG: differentially expressed gene.

gDNAm: DNA methylation.

Transcriptomics

Ciobanu et al [Ciobanu LG, Sachdev PS, Trollor JN, Reppermund S, Thalamuthu A, Mather KA, et al. Downregulated transferrin receptor in the blood predicts recurrent MDD in the elderly cohort: a fuzzy forests approach. J Affect Disord. Apr 15, 2020;267:42-48. [CrossRef] [Medline]30] used transcriptomic data to identify a link between depression and the transferrin receptor gene on chromosome 3. When downregulated, this gene is associated with recurrent depression [Ciobanu LG, Sachdev PS, Trollor JN, Reppermund S, Thalamuthu A, Mather KA, et al. Downregulated transferrin receptor in the blood predicts recurrent MDD in the elderly cohort: a fuzzy forests approach. J Affect Disord. Apr 15, 2020;267:42-48. [CrossRef] [Medline]30]. In the study by Verma and Shakya [Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19], differential gene expression was examined between patients with depression who died by suicide, those who did not die by suicide, and healthy controls. A total of 624 transcripts were found to be biologically and functionally related to classifying the 3 categories [Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19]. Most of these transcripts were associated with neurotransmitter receptors, postsynaptic signal transmission, synaptic depression, gamma-aminobutyric acid receptor activation, and glutamatergic synapse [Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19].

Using RNA sequence data, Parvandeh et al [Parvandeh S, Yeh HW, Paulus MP, McKinney BA. Consensus features nested cross-validation. Bioinformatics. May 01, 2020;36(10):3093-3098. [FREE Full text] [CrossRef] [Medline]38] aimed to classify patients with depression and healthy controls. They analyzed 7616 genes that are known to be associated with depression based on prior studies; these genes were compared to a repository of genes associated with mental disorders from the DisGeNET platform [Parvandeh S, Yeh HW, Paulus MP, McKinney BA. Consensus features nested cross-validation. Bioinformatics. May 01, 2020;36(10):3093-3098. [FREE Full text] [CrossRef] [Medline]38]. The best performing model had an overlap of 959 genes with the initial 7616 genes and P<.001, indicating significant overlap [Parvandeh S, Yeh HW, Paulus MP, McKinney BA. Consensus features nested cross-validation. Bioinformatics. May 01, 2020;36(10):3093-3098. [FREE Full text] [CrossRef] [Medline]38]. Using brain mRNA to discriminate between cases and controls, the best performing model identified 62 DEGs [Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]18]. These genes were associated with upregulation of metalloaminopeptidase activity, downregulation of oxidoreductase activity, and upregulation of aminopeptidase activity [Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]18]. Furthermore, this study used blood mRNA to identify 1376 DEGs associated with depression [Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]18]. RNA-Seq Rdata was used to identify depression gene modules (DGMs), genes that are interconnected and coexpressed, and predict a clinical diagnosis of depression [Le TT, Fu W, Moore JH. Scaling tree-based automated machine learning to biomedical big data with a feature set selector. Bioinformatics. Jan 01, 2020;36(1):250-256. [FREE Full text] [CrossRef] [Medline]37]. A total of 23 DGMs were identified; DGM-5 was most predictive of depression diagnosis and was significantly associated with depression severity [Le TT, Fu W, Moore JH. Scaling tree-based automated machine learning to biomedical big data with a feature set selector. Bioinformatics. Jan 01, 2020;36(1):250-256. [FREE Full text] [CrossRef] [Medline]37].

Epigenomics

In the epigenetic study of postpartum depression by Payne et al [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39], the authors used DNAm biomarker profiles on the TTC9B and HP1BP3 genes to predict antenatal and postpartum depression [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39]. A total of 4 separate cohorts were included in this study, and blood samples were drawn during different trimesters of pregnancy [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39]. They found that DNAm biomarkers from samples collected during the first trimester could accurately predict depression in the third trimester [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39]. In addition, biomarker profiles in third-trimester samples predicted depression in the postpartum period [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39].

The DNAm study by Fan et al [Fan R, Hua T, Shen T, Jiao Z, Yue Q, Chen B, et al. Identifying patients with major depressive disorder based on tryptophan hydroxylase-2 methylation using machine learning algorithms. Psychiatry Res. Dec 2021;306:114258. [CrossRef] [Medline]27] focused on methylation of the tryptophan hydroxylase-2 gene, which functions in the production of serotonin. They identified 9 CpG sites on the tryptophan hydroxylase-2 gene that differ significantly between patients with depression and healthy controls [Fan R, Hua T, Shen T, Jiao Z, Yue Q, Chen B, et al. Identifying patients with major depressive disorder based on tryptophan hydroxylase-2 methylation using machine learning algorithms. Psychiatry Res. Dec 2021;306:114258. [CrossRef] [Medline]27]. In the microRNA study by Qi et al [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1], 4 microRNAs were found to differ significantly between patients with depression and healthy controls. However, none of these microRNAs remained significant after Bonferroni correction [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1].

Microbiomics

One study used genomic variants in the microbiome to distinguish between individuals with depression and healthy controls [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24]. After examining exact amplicon sequence variants, biological sequences that have been inferred through shotgun sequencing, the authors found decreased abundances of Faecalibacterium, Ruminococcus, Lachnospiraceae, and Bacterioides species in the microbiomes of the individuals in the depression group compared to those in the healthy group [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24]. Furthermore, they found that pathways involved in the degradation of the neurotransmitter gamma-aminobutyric acid and the fatty acid butyrate were more prominent in individuals with depression [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24].

Multiomics

The multiomics study using blood transcriptome and methylome data identified DEGs and differentially methylated sites (DMSs) in individuals with depression and controls [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. This study included 3 cohorts: 56 individuals with depression who attempted suicide, 39 individuals with depression who did not attempt suicide, and 87 healthy controls [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. A total of 80 DMSs were identified between individuals with depression who did not attempt suicide, and 95 DMSs and 7 DEGs were identified between individuals with depression who attempted suicide and controls [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. Between individuals with depression who did and did not attempt suicide, 69 DMSs were found [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. In addition, 48 DEGs and 810 DMSs were significantly correlated with scores on the Hamilton Rating Scale for Depression-17 [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. A functional enrichment test was conducted to investigate pathways associated with the model input features. A difference in enrichment was detected between depressed individuals who did not attempt suicide “and controls in the Hippo signaling pathway, which includes the Protein Kinase C gene on chromosome 2 and the Frizzled Class Receptor 7 gene on chromosome 1 [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. In addition, protocadherin genes were enriched in depressed individuals who attempted suicide compared to controls [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6].

Supervised Machine Learning

In an epigenomic study, linear discriminant analysis and support vector machine were used to predict depression in the first, second, or third trimester of pregnancy [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39]. Linear discriminant analysis predicted depression in the third trimester with an accuracy >70% and an area under the curve (AUC) of 0.72 (Table 3); similarly, support vector machine predictions for the same trimester had an accuracy of 72% and AUC of 0.83 [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39]. Support vector machine also successfully identified women with depression in the postpartum period with an AUC of 0.78; an AUC >0.5 indicates the model has some level of discriminatory ability and can adequately distinguish between cases and controls better than random chance [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39].

Table 3. Machine learning methods and performance metrics.
Type of omics, study, and machine learning methodAUCaAccuracySensitivitySpecificity
Genomics

Arabnejad et al [Arabnejad M, Dawkins BA, Bush WS, White BC, Harkness AR, McKinney BA. Transition-transversion encoding and genetic relationship metric in ReliefF feature selection improves pathway enrichment in GWAS. BioData Min. Nov 3, 2018;11(1):23. [FREE Full text] [CrossRef] [Medline]35], 2018b


ReliefFc


Random forest


Lasso regression

Arloth et al [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15], 2020


DeepWASd or DeepSEAe0.59-0.66

Lin et al [Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]16], 2021


Random forest0.820.760.76


Support vector machine0.760.760.76


Decision tree0.760.760.76


Logistic ridge regression0.820.760.76


LogitBoost0.820.760.76

Sekaran and Sudha [Sekaran K, Sudha M. Prediction of lipopolysaccharides simulation responsiveness on gene expression profiles of major depression disorder affected cases using machine learning. Int J Sci Technol Res. 2019;8(11):21-24. [FREE Full text]26], 2019


Bayesian network0.96f


Support vector machine0.73


Random forest0.91


Neural network0.72


Linear discriminant analysis0.70

Takahashi et al [Takahashi Y, Ueki M, Tamiya G, Ogishima S, Kinoshita K, Hozawa A, et al. Machine learning for effectively avoiding overfitting is a crucial strategy for the genetic prediction of polygenic psychiatric phenotypes. Transl Psychiatry. Aug 17, 2020;10(1):294. [FREE Full text] [CrossRef] [Medline]36], 2020g


Smooth-threshold multivariate genetic prediction


Genomics best linear unbiased prediction


Summary data–based best linear unbiased prediction


Bayes regression


Ridge regression
Transcriptomics

Ciobanu et al [Ciobanu LG, Sachdev PS, Trollor JN, Reppermund S, Thalamuthu A, Mather KA, et al. Downregulated transferrin receptor in the blood predicts recurrent MDD in the elderly cohort: a fuzzy forests approach. J Affect Disord. Apr 15, 2020;267:42-48. [CrossRef] [Medline]30], 2020


Fuzzy forest0.630.630.66

Le et al [Le TT, Fu W, Moore JH. Scaling tree-based automated machine learning to biomedical big data with a feature set selector. Bioinformatics. Jan 01, 2020;36(1):250-256. [FREE Full text] [CrossRef] [Medline]37], 2020


Tree-based pipeline optimization tool0.48-0.65


Extreme gradient boost0.49-0.59

Parvandeh et al [Parvandeh S, Yeh HW, Paulus MP, McKinney BA. Consensus features nested cross-validation. Bioinformatics. May 01, 2020;36(10):3093-3098. [FREE Full text] [CrossRef] [Medline]38], 2020


Consensus nested cross-validation0.59


Nested cross-validation0.56


Private evaporative cooling0.58


General Elastic net0.51

Qi et al [Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]18], 2021


Extreme gradient boost0.55-0.720.67-0.85


Logistic regression0.62-0.91

Verma and Shakya [Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19], 2022


Random forest0.39-0.61


K-nearest neighbor0.28-0.61
Epigenomics

Fan et al [Fan R, Hua T, Shen T, Jiao Z, Yue Q, Chen B, et al. Identifying patients with major depressive disorder based on tryptophan hydroxylase-2 methylation using machine learning algorithms. Psychiatry Res. Dec 2021;306:114258. [CrossRef] [Medline]27], 2021


Random forest0.79-0.910.69-0.780.65-0.740.81-0.92


Support vector machine0.57-0.860.50-0.850.41-0.830.49-0.88


Neural network0.78-0.990.75-0.970.78-0.980.49-0.95

Payne et al [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39], 2020


Support vector machine0.77-0.84


Linear discriminant analysis0.72

Qi et al [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1], 2020


Clustering0.49-0.97
Microbiomics

Stevens et al [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24], 2021


Random forest0.66-0.90
Multiomics

Bhak et al [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6], 2019


Random forest0.87-0.930.59-0.980.85-1

aAUC: area under the curve.

bMachine learning methods were evaluated based on the number of genes found in pathways implicated in mood disorders.

cNot reported.

dDeepWAS: multivariate functional unit–wide association study.

eDeepSEA: deep learning-based sequence analyzer.

fItalics represent the best performing models.

gThe only performance metrics given were partial correlation coefficients.

The GWAS of the Taiwan Biobank used 5 machine learning algorithms to build creative models incorporating SNPs and demographic information: logistic ridge regression, support vector machine, decision tree, LogitBoost, and random forest [Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]16]. Logistic ridge regression and LogitBoost had the best performance with an AUC >0.82 and sensitivity and specificity >0.76 [Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]16]. In the GWAS study by Takahashi et al [Takahashi Y, Ueki M, Tamiya G, Ogishima S, Kinoshita K, Hozawa A, et al. Machine learning for effectively avoiding overfitting is a crucial strategy for the genetic prediction of polygenic psychiatric phenotypes. Transl Psychiatry. Aug 17, 2020;10(1):294. [FREE Full text] [CrossRef] [Medline]36], the authors aimed to decrease overfitting by decreasing the number of null variants included in the model. They compared the performance of 6 different models: smooth-threshold multivariate genetic prediction, polygenic risk scores, genomic best linear unbiased prediction, summary data–based best linear unbiased prediction, a Bayesian hierarchical model for the analysis of complex traits, and ridge regression [Takahashi Y, Ueki M, Tamiya G, Ogishima S, Kinoshita K, Hozawa A, et al. Machine learning for effectively avoiding overfitting is a crucial strategy for the genetic prediction of polygenic psychiatric phenotypes. Transl Psychiatry. Aug 17, 2020;10(1):294. [FREE Full text] [CrossRef] [Medline]36]. The smooth-threshold multivariate genetic prediction had the highest prediction accuracy with a partial correlation of 0.05 and P value of <.005; this model also successfully reduced overfitting [Takahashi Y, Ueki M, Tamiya G, Ogishima S, Kinoshita K, Hozawa A, et al. Machine learning for effectively avoiding overfitting is a crucial strategy for the genetic prediction of polygenic psychiatric phenotypes. Transl Psychiatry. Aug 17, 2020;10(1):294. [FREE Full text] [CrossRef] [Medline]36]. The study by Sekaran and Sudha [Sekaran K, Sudha M. Prediction of lipopolysaccharides simulation responsiveness on gene expression profiles of major depression disorder affected cases using machine learning. Int J Sci Technol Res. 2019;8(11):21-24. [FREE Full text]26] used 5 different machine learning algorithms to identify genetic biomarkers: Bayesian network, support vector machine, random forest, back propagation neural network, and linear discriminant analysis. The accuracy of the Bayesian network and support vector machine was >90%; the accuracy of the other algorithms was <75% [Sekaran K, Sudha M. Prediction of lipopolysaccharides simulation responsiveness on gene expression profiles of major depression disorder affected cases using machine learning. Int J Sci Technol Res. 2019;8(11):21-24. [FREE Full text]26].

The transcriptomic study by Ciobanu et al [Ciobanu LG, Sachdev PS, Trollor JN, Reppermund S, Thalamuthu A, Mather KA, et al. Downregulated transferrin receptor in the blood predicts recurrent MDD in the elderly cohort: a fuzzy forests approach. J Affect Disord. Apr 15, 2020;267:42-48. [CrossRef] [Medline]30] combined a random forest classifier model with Weighted Gene Coexpression Network Analysis into an algorithm called fuzzy forest that identified an association between depression and the transferrin receptor gene. The fuzzy forest classifier was able to reduce the dimensionality of the transcriptomic data and allow a predictive marker of depression to be identified with a smaller sample size [Ciobanu LG, Sachdev PS, Trollor JN, Reppermund S, Thalamuthu A, Mather KA, et al. Downregulated transferrin receptor in the blood predicts recurrent MDD in the elderly cohort: a fuzzy forests approach. J Affect Disord. Apr 15, 2020;267:42-48. [CrossRef] [Medline]30]. In a transcriptomic study using brain tissue, extreme gradient boost (XGBoost) was chosen for its feature selection and reduction characteristics and ability to rank features by importance [Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]18]. The AUC for the best performing model was 0.72 [Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]18]. Furthermore, XGBoost was used in the transcriptomic study by Le et al [Le TT, Fu W, Moore JH. Scaling tree-based automated machine learning to biomedical big data with a feature set selector. Bioinformatics. Jan 01, 2020;36(1):250-256. [FREE Full text] [CrossRef] [Medline]37], and its performance was compared to 2 tree-based pipeline optimization tools (TPOTs). XGBoost produced an accuracy of 0.59, and the standard TPOT produced a similar accuracy of 0.60 [Le TT, Fu W, Moore JH. Scaling tree-based automated machine learning to biomedical big data with a feature set selector. Bioinformatics. Jan 01, 2020;36(1):250-256. [FREE Full text] [CrossRef] [Medline]37]. The TPOT combined with a feature set selector and the ability to slice the data into smaller subsets, produced the highest prediction accuracy of 0.68 [Le TT, Fu W, Moore JH. Scaling tree-based automated machine learning to biomedical big data with a feature set selector. Bioinformatics. Jan 01, 2020;36(1):250-256. [FREE Full text] [CrossRef] [Medline]37].

In the multiomics study by Bhak et al [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6], the authors used a random forest model and feature selection to analyze blood transcriptome and methylome data; this model correctly predicted the labels for suicide attempters and nonsuicide attempters with depression and controls. Scores on the Hamilton Rating Scale for Depression-17 were also correctly predicted by a linear regression model [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. The microbiomic study by Stevens et al [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24] used a random forest method to identify gut microbiome taxa and related metabolic pathways associated with depression. The R packages ALDEx2, DADA2, and PIME (R Foundation for Statistical Computing) analyzed the DNA sequences of the microbiota in stool samples to produce exact amplicon sequence variants, identify taxa associated with those variants using a Naive Bayes classifier, and filter the results into unique amplicon sequence variant sequences [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24]. This approach differentiated between individuals with depression and healthy controls, and the results were supported by multivariate analyses with a P value of <.001 and effect size >0.5 [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24]. Machine learning predicted metabolic pathways associated with the individuals in the depression and control groups with AUCs ranging from 0.66 to 0.9 [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24].

Verma et al [Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19] used random forest and k-nearest neighbor methods to analyze transcriptomic data and classify patients as depressed and died by suicide, depressed and did not die by suicide, and healthy controls. K-nearest neighbor stores all cases and classifies new cases based on their similarity [Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19]. Using random forest, the test data were classified with an accuracy of 61.11%, and the training data were classified with an accuracy of 97.56%; with k-nearest neighbor, the accuracy was 61.11% for test data and 76.6% for training data [Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19].

The GWAS using the top 500 SNPs to identify biological pathways associated with depression compared the performance of random forest; least absolute shrinkage and selection operator; and ReliefF, a nearest neighbors feature selection algorithm [Arabnejad M, Dawkins BA, Bush WS, White BC, Harkness AR, McKinney BA. Transition-transversion encoding and genetic relationship metric in ReliefF feature selection improves pathway enrichment in GWAS. BioData Min. Nov 3, 2018;11(1):23. [FREE Full text] [CrossRef] [Medline]35]. ReliefF was the best performing algorithm, likely due to its ability to detect statistical interactions, and this method identified most genes associated with biological pathways related to depression [Arabnejad M, Dawkins BA, Bush WS, White BC, Harkness AR, McKinney BA. Transition-transversion encoding and genetic relationship metric in ReliefF feature selection improves pathway enrichment in GWAS. BioData Min. Nov 3, 2018;11(1):23. [FREE Full text] [CrossRef] [Medline]35]. Furthermore, ReliefF was used in a transcriptomic study and was combined with different cross-validation methods [Parvandeh S, Yeh HW, Paulus MP, McKinney BA. Consensus features nested cross-validation. Bioinformatics. May 01, 2020;36(10):3093-3098. [FREE Full text] [CrossRef] [Medline]38]. The private evaporative cooling and general elastic net algorithms had the highest accuracy on the training data, but consensus nested cross-validation had the highest accuracy on the validation data as well as low overfitting [Parvandeh S, Yeh HW, Paulus MP, McKinney BA. Consensus features nested cross-validation. Bioinformatics. May 01, 2020;36(10):3093-3098. [FREE Full text] [CrossRef] [Medline]38].

In the study of microRNAs by Qi et al [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1], a regularized gradient boosted method was used to classify individuals with depression and healthy controls. The models were trained with cross-validation and 2500 iterations of parameter searches [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1]. The models were then retrained using the best parameters [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1]. The best model achieved an AUC of 0.93 [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1]. When classifying cases as normal to mild or moderate to severe, the best model achieved an AUC of 0.76 [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1].

Unsupervised Machine Learning

The study of microRNAs by Qi et al [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1] used an unsupervised clustering approach to differentiate individuals with depression from healthy controls. A total of 500 iterations of a k-means clustering method were applied to the data set [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1]. They obtained 2 clusters with similar sample sizes, both with an AUC >0.70 [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1].

Deep Learning

The DeepWAS study by Arloth et al [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15] used a deep learning method called deep learning-based sequence analyzer to predict the function of SNPs. Of >8 million SNPs analyzed; this method predicted 40,000 regulatory SNPs based on their affinity with an FU [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15]. The AUCs ranged from 0.59 to 0.66 [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15]. A regularized linear regression was used to determine which SNPs were associated with depression [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15].

The DNAm study by Fan et al [Fan R, Hua T, Shen T, Jiao Z, Yue Q, Chen B, et al. Identifying patients with major depressive disorder based on tryptophan hydroxylase-2 methylation using machine learning algorithms. Psychiatry Res. Dec 2021;306:114258. [CrossRef] [Medline]27] used a support vector machine, random forest, and a neural network to predict depression based on methylation of the tryptophan hydroxylase-2 gene. The neural network had the best performance with an AUC of 0.988, sensitivity of 98.3%, specificity of 95%, accuracy of 97.4%, and positive predictive value of 98.3% [Fan R, Hua T, Shen T, Jiao Z, Yue Q, Chen B, et al. Identifying patients with major depressive disorder based on tryptophan hydroxylase-2 methylation using machine learning algorithms. Psychiatry Res. Dec 2021;306:114258. [CrossRef] [Medline]27]. In addition, they found that models combining clinical variables with tryptophan hydroxylase-2 methylation performed better than models based on clinical variables or methylation alone [Fan R, Hua T, Shen T, Jiao Z, Yue Q, Chen B, et al. Identifying patients with major depressive disorder based on tryptophan hydroxylase-2 methylation using machine learning algorithms. Psychiatry Res. Dec 2021;306:114258. [CrossRef] [Medline]27].

Critical Appraisal

The studies’ strengths and weaknesses were identified using the Joanna Briggs Institute Critical Appraisal Checklist for Analytical Cross-Sectional Studies, as shown in Table 4. Of the 15 studies, only 2 (13%), Fan et al [Fan R, Hua T, Shen T, Jiao Z, Yue Q, Chen B, et al. Identifying patients with major depressive disorder based on tryptophan hydroxylase-2 methylation using machine learning algorithms. Psychiatry Res. Dec 2021;306:114258. [CrossRef] [Medline]27] and Qi et al [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1], clearly defined the criteria for inclusion in the sample. However, in all 15 studies, the individuals and setting were described in detail. A total of 47% (7/15) of the studies classified participants as experiencing depression but did not report how depression was measured or diagnosed. This may be due to the authors using data from biobanks and not having access to specific data about the participants.

The authors did not identify possible confounding factors in 11 (73%) of the 15 studies. However, it is typical that confounding is addressed when processing variables and during feature engineering, but it may not always be described as it is such a regular process. Therefore, the questions addressing confounding factors were marked “not applicable.” The study did not investigate the cause of depression or any associated diseases or disorders. Furthermore, those 11 studies did not present strategies to deal with confounding factors. The genomic outcomes were measured in a valid and reliable way in all the studies. The statistical analyses used seemed appropriate in all 15 studies.

Table 4. Joanna Briggs Institute Critical Appraisal Checklist for Analytical Cross-Sectional Studies.
QuestionArabnejad et al [Arabnejad M, Dawkins BA, Bush WS, White BC, Harkness AR, McKinney BA. Transition-transversion encoding and genetic relationship metric in ReliefF feature selection improves pathway enrichment in GWAS. BioData Min. Nov 3, 2018;11(1):23. [FREE Full text] [CrossRef] [Medline]35], 2018Arloth et al [Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]15], 2020Bhak et al [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6], 2019Ciobanu et al [Ciobanu LG, Sachdev PS, Trollor JN, Reppermund S, Thalamuthu A, Mather KA, et al. Downregulated transferrin receptor in the blood predicts recurrent MDD in the elderly cohort: a fuzzy forests approach. J Affect Disord. Apr 15, 2020;267:42-48. [CrossRef] [Medline]30], 2020Fan et al 27], 2021Le et al [Le TT, Fu W, Moore JH. Scaling tree-based automated machine learning to biomedical big data with a feature set selector. Bioinformatics. Jan 01, 2020;36(1):250-256. [FREE Full text] [CrossRef] [Medline]37], 2020Lin et al [Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]16], 2021Parvandeh et al [Parvandeh S, Yeh HW, Paulus MP, McKinney BA. Consensus features nested cross-validation. Bioinformatics. May 01, 2020;36(10):3093-3098. [FREE Full text] [CrossRef] [Medline]38], 2020Payne et al [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39], 2020Qi et al [Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]1], 2020Qi et al [Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]18], 2021Sekaran and Sudha [Sekaran K, Sudha M. Prediction of lipopolysaccharides simulation responsiveness on gene expression profiles of major depression disorder affected cases using machine learning. Int J Sci Technol Res. 2019;8(11):21-24. [FREE Full text]26], 2019Stevens et al [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24], 2021Takahashi et al [Takahashi Y, Ueki M, Tamiya G, Ogishima S, Kinoshita K, Hozawa A, et al. Machine learning for effectively avoiding overfitting is a crucial strategy for the genetic prediction of polygenic psychiatric phenotypes. Transl Psychiatry. Aug 17, 2020;10(1):294. [FREE Full text] [CrossRef] [Medline]36], 2020Verma and Shakya [Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19], 2022
Were the criteria for inclusion in the sample clearly defined?UnclearNoNoNoYesNoNoNoNoYesNoNoNoNoNo
Were study individuals and setting described in detail?YesYesYesYesYesNoYesNoYesYesYesYesYesYesYes
Was the exposure measured in a valid and reliable way?YesUnclearYesYesYesUnclearYesUnclearYesYesNoNoYesYesNo
Were objective, standard criteria used for measurement of the condition?YesUnclearYesYesYesUnclearYesUnclearYesYesNoNoYesYesNo
Were confounding factors identified?aYesYesYesYes
Were strategies to deal with confounding factors stated?YesYesYesYes
Were the outcomes measured in a valid and reliable way?YesYesYesYesYesYesYesYesYesYesYesYesYesYesYes
Was appropriate statistical analysis used?YesYesYesYesYesYesYesYesYesYesYesYesYesYesYes

aNot applicable.


Principal Findings

Machine learning can enable researchers to identify specific features that impact depression, allowing providers to screen for these features in a clinical setting. In this scoping review, 15 studies published in the past 5 years reported on machine learning analysis of omics data to identify individuals with depression. Owing to the diversity of the data sources and methods, there was minimal overlap in comparable study results, indicating that this field is still in exploratory stages but will provide new avenues for future prediction of which patients are at risk of developing depression.

Future studies could help with diagnosing depression using genomic data in a more reliable way, helping to mitigate the potential biases of screening interviews. However, while the genomic studies identified many genetic variants associated with depression, the lack of overlap in study results indicates low reproducibility, which could be related to the low 40% heritability of depression. It may also be associated with the heterogeneity of depression symptoms, with different genetic variants correlating with different symptoms.

Genetic variants can be helpful in diagnosing depression, but they are not generally responsive to environmental stimuli. Most of the genomics studies in this review focused on identifying SNPs that differed between individuals with depression and healthy controls. One study focused on detecting pathways associated with depression, while another used gene probes as biomarkers [Sekaran K, Sudha M. Prediction of lipopolysaccharides simulation responsiveness on gene expression profiles of major depression disorder affected cases using machine learning. Int J Sci Technol Res. 2019;8(11):21-24. [FREE Full text]26,Arabnejad M, Dawkins BA, Bush WS, White BC, Harkness AR, McKinney BA. Transition-transversion encoding and genetic relationship metric in ReliefF feature selection improves pathway enrichment in GWAS. BioData Min. Nov 3, 2018;11(1):23. [FREE Full text] [CrossRef] [Medline]35]. With the varied outcomes, it was difficult to compare these 2 studies to the others and determine if the results were consistent.

Transcriptomics can identify transcripts associated with depression or genes that are differentially expressed in depression. Gene expression has some responsiveness to the environment, as does DNAm. Of the 5 transcriptomics studies, 1 (20%) used brain and blood samples, while the other 4 (80%) used only blood samples, so it was expected that the results may vary. One of the studies reported downregulation of a single gene; another study reported general dysregulation of a few 100 genes, and 1 study identified DEGs and upregulation or downregulation of related pathways [Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]18,Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]19,Ciobanu LG, Sachdev PS, Trollor JN, Reppermund S, Thalamuthu A, Mather KA, et al. Downregulated transferrin receptor in the blood predicts recurrent MDD in the elderly cohort: a fuzzy forests approach. J Affect Disord. Apr 15, 2020;267:42-48. [CrossRef] [Medline]30]. Another study focused on DGMs, groups of genes that are coexpressed in individuals with depression [Le TT, Fu W, Moore JH. Scaling tree-based automated machine learning to biomedical big data with a feature set selector. Bioinformatics. Jan 01, 2020;36(1):250-256. [FREE Full text] [CrossRef] [Medline]37]. The fifth transcriptomics study emphasized the machine learning models and reported how many genes were selected by each model [Parvandeh S, Yeh HW, Paulus MP, McKinney BA. Consensus features nested cross-validation. Bioinformatics. May 01, 2020;36(10):3093-3098. [FREE Full text] [CrossRef] [Medline]38]. It would be ideal for comparison if all the studies performed a transcriptome-wide analysis and reported upregulation or downregulation of each DEG identified.

The DNAm study of tryptophan hydroxylase-2 focused on the methylation of a single gene rather than an epigenome-wide approach, effectively limiting the results to that gene [Fan R, Hua T, Shen T, Jiao Z, Yue Q, Chen B, et al. Identifying patients with major depressive disorder based on tryptophan hydroxylase-2 methylation using machine learning algorithms. Psychiatry Res. Dec 2021;306:114258. [CrossRef] [Medline]27]. Similarly, the postpartum depression DNAm study focused on only 2 specific genes, making it impossible to compare the results of the 2 studies [Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]39]. Epigenome-wide association studies would likely be more effective in identifying differentially expressed regions associated with depression and possibly replicating work across studies [Aberg KA, Dean B, Shabalin AA, Chan RF, Han LK, Zhao M, et al. Methylome-wide association findings for major depressive disorder overlap in blood and brain and replicate in independent brain samples. Mol Psychiatry. Jun 21, 2020;25(6):1344-1354. [FREE Full text] [CrossRef] [Medline]40].

Microbiomics was an interesting approach, as it did not use blood or saliva samples to sequence genetic material from the human participant [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24]. Analysis of microbiomics data obtained from stool samples found differences in the composition of gut microbiota between individuals with depression and healthy individuals [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24]. Stevens et al [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24] identified particular taxa that were more prominent or depleted in the 2 groups. Furthermore, they focused on identifying physiological pathways involving microbiota that were associated with depression [Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]24]. The multiomics study identified many DEGs and DMSs related to depression [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6]. This may be the most insightful method because of the volume of results. However, it might be challenging to determine which results are the most significant. In addition, in many studies, only 1 type of omics data is available, so the multiomics method is not feasible.

A total of 20% (3/15) of the studies focused on identifying biological pathways. The genomics pathways study used the top 500 genes determined through feature selection and found associations with pathways that regulate neurotransmitter signaling [Arabnejad M, Dawkins BA, Bush WS, White BC, Harkness AR, McKinney BA. Transition-transversion encoding and genetic relationship metric in ReliefF feature selection improves pathway enrichment in GWAS. BioData Min. Nov 3, 2018;11(1):23. [FREE Full text] [CrossRef] [Medline]35]. The transcriptomics study identified pathways related to neurotransmitter reception, postsynaptic signal transmission, synaptic depression, and receptor activation, while the multi-omics study identified the Hippo signaling pathway, which is involved in cell proliferation and affects antidepressant response [Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]6,Breitfeld J, Scholl C, Steffens M, Laje G, Stingl JC. Gene expression and proliferation biomarkers for antidepressant treatment resistance. Transl Psychiatry. Mar 14, 2017;7(3):e1061. [FREE Full text] [CrossRef] [Medline]41]. The genomics and transcriptomics studies show relatively consistent results in finding associations with pathways affecting neurotransmitters. The multiomics study found a different type of pathway, which may reflect the heterogeneity of depression and could indicate that different mechanisms can lead to depression. Future omics studies could include pathways analysis to build upon the knowledge of which biological pathways are involved in depression.

All the machine learning methods performed well based on their individual performance metrics. However, supervised methods are preferred when attempting to identify biological features related to depression because of their interpretability. Of the 15 studies, 8 (53%) reported AUCs to indicate how well the machine learning models performed, while 5 (33%)only reported accuracy; 2 (13%) reported accuracy, sensitivity, and specificity; 1 (7%) reported partial correlation coefficients; and 1 (7%) only quantified the number of genes found in pathways related to mood disorders. A review of the literature found that the most common metric used to evaluate machine learning models was accuracy followed by sensitivity and specificity [Rjoob K, Bond R, Finlay D, McGilligan V, Leslie SJ, Rababah A, et al. Machine learning and the electrocardiogram over two decades: time series and meta-analysis of the algorithms, evaluation metrics and applications. Artif Intell Med. Oct 2022;132:102381. [CrossRef] [Medline]42]. However, the use of AUC as a performance metric is increasing [Rjoob K, Bond R, Finlay D, McGilligan V, Leslie SJ, Rababah A, et al. Machine learning and the electrocardiogram over two decades: time series and meta-analysis of the algorithms, evaluation metrics and applications. Artif Intell Med. Oct 2022;132:102381. [CrossRef] [Medline]42]. It was difficult to compare the performance of the machine learning models in this review due to the range of performance metrics; using a standardized metric could prove more useful when choosing a model and comparing results.

There are ethical considerations related to the prediction of depression, such as the possibility of increasing insurance premiums. The protection of patient privacy, confidentiality, and trust is central to using genomics data, especially given how sensitive the data are and how they could be used to predict the risk of future conditions. Moreover, if it becomes feasible to predict depression before an individual shows symptoms, providers will need to determine the appropriate timing for treatment. They could begin treating preemptively or wait for symptoms to manifest. Furthermore, the cost of analyzing omics data should be considered. Researchers should evaluate whether omics data have a higher predictive accuracy than formal psychiatric evaluation. If not, using omics data may not be the most cost-effective way to identify individuals with depression.

Limitations

Finally, this scoping review is not without limitations. First, many of the studies used data from biobanks, which did not provide detailed descriptions of the participants in the data sets. This makes it impossible to know the demographics and other sample characteristics. In addition, unknown sample characteristics make the generalizability of study results unclear. Moreover, some studies did not report how depression was screened or diagnosed among patients, so it is not known if validated screening measures or formal psychiatric diagnoses were used or only patient reports were used.

Future Work

In future research, it may be helpful to focus on machine learning methods that identify features rather than those that are more geared toward prediction. Identified features can include genetic variants, DEGs, or differentially methylated regions, which would provide more relevant information that could be used to identify depression. The long-term goal of this work is to be able to use these biomarkers for a more objective diagnosis of depression.

Nursing Implications

Nurses are in a unique position to provide mental health support to patients when they have received appropriate training and education in psychotherapy [da Silva Elias AD, Tavares CM, Muniz MP. The intersection between being a nurse and being a therapist in mental health. Rev Bras Enferm. 2020;73(1):e20180134. [FREE Full text] [CrossRef] [Medline]43]. Nurses have been called the “gateway” for care because they are typically the first point of contact with the health system and are in a position to build therapeutic relationships with patients [de Almeida JC, Barbosa CA, de Almeida LY, de Oliveira JL, de Souza J. Mental health actions and nurse's work. Rev Bras Enferm. 2020;73 Suppl 1(suppl 1):e20190376. [FREE Full text] [CrossRef] [Medline]44]. With their skills in establishing therapeutic relationships, building rapport, active listening, observing behaviors, and noticing the effects of medications, nurses serve an extremely important role in the health promotion of patients seeking mental health support [de Almeida JC, Barbosa CA, de Almeida LY, de Oliveira JL, de Souza J. Mental health actions and nurse's work. Rev Bras Enferm. 2020;73 Suppl 1(suppl 1):e20190376. [FREE Full text] [CrossRef] [Medline]44].

In addition, machine learning–based prediction of depression will eventually become part of common nursing clinical workflow. Therefore, it is imperative that nurses bring their expertise to the creation, evaluation, and implementation of artificial intelligence approaches to depression prediction. Of note, none of the 15 studies had nurse researchers as members of their study team. Nursing involvement in the entire life cycle of artificial intelligence will positively impact the usability and usefulness of data tools in clinical practice.

Conclusions

This scoping review describes different types of omics data and machine learning methods used to analyze these data to predict and diagnose depression. The findings indicate that the omics methods had similar performance in identifying variants, differentially methylated sites, and differences in gene expression. All machine learning methods performed well based on the metrics provided. Further research is needed in omics methods to identify more variants and differential sites and gene expression. When variants in omics data indicate the possibility of depression, it is important for clinicians, especially nurses, to assess individuals for symptoms of depression and provide a formal diagnosis and treatment if appropriate.

Acknowledgments

This work was supported by the National Center for Advancing Translational Sciences (TL1TR001875 [BT]) and the National Institute of Neurological Disorders and Stroke (R01NS123639 [RMC]).

Conflicts of Interest

None declared.

Multimedia Appendix 1

Search strategy and keywords.

DOCX File , 23 KB

Multimedia Appendix 2

Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews (PRISMA-ScR) checklist.

DOCX File , 84 KB

  1. Qi B, Fiori LM, Turecki G, Trakadis YJ. Machine learning analysis of blood microrna data in major depression: a case-control study for biomarker discovery. Int J Neuropsychopharmacol. Nov 26, 2020;23(8):505-510. [FREE Full text] [CrossRef] [Medline]
  2. Tomasik J, Han SY, Barton-Owen G, Mirea D, Martin-Key NA, Rustogi N, et al. A machine learning algorithm to differentiate bipolar disorder from major depressive disorder using an online mental health questionnaire and blood biomarker data. Transl Psychiatry. Jan 12, 2021;11(1):41. [FREE Full text] [CrossRef] [Medline]
  3. Greenberg PE, Fournier AA, Sisitsky T, Simes M, Berman R, Koenigsberg SH, et al. The economic burden of adults with major depressive disorder in the United States (2010 and 2018). Pharmacoeconomics. Jun 05, 2021;39(6):653-665. [FREE Full text] [CrossRef] [Medline]
  4. Di Y, Wang J, Liu X, Zhu T. Combining polygenic risk score and voice features to detect major depressive disorders. Front Genet. Dec 20, 2021;12:761141. [FREE Full text] [CrossRef] [Medline]
  5. Walther A, Cannistraci CV, Simons K, Durán C, Gerl MJ, Wehrli S, et al. Lipidomics in major depressive disorder. Front Psychiatry. Oct 15, 2018;9:459. [FREE Full text] [CrossRef] [Medline]
  6. Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim J, et al. Depression and suicide risk prediction models using blood-derived multi-omics data. Transl Psychiatry. Oct 17, 2019;9(1):262. [FREE Full text] [CrossRef] [Medline]
  7. Squarcina L, Villa FM, Nobile M, Grisan E, Brambilla P. Deep learning for the prediction of treatment response in depression. J Affect Disord. Feb 15, 2021;281:618-622. [CrossRef] [Medline]
  8. Kalibatseva Z, Leong FT. Cultural factors, depressive and somatic symptoms among Chinese American and European American college students. J Cross Cult Psychol. Sep 29, 2018;49(10):1556-1572. [CrossRef]
  9. Phoenix BJ, Hurd M, Chapman SA. Experience of psychiatric mental health nurse practitioners in public mental health. Nurs Adm Q. 2016;40(3):212-224. [CrossRef] [Medline]
  10. Chao YS, Lin KF, Wu CJ, Wu HC, Hsu HT, Tsao LC, et al. Simulation study to demonstrate biases created by diagnostic criteria of mental illnesses: major depressive episodes, dysthymia, and manic episodes. BMJ Open. Nov 10, 2020;10(11):e037022. [FREE Full text] [CrossRef] [Medline]
  11. Zhao S, Bao Z, Zhao X, Xu M, Li MD, Yang Z. Identification of diagnostic markers for major depressive disorder using machine learning methods. Front Neurosci. Jun 18, 2021;15:645998. [FREE Full text] [CrossRef] [Medline]
  12. Braun PR, Han S, Hing B, Nagahama Y, Gaul LN, Heinzman JT, et al. Genome-wide DNA methylation comparison between live human brain and peripheral tissues within individuals. Transl Psychiatry. Jan 31, 2019;9(1):47. [FREE Full text] [CrossRef] [Medline]
  13. Ferguson LB, Roberts AJ, Mayfield RD, Messing RO. Blood and brain gene expression signatures of chronic intermittent ethanol consumption in mice. PLoS Comput Biol. Feb 17, 2022;18(2):e1009800. [FREE Full text] [CrossRef] [Medline]
  14. Nishitani S, Isozaki M, Yao A, Higashino Y, Yamauchi T, Kidoguchi M, et al. Cross-tissue correlations of genome-wide DNA methylation in Japanese live human brain and blood, saliva, and buccal epithelial tissues. Transl Psychiatry. Feb 27, 2023;13(1):72. [FREE Full text] [CrossRef] [Medline]
  15. Arloth J, Eraslan G, Andlauer TF, Martins J, Iurato S, Kühnel B, et al. DeepWAS: multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput Biol. Feb 3, 2020;16(2):e1007616. [FREE Full text] [CrossRef] [Medline]
  16. Lin E, Kuo PH, Lin WY, Liu YL, Yang AC, Tsai SJ. Prediction of probable major depressive disorder in the Taiwan biobank: an integrated machine learning and genome-wide analysis approach. J Pers Med. Jun 24, 2021;11(7):597. [FREE Full text] [CrossRef] [Medline]
  17. Schultebraucks K, Choi KW, Galatzer-Levy IR, Bonanno GA. Discriminating heterogeneous trajectories of resilience and depression after major life stressors using polygenic scores. JAMA Psychiatry. Jul 01, 2021;78(7):744-752. [FREE Full text] [CrossRef] [Medline]
  18. Qi B, Ramamurthy J, Bennani I, Trakadis YJ. Machine learning and bioinformatic analysis of brain and blood mRNA profiles in major depressive disorder: a case-control study. Am J Med Genet B Neuropsychiatr Genet. Mar 2021;186(2):101-112. [CrossRef] [Medline]
  19. Verma P, Shakya M. Machine learning model for predicting major depressive disorder using RNA-Seq data: optimization of classification approach. Cogn Neurodyn. Apr 22, 2022;16(2):443-453. [FREE Full text] [CrossRef] [Medline]
  20. Yao Q, Chen Y, Zhou X. The roles of microRNAs in epigenetic regulation. Curr Opin Chem Biol. Aug 2019;51:11-17. [CrossRef] [Medline]
  21. Chen D, Meng L, Pei F, Zheng Y, Leng J. A review of DNA methylation in depression. J Clin Neurosci. Sep 2017;43:39-46. [CrossRef] [Medline]
  22. Limbana T, Khan F, Eskander N. Gut microbiome and depression: how microbes affect the way we think. Cureus. Aug 23, 2020;12(8):e9966. [FREE Full text] [CrossRef] [Medline]
  23. Martinez JE, Kahana DD, Ghuman S, Wilson HP, Wilson J, Kim SC, et al. Unhealthy lifestyle and gut dysbiosis: a better understanding of the effects of poor diet and nicotine on the intestinal microbiome. Front Endocrinol (Lausanne). 2021;12:667066. [FREE Full text] [CrossRef] [Medline]
  24. Stevens BR, Roesch L, Thiago P, Russell JT, Pepine CJ, Holbert RC, et al. Depression phenotype identified by using single nucleotide exact amplicon sequence variants of the human gut microbiome. Mol Psychiatry. Aug 27, 2021;26(8):4277-4287. [CrossRef] [Medline]
  25. Radjabzadeh D, Bosch JA, Uitterlinden AG, Zwinderman AH, Ikram MA, van Meurs JBJ, et al. Gut microbiome-wide association study of depressive symptoms. Nat Commun. Dec 06, 2022;13(1):7128. [FREE Full text] [CrossRef] [Medline]
  26. Sekaran K, Sudha M. Prediction of lipopolysaccharides simulation responsiveness on gene expression profiles of major depression disorder affected cases using machine learning. Int J Sci Technol Res. 2019;8(11):21-24. [FREE Full text]
  27. Fan R, Hua T, Shen T, Jiao Z, Yue Q, Chen B, et al. Identifying patients with major depressive disorder based on tryptophan hydroxylase-2 methylation using machine learning algorithms. Psychiatry Res. Dec 2021;306:114258. [CrossRef] [Medline]
  28. Musolf AM, Holzinger ER, Malley JD, Bailey-Wilson JE. What makes a good prediction? Feature importance and beginning to open the black box of machine learning in genetics. Hum Genet. Sep 04, 2022;141(9):1515-1528. [FREE Full text] [CrossRef] [Medline]
  29. Shatte AB, Hutchinson DM, Teague SJ. Machine learning in mental health: a scoping review of methods and applications. Psychol Med. Jul 2019;49(9):1426-1448. [CrossRef] [Medline]
  30. Ciobanu LG, Sachdev PS, Trollor JN, Reppermund S, Thalamuthu A, Mather KA, et al. Downregulated transferrin receptor in the blood predicts recurrent MDD in the elderly cohort: a fuzzy forests approach. J Affect Disord. Apr 15, 2020;267:42-48. [CrossRef] [Medline]
  31. Tricco AC, Lillie E, Zarin W, O'Brien KK, Colquhoun H, Levac D, et al. PRISMA extension for scoping reviews (PRISMA-ScR): checklist and explanation. Ann Intern Med. Oct 02, 2018;169(7):467-473. [FREE Full text] [CrossRef] [Medline]
  32. Koch L, Potenski C, Trenkmann M. Sequencing moves to the twenty-first century. Nature. 2021. URL: https://www.nature.com/articles/d42859-020-00100-w [accessed 2024-04-29]
  33. Checklist for analytical cross sectional studies. The Joanna Briggs Institute. 2020. URL: http://joannabriggs.org/research/critical-appraisal-tools.html [accessed 2022-07-20]
  34. Wang X, Cheng Z. Cross-sectional studies: strengths, weaknesses, and recommendations. Chest. Jul 2020;158(1S):S65-S71. [CrossRef] [Medline]
  35. Arabnejad M, Dawkins BA, Bush WS, White BC, Harkness AR, McKinney BA. Transition-transversion encoding and genetic relationship metric in ReliefF feature selection improves pathway enrichment in GWAS. BioData Min. Nov 3, 2018;11(1):23. [FREE Full text] [CrossRef] [Medline]
  36. Takahashi Y, Ueki M, Tamiya G, Ogishima S, Kinoshita K, Hozawa A, et al. Machine learning for effectively avoiding overfitting is a crucial strategy for the genetic prediction of polygenic psychiatric phenotypes. Transl Psychiatry. Aug 17, 2020;10(1):294. [FREE Full text] [CrossRef] [Medline]
  37. Le TT, Fu W, Moore JH. Scaling tree-based automated machine learning to biomedical big data with a feature set selector. Bioinformatics. Jan 01, 2020;36(1):250-256. [FREE Full text] [CrossRef] [Medline]
  38. Parvandeh S, Yeh HW, Paulus MP, McKinney BA. Consensus features nested cross-validation. Bioinformatics. May 01, 2020;36(10):3093-3098. [FREE Full text] [CrossRef] [Medline]
  39. Payne JL, Osborne LM, Cox O, Kelly J, Meilman S, Jones I, et al. DNA methylation biomarkers prospectively predict both antenatal and postpartum depression. Psychiatry Res. Mar 2020;285:112711. [FREE Full text] [CrossRef] [Medline]
  40. Aberg KA, Dean B, Shabalin AA, Chan RF, Han LK, Zhao M, et al. Methylome-wide association findings for major depressive disorder overlap in blood and brain and replicate in independent brain samples. Mol Psychiatry. Jun 21, 2020;25(6):1344-1354. [FREE Full text] [CrossRef] [Medline]
  41. Breitfeld J, Scholl C, Steffens M, Laje G, Stingl JC. Gene expression and proliferation biomarkers for antidepressant treatment resistance. Transl Psychiatry. Mar 14, 2017;7(3):e1061. [FREE Full text] [CrossRef] [Medline]
  42. Rjoob K, Bond R, Finlay D, McGilligan V, Leslie SJ, Rababah A, et al. Machine learning and the electrocardiogram over two decades: time series and meta-analysis of the algorithms, evaluation metrics and applications. Artif Intell Med. Oct 2022;132:102381. [CrossRef] [Medline]
  43. da Silva Elias AD, Tavares CM, Muniz MP. The intersection between being a nurse and being a therapist in mental health. Rev Bras Enferm. 2020;73(1):e20180134. [FREE Full text] [CrossRef] [Medline]
  44. de Almeida JC, Barbosa CA, de Almeida LY, de Oliveira JL, de Souza J. Mental health actions and nurse's work. Rev Bras Enferm. 2020;73 Suppl 1(suppl 1):e20190376. [FREE Full text] [CrossRef] [Medline]


AUC: area under the curve
CpG: cytosine-phosphodiester bond-guanine
DeepWAS: functional unit–wide association study
DEG: differentially expressed gene
DGM: depression gene module
DMS: differentially methylated site
DNAm: DNA methylation
dSNP: single nucleotide polymorphisms associated with a disease
FU: functional unit
GWAS: genome-wide association study
mRNA: messenger RNA
PRISMA: Preferred Reporting Items for Systematic Reviews and Meta-Analyses
PRISMA-ScR: Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews
SNP: single nucleotide polymorphism
TPOT: tree-based pipeline optimization tool
XGBoost: extreme gradient boost


Edited by E Borycki; submitted 22.11.23; peer-reviewed by Y Pan, J Chen; comments to author 23.02.24; revised version received 16.04.24; accepted 22.04.24; published 19.07.24.

Copyright

©Brittany Taylor, Mollie Hobensack, Stephanie Niño de Rivera, Yihong Zhao, Ruth Masterson Creber, Kenrick Cato. Originally published in JMIR Nursing (https://nursing.jmir.org), 19.07.2024.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Nursing, is properly cited. The complete bibliographic information, a link to the original publication on https://nursing.jmir.org/, as well as this copyright and license information must be included.