Combining multiple tools outperforms individual methods in gene set enrichment analyses
- Author(s)
- Alhamdoosh, Monther; Ng, Milica; Wilson, Nicholas J; Sheridan, Julie M; Huynh, Huy; Wilson, Michael J; Ritchie, Matthew E;
- Journal Title
- Bioinformatics
- Publication Type
- Journal Article in press
- Abstract
- Motivation: Gene set enrichment (GSE) analysis allows researchers to efficiently extract biological insight from long lists of differentially expressed genes by interrogating them at a systems level. In recent years, there has been a proliferation of GSE analysis methods and hence it has become increasingly difficult for researchers to select an optimal GSE tool based on their particular data set. Moreover, the majority of GSE analysis methods do not allow researchers to simultaneously compare gene set level results between multiple experimental conditions.Results: The ensemble of genes set enrichment analyses (EGSEA) is a method developed for RNA-sequencing data that combines results from twelve algorithms and calculates collective gene set scores to improve the biological relevance of the highest ranked gene sets. EGSEA’s gene set database contains around 25,000 gene sets from sixteen collections. It has multiple visualization capabilities that allow researchers to view gene sets at various levels of granularity. EGSEA has been tested on simulated data and on a number of human and mouse data sets and, based on biologists’ feedback, consistently outperforms the individual tools that have been combined. Our evaluation demonstrates the superiority of the ensemble approach for GSE analysis, and its utility to effectively and efficiently extrapolate biological functions and potential involvement in disease processes from lists of differentially regulated genes.Availability and Implementation: EGSEA is available as an R package at http://www.bioconductor.org/packages/EGSEA/. The gene sets collections are available in the R package EGSEAdata from http://www.bioconductor.org/packages/EGSEAdata/.Contact: monther.alhamdoosh@csl.com.au, mritchie@wehi.edu.au
- Publisher
- OUP
- Research Division(s)
- Molecular Medicine
- PubMed ID
- 27694195
- Publisher's Version
- https://doi.org/10.1093/bioinformatics/btw623
- Open Access at Publisher's Site
- http://bioinformatics.oxfordjournals.org/content/early/2016/09/27/bioinformatics.btw623.abstract
- NHMRC Grants
- NHMRC/1050661, NHMRC/1045936, NHMRC/1057854, NHMRC/1104924,
- Terms of Use/Rights Notice
- Refer to copyright notice on published article.
Creation Date: 2016-10-04 09:29:22
Last Modified: 2016-10-11 03:55:11