A weighted average difference method for detecting differentially expressed genes from microarray data
- PMID: 18578891
- PMCID: PMC2464587
- DOI: 10.1186/1748-7188-3-8
A weighted average difference method for detecting differentially expressed genes from microarray data
Abstract
Background: Identification of differentially expressed genes (DEGs) under different experimental conditions is an important task in many microarray studies. However, choosing which method to use for a particular application is problematic because its performance depends on the evaluation metric, the dataset, and so on. In addition, when using the Affymetrix GeneChip(R) system, researchers must select a preprocessing algorithm from a number of competing algorithms such as MAS, RMA, and DFW, for obtaining expression-level measurements. To achieve optimal performance for detecting DEGs, a suitable combination of gene selection method and preprocessing algorithm needs to be selected for a given probe-level dataset.
Results: We introduce a new fold-change (FC)-based method, the weighted average difference method (WAD), for ranking DEGs. It uses the average difference and relative average signal intensity so that highly expressed genes are highly ranked on the average for the different conditions. The idea is based on our observation that known or potential marker genes (or proteins) tend to have high expression levels. We compared WAD with seven other methods; average difference (AD), FC, rank products (RP), moderated t statistic (modT), significance analysis of microarrays (samT), shrinkage t statistic (shrinkT), and intensity-based moderated t statistic (ibmT). The evaluation was performed using a total of 38 different binary (two-class) probe-level datasets: two artificial "spike-in" datasets and 36 real experimental datasets. The results indicate that WAD outperforms the other methods when sensitivity and specificity are considered simultaneously: the area under the receiver operating characteristic curve for WAD was the highest on average for the 38 datasets. The gene ranking for WAD was also the most consistent when subsets of top-ranked genes produced from three different preprocessed data (MAS, RMA, and DFW) were compared. Overall, WAD performed the best for MAS-preprocessed data and the FC-based methods (AD, WAD, FC, or RP) performed well for RMA and DFW-preprocessed data.
Conclusion: WAD is a promising alternative to existing methods for ranking DEGs with two classes. Its high performance should increase researchers' confidence in microarray analyses.
Figures
Similar articles
-
Ranking differentially expressed genes from Affymetrix gene expression data: methods with reproducibility, sensitivity, and specificity.Algorithms Mol Biol. 2009 Apr 22;4:7. doi: 10.1186/1748-7188-4-7. Algorithms Mol Biol. 2009. PMID: 19386098 Free PMC article.
-
Arrow plot: a new graphical tool for selecting up and down regulated genes and genes differentially expressed on sample subgroups.BMC Bioinformatics. 2012 Jun 26;13:147. doi: 10.1186/1471-2105-13-147. BMC Bioinformatics. 2012. PMID: 22734592 Free PMC article.
-
Evaluating methods for ranking differentially expressed genes applied to microArray quality control data.BMC Bioinformatics. 2011 Jun 6;12:227. doi: 10.1186/1471-2105-12-227. BMC Bioinformatics. 2011. PMID: 21639945 Free PMC article.
-
A unified framework for finding differentially expressed genes from microarray experiments.BMC Bioinformatics. 2007 Sep 18;8:347. doi: 10.1186/1471-2105-8-347. BMC Bioinformatics. 2007. PMID: 17877806 Free PMC article.
-
Gene Prioritization and Network Topology Analysis of Targeted Genes for Acquired Taxane Resistance by Meta-Analysis.Crit Rev Eukaryot Gene Expr. 2019;29(6):581-597. doi: 10.1615/CritRevEukaryotGeneExpr.2019026317. Crit Rev Eukaryot Gene Expr. 2019. PMID: 32422012 Review.
Cited by
-
Ranking differentially expressed genes from Affymetrix gene expression data: methods with reproducibility, sensitivity, and specificity.Algorithms Mol Biol. 2009 Apr 22;4:7. doi: 10.1186/1748-7188-4-7. Algorithms Mol Biol. 2009. PMID: 19386098 Free PMC article.
-
Neurodegenerative processes accelerated by protein malnutrition and decelerated by essential amino acids in a tauopathy mouse model.Sci Adv. 2021 Oct 22;7(43):eabd5046. doi: 10.1126/sciadv.abd5046. Epub 2021 Oct 22. Sci Adv. 2021. PMID: 34678069 Free PMC article.
-
Periostin attenuates tumor growth by inducing apoptosis in colitis-related colorectal cancer.Oncotarget. 2018 Apr 13;9(28):20008-20017. doi: 10.18632/oncotarget.25026. eCollection 2018 Apr 13. Oncotarget. 2018. PMID: 29731999 Free PMC article.
-
Identifying clinically relevant drug resistance genes in drug-induced resistant cancer cell lines and post-chemotherapy tissues.Oncotarget. 2015 Dec 1;6(38):41216-27. doi: 10.18632/oncotarget.5649. Oncotarget. 2015. PMID: 26515599 Free PMC article.
-
Extracellular lipidome change by an SGLT2 inhibitor, luseogliflozin, contributes to prevent skeletal muscle atrophy in db/db mice.J Cachexia Sarcopenia Muscle. 2022 Feb;13(1):574-588. doi: 10.1002/jcsm.12814. Epub 2021 Dec 2. J Cachexia Sarcopenia Muscle. 2022. PMID: 34854254 Free PMC article.
References
LinkOut - more resources
Full Text Sources
