Linear models and empirical Bayes methods for assessing differential expression in microarray experiments
- PMID: 16646809
- DOI: 10.2202/1544-6115.1027
Abstract
The problem of identifying differentially expressed genes in designed microarray experiments is considered. Lönnstedt and Speed (2002) derived an expression for the posterior odds of differential expression in a replicated two-color experiment using a simple hierarchical parametric model. The purpose of this paper is to develop the hierarchical model of Lönnstedt and Speed (2002) into a practical approach for general microarray experiments with arbitrary numbers of treatments and RNA samples. The model is reset in the context of general linear models with arbitrary coefficients and contrasts of interest. The approach applies equally well to both single-channel and two-color microarray experiments. Consistent, closed-form estimators are derived for the hyperparameters in the model. The proposed estimators behave robustly even for small numbers of arrays and allow for incomplete data arising from spot filtering or spot quality weights. The posterior odds statistic is reformulated in terms of a moderated t-statistic in which posterior residual standard deviations are used in place of ordinary standard deviations. The empirical Bayes approach is equivalent to shrinkage of the estimated sample variances towards a pooled estimate, resulting in far more stable inference when the number of arrays is small. The moderated t-statistic has the advantage over the posterior odds that fewer hyperparameters need to be estimated; in particular, knowledge of the non-null prior for the fold changes is not required. The moderated t-statistic is shown to follow a t-distribution with augmented degrees of freedom. The moderated t inferential approach extends to accommodate tests of composite null hypotheses through the use of moderated F-statistics. The performance of the methods is demonstrated in a simulation study. Results are presented for two publicly available data sets.
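The variance shrinkage described in the abstract can be illustrated with a minimal sketch. It assumes the simplest design, a single gene with replicate log-ratios and one coefficient (the mean). It also assumes that the prior degrees of freedom `d0` and prior variance `s0_sq` are supplied by the caller, whereas the paper estimates these hyperparameters from all genes. The function names and example values are hypothetical and used only for illustration. The posterior variance is taken as the weighted average (d0 * s0_sq + d * s_sq) / (d0 + d), and the resulting statistic is referred to a t-distribution on d0 + d degrees of freedom.

```python
import math
from statistics import mean, variance


def ordinary_t(log_ratios):
    """Ordinary one-sample t-statistic for a single gene."""
    n = len(log_ratios)
    return mean(log_ratios) / math.sqrt(variance(log_ratios) / n)


def moderated_t(log_ratios, d0, s0_sq):
    """Moderated t-statistic for a single gene.

    d0 (prior degrees of freedom) and s0_sq (prior variance) are
    assumed inputs here; they are not estimated in this sketch.
    Returns (statistic, degrees of freedom, posterior variance).
    """
    n = len(log_ratios)
    d = n - 1                       # residual degrees of freedom
    s_sq = variance(log_ratios)     # ordinary sample variance
    # Shrink the sample variance towards the prior (pooled) value.
    s_tilde_sq = (d0 * s0_sq + d * s_sq) / (d0 + d)
    # Posterior variance replaces the ordinary variance in the denominator.
    t_mod = mean(log_ratios) / math.sqrt(s_tilde_sq / n)
    return t_mod, d0 + d, s_tilde_sq


if __name__ == "__main__":
    # A gene whose three replicates happen to agree very closely.
    gene = [0.50, 0.51, 0.49]
    t_mod, df, s_tilde_sq = moderated_t(gene, d0=4.0, s0_sq=0.05)
    print("ordinary t:", ordinary_t(gene))
    print("moderated t:", t_mod, "on", df, "degrees of freedom")
```

With only three replicates, an accidentally tiny sample variance inflates the ordinary t-statistic; shrinking the variance towards the prior value damps that effect, which is the stabilizing behavior the abstract describes for small numbers of arrays.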