The intraclass correlation coefficient applied for evaluation of data correction, labeling methods, and rectal biopsy sampling in DNA microarray experiments

Physiol Genomics. 2003 Dec 16;16(1):99-106. doi: 10.1152/physiolgenomics.00111.2003.

Abstract

We show that the intraclass correlation coefficient (ICC) can be used as a relatively simple statistical measure to assess methodological and biological variation in DNA microarray analysis. The ICC is a measure that determines the reproducibility of a variable, which can easily be calculated from an ANOVA table. It is based on the assessment of both systematic deviation and random variation, and it facilitates comparison of multiple samples at once. We used the ICC first to optimize our microarray data normalization method and found that the use of median values instead of mean values improves data correction. Then the reproducibility of different labeling methods was evaluated, and labeling by indirect fluorescent dye incorporation appeared to be more reproducible than direct labeling. Finally, we determined optimal biopsy sampling by analyzing overall variation in gene expression. The variation in gene expression of rectal biopsies within persons decreased when two biopsies were taken instead of one, but it did not considerably improve when more than two biopsies were taken from one person, indicating that it is sufficient to use two biopsies per person for DNA microarray analysis under our experimental conditions. To optimize the accuracy of the microarray data, biopsies from at least six different persons should be used per group.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Analysis of Variance
  • Biopsy*
  • Cell Line, Tumor
  • Fluorescent Dyes
  • Gene Expression Profiling / methods*
  • Gene Expression Profiling / standards*
  • Humans
  • Oligonucleotide Array Sequence Analysis / methods*
  • Oligonucleotide Array Sequence Analysis / standards*
  • RNA, Messenger / analysis
  • RNA, Messenger / genetics
  • Rectum / metabolism*
  • Reproducibility of Results
  • Research Design
  • Sample Size
  • Staining and Labeling / standards*

Substances

  • Fluorescent Dyes
  • RNA, Messenger