Large-scale identification of single-feature polymorphisms in complex genomes

Genome Res. 2003 Mar;13(3):513-23. doi: 10.1101/gr.541303.

Abstract

We have developed a high-throughput genotyping platform by hybridizing genomic DNA from Arabidopsis thaliana accessions to an RNA expression GeneChip (AtGenome1). Using newly developed analytical tools, a large number of single-feature polymorphisms (SFPs) were identified. A comparison of two accessions, the reference strain Columbia (Col) and the strain Landsberg erecta (Ler), identified nearly 4000 SFPs, which could be reliably scored at a 5% error rate. Ler sequence was used to confirm 117 of 121 SFPs and to determine the sensitivity of array hybridization. Features containing sequence repeats, as well as those from high copy genes, showed greater polymorphism rates. A linear clustering algorithm was developed to identify clusters of SFPs representing potential deletions in 111 genes at a 5% false discovery rate (FDR). Among the potential deletions were transposons, disease resistance genes, and genes involved in secondary metabolism. The applicability of this technique was demonstrated by genotyping a recombinant inbred line. Recombination break points could be clearly defined, and in one case delimited to an interval of 29 kb. We further demonstrate that array hybridization can be combined with bulk segregant analysis to quickly map mutations. The extension of these tools to organisms with complex genomes, such as Arabidopsis, will greatly increase our ability to map and clone quantitative trait loci (QTL).

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Arabidopsis / genetics
  • Chromosome Deletion
  • Chromosomes, Plant / genetics
  • Databases, Genetic
  • Gene Expression Profiling / methods
  • Genes, Plant / genetics
  • Genetic Testing / methods*
  • Genetic Testing / trends
  • Genome, Plant*
  • Genotype
  • Oligonucleotide Array Sequence Analysis / methods
  • Plants, Genetically Modified / genetics
  • Polymorphism, Genetic / genetics*
  • RNA, Plant / genetics
  • Recombination, Genetic / genetics

Substances

  • RNA, Plant