Skip to main page content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 1999 Aug 1;27(15):3219-28.
doi: 10.1093/nar/27.15.3219.

Intron-exon Structures of Eukaryotic Model Organisms

Free PMC article

Intron-exon Structures of Eukaryotic Model Organisms

M Deutsch et al. Nucleic Acids Res. .
Free PMC article


To investigate the distribution of intron-exon structures of eukaryotic genes, we have constructed a general exon database comprising all available intron-containing genes and exon databases from 10 eukaryotic model organisms: Homo sapiens, Mus musculus, Gallus gallus, Rattus norvegicus, Arabidopsis thaliana, Zea mays, Schizosaccharomyces pombe, Aspergillus, Caenorhabditis elegans and Drosophila. We purged redundant genes to avoid the possible bias brought about by redundancy in the databases. After discarding those questionable introns that do not contain correct splice sites, the final database contained 17 102 introns, 21 019 exons and 2903 independent or quasi-independent genes. On average, a eukaryotic gene contains 3.7 introns per kb protein coding region. The exon distribution peaks around 30-40 residues and most introns are 40-125 nt long. The variable intron-exon structures of the 10 model organisms reveal two interesting statistical phenomena, which cast light on some previous speculations. (i) Genome size seems to be correlated with total intron length per gene. For example, invertebrate introns are smaller than those of human genes, while yeast introns are shorter than invertebrate introns. However, this correlation is weak, suggesting that other factors besides genome size may also affect intron size. (ii) Introns smaller than 50 nt are significantly less frequent than longer introns, possibly resulting from a minimum intron size requirement for intron splicing.

Similar articles

See all similar articles

Cited by 139 articles

See all "Cited by" articles


    1. Nucleic Acids Res. 1988 Nov 11;16(21):9893-908 - PubMed
    1. Mol Biol Evol. 1998 Jun;15(6):770-3 - PubMed
    1. Proc Natl Acad Sci U S A. 1991 Feb 15;88(4):1276-80 - PubMed
    1. Nucleic Acids Res. 1992 May 25;20(10):2553-7 - PubMed
    1. Curr Opin Genet Dev. 1991 Dec;1(4):470-7 - PubMed

Publication types

MeSH terms