Chorion proteins of Lepidoptera have a tripartite structure consisting of a central domain and two more variable flanking arms. The central domain is highly conserved and is used to classify chorion proteins into two major classes, A and B. Annotated and unreviewed Lepidopteran chorion protein sequences are available in various databases. Class A and class B central domain-specific profile Hidden Markov Models (pHMMs) were developed in this work and used to search 5 different protein databases. A total of 413 Lepidopteran chorion proteins from 9 moth species and 1 butterfly species were retrieved. These data were enriched and organised to populate LepChorionDB, the first relational database available on the web that contains Lepidopteran chorion proteins grouped into A and B classes. LepChorionDB may provide insights for future functional and evolutionary studies of Lepidopteran chorion proteins. It should also be a useful tool for the Lepidopteran scientific community and Lepidopteran genome annotators, since it provides access to the two pHMMs developed in this work, which may be used to discriminate class A from class B chorion proteins. LepChorionDB is freely available at http://bioinformatics.biol.uoa.gr/LepChorionDB.
Copyright © 2012 Elsevier Ltd. All rights reserved.