Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain

DNA Res. 1996 Oct 31;3(5):321-9, 341-54. doi: 10.1093/dnares/3.5.321.

Abstract

As part of a series of projects to sequence human cDNA clones that correspond to relatively long, nearly full-length transcripts, we determined the sequences of 80 new clones and predicted the coding sequences of the corresponding genes, named KIAA0201 to KIAA0280. Of the sequenced clones, 68 were obtained from the human immature myeloid cell line KG-1 and 12 from human brain. The average clone size was 5.3 kb, and the average size of distinct ORFs in the clones was 2.8 kb, corresponding to a protein of approximately 100 kDa. Computer searches against the public databases indicated that the sequences of 22 genes were unrelated to any reported genes, while the remaining 58 genes carried sequences that showed some similarity to known genes. Protein motifs matching those in the PROSITE motif database were found in 25 genes, and significant transmembrane domains were identified in 30 genes. The known genes to which significant similarity was shown included genes that play key roles in the regulation of developmental stages, apoptosis and cell-to-cell interaction. Taking into account both the sequence similarity search data and the protein motifs, at least seven genes were considered to be related to transcriptional regulation and six genes to signal transduction. When the expression profiles of the cDNA clones were examined in different human tissues, about half of the clones from brain (5 of 11) showed significant tissue specificity, while approximately 80% of the genes from KG-1 were expressed ubiquitously.
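
The step from a 2.8 kb ORF to a protein of roughly 100 kDa can be checked with a short calculation. The sketch below is illustrative only: the function name `orf_to_protein_kda` is hypothetical, and the average residue mass of 110 Da is an assumed rule of thumb, not a value stated in the abstract. It also ignores the stop codon and any post-translational modification.

```python
# Assumption: ~110 Da per amino acid residue is a common rule of thumb;
# the abstract does not say which conversion factor the authors used.
AVG_RESIDUE_MASS_DA = 110.0


def orf_to_protein_kda(orf_length_bp, avg_residue_mass_da=AVG_RESIDUE_MASS_DA):
    """Estimate protein mass in kDa from an ORF length in base pairs."""
    codons = orf_length_bp // 3  # three nucleotides encode one residue
    return codons * avg_residue_mass_da / 1000.0


# Average ORF size reported in the abstract: 2.8 kb
estimate = orf_to_protein_kda(2800)
print(f"Estimated protein mass: {estimate:.0f} kDa")
```

Under that assumption, the estimate lands close to the approximately 100 kDa figure given in the abstract.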

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Brain / metabolism*
  • Cell Line
  • DNA, Complementary / genetics*
  • Gene Expression
  • Genes / genetics*
  • Humans
  • Molecular Sequence Data
  • Open Reading Frames / genetics*
  • Sequence Analysis, DNA / methods

Substances

  • DNA, Complementary

Associated data

  • GENBANK/D86956
  • GENBANK/D86957
  • GENBANK/D86958
  • GENBANK/D86959
  • GENBANK/D86960
  • GENBANK/D86961
  • GENBANK/D86962
  • GENBANK/D86963
  • GENBANK/D86964
  • GENBANK/D86965
  • GENBANK/D86966
  • GENBANK/D86967
  • GENBANK/D86968
  • GENBANK/D86969
  • GENBANK/D86970
  • GENBANK/D86971
  • GENBANK/D86972
  • GENBANK/D86973
  • GENBANK/D86974
  • GENBANK/D86975
  • GENBANK/D86976
  • GENBANK/D86977
  • GENBANK/D86978
  • GENBANK/D86979
  • GENBANK/D86980
  • GENBANK/D86981
  • GENBANK/D86982
  • GENBANK/D86983
  • GENBANK/D86984
  • GENBANK/D86985