The recent completion of a reference sequence of the human genome now allows a complete characterization of the type II keratin gene domain on chromosome 12q13.13. This, domain, approximately 780 kb in size, is present on nine bacterial artificial chromosome clones sequenced by the Human Genome Sequencing Project. The type II keratin domain contains 27 keratin genes and eight pseudogenes. Twenty-three of these genes and four pseudogenes have been previously reported. This study describes, in addition to the genomic sequencing of the K2p gene and the bioinformatic identification of four keratin pseudogenes, the characterization of cDNA corresponding to three previously undescribed keratin genes K1b, K6l, and Kb20, as well as cDNA sequences for the previously described keratin genes hHb2, hHb4, and K3. Northern analysis of the new keratins K1b, K6l, K5b, and Kb20 using mRNA of major organs as well as of specific epithelial subtypes shows singular expression of these keratins in skin, hair follicles and, for K5b and Kb20, in tongue, respectively. In addition, the obvious discrepancies between the current reference sequence of the human genome and the previously described gene/cDNA sequences for K6c, K6d, K6e, K6f, K6h are investigated, leading to the conclusion that K6c, K6d as well as K6e, K6f are probably polymorphic variants of K6a and K6h, respectively. All 26 human type II keratins found on this domain as well as K18, dtype 1 Keratin, are identified at the genomic and transcriptional level. This appears to be the total complement of functional type II keratins in humans.