Amplification and overexpression of genes involved in cellular growth control occur frequently in human cancers. Here, we report characterization of the full length OS4 cDNA derived from 12q13-q15 (Su et al., Proc. Natl. Acad. Sci. USA, 91: 9121-9125, 1994), a region frequently amplified in sarcomas and brain tumors. This cDNA consists of 4833 base pairs (bp) encoding an open reading frame (ORF) of 283 amino acids. The ORF predicts a water-soluble acidic (pI 5.50) polypeptide with a molecular weight of 31759. Database searches revealed highly significant similarity between OS4 and eight proteins predicted from genomic sequences of Caenorhabditis elegans, Schizosaccaharomyces pombe, and Saccharomyces cerevisiae. Thus, OS4 defines a novel evolutionarily conserved gene superfamily. Northern and database analyses revealed OS4 transcripts in numerous human tissues demonstrating its ubiquitous expression. We also observed overexpression of OS4 in three cancer cell lines with amplification of this gene. Furthermore, we detected OS4 amplification in 5/5 primary sarcomas with known amplification of the closely linked marker CDK4. These results demonstrate that the highly conserved OS4 gene is frequently included in the 12q13-q15 amplicon and may contribute to the development of a subset of sarcomas.