Recent analysis of the endA cellulase gene from Ruminococcus flavefaciens 17 has revealed that it encodes a product of 759 amino acids that provides the first example of a multidomain cellulase from a Ruminococcus sp. Following the family 5 catalytic domain in the predicted EndA enzyme is a 282 amino acid domain of unknown function for which no close relationship was found to other protein sequences. However, the C-terminal sequences of EndA contain a 34 amino acid threonine-rich linker connected to an 81 amino acid region, both of which show strong similarities to sequences present in two xylanases from R. flavefaciens 17. A distant relationship is evident between regions of the 80 amino acid sequences of EndA, XynD and XynB and the duplicated 23 amino acid dockerin sequences found in cellulolytic Clostridium sp., suggesting that as in Clostridium sp. these sequences could mediate the binding of enzymatic polypeptides to another component in the cell surface enzyme complex of R. flavefaciens.