Ancestral sequence reconstruction (ASR) infers predicted ancestral states for sites within sequences and can constrain the functions and properties of ancestors of extant protein families. Here, we compare the likely sequences of inferred nitrogenase ancestors to extant nitrogenase sequence diversity. We show that the most-likely combinations of ancestral states for key substrate channel residues are not represented in extant sequence space, and rarely found within a more broadly defined physiochemical space-supporting that the earliest ancestors of extant nitrogenases likely had alternative substrate channel composition. These differences may indicate differing environmental selection pressures acting on nitrogenase substrate specificity in ancient environments. These results highlight ASR's potential as an in silico tool for developing hypotheses about ancestral enzyme functions, as well as improving hypothesis testing through more targeted in vitro and in vivo experiments.
Keywords: Nif; ancestral sequence reconstruction; early life; nitrogenase.
© The Author(s) 2022. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution.