Secondary Structures of Proteins Follow Menzerath-Altmann Law

Int J Mol Sci. 2022 Jan 29;23(3):1569. doi: 10.3390/ijms23031569.

Abstract

This article examines the presence of the empirical tendency known as the Menzerath-Altmann Law (MAL) on protein secondary structures. MAL is related to optimization principles observed in natural languages and in genetic information on chromosomes or protein domains. The presence of MAL is examined on a non-redundant dataset of 4728 proteins by verifying significant, negative correlations and testing classical and newly proposed formulas by fitting the observed trend. We conclude that the lengths of secondary structures are specifically dependent on their number inside the protein sequence, while possibly reflecting the formula proposed in this paper. This behavior is observed on average but is individually avoidable and possibly driven by a latent cost function. The data suggest that MAL could provide a useful guiding principle in protein design.

Keywords: Menzerath–Altmann law; empirical law; formula fitting; proteins; quantitative linguistics; secondary structures.

MeSH terms

  • Algorithms
  • Databases, Protein
  • Models, Molecular*
  • Protein Structure, Secondary
  • Proteins / chemistry*
  • Statistics as Topic
  • Subcellular Fractions / metabolism

Substances

  • Proteins