The efficacy of current influenza vaccines requires a close antigenic match between circulating and vaccine strains. As such, timely identification of emerging influenza virus antigenic variants is central to the success of influenza vaccination programs. Empirical methods to determine influenza virus antigenic properties are time-consuming and mid-throughput and require live viruses. Here, we present a novel, experimentally validated, computational method for determining influenza virus antigenicity on the basis of hemagglutinin (HA) sequence. This method integrates a bootstrapped ridge regression with antigenic mapping to quantify antigenic distances by using influenza HA1 sequences. Our method was applied to H3N2 seasonal influenza viruses and identified the 13 previously recognized H3N2 antigenic clusters and the antigenic drift event of 2009 that led to a change of the H3N2 vaccine strain.
Importance: This report supplies a novel method for quantifying antigenic distance and identifying antigenic variants using sequences alone. This method will be useful in influenza vaccine strain selection by significantly reducing the human labor efforts for serological characterization and will increase the likelihood of correct influenza vaccine candidate selection.