Human ANP32B forms a complex with monomeric H5N1 FluPolA
To gain insight into the structural basis of genome replication in IAV, we employed a recombinant baculovirus co-expressing Tky05 FluPolA subunits PB1, PB2 and PA, along with human ANP32B, in Sf9 insect cells. We then purified the FluPolA-ANP32B complexes via affinity and size exclusion chromatography. Cryo-EM analysis revealed that FluPolA is present in equilibrium between monomeric and dimeric forms, mostly bound to ANP32B (Supplementary Fig. 2). We did not observe symmetric dimers or higher-order oligomers in the samples. We determined the structure of a monomeric FluPolA in complex with ANP32B at a final resolution of 3.1 Å (Fig. 1a, Supplementary Fig. 2, Supplementary Table 1). The endonuclease domain of PA (PAEndo), the C-terminal domains of PB2 (PB2-C) and most of the highly acidic ANP32BLCAR (amino acids 160–251) remained flexible and were not resolved. The ANP32BLRR domain (amino acids 1-149) makes extensive contacts with the large C-terminal domain of PA (PA-C). Amino acid residues N129 and D130, which are essential for interaction between FluPol and ANP32A proteins11, 37, form hydrogen bonds with PA K635. N129 also interacts with the first methionine of the PB1 subunit (Fig. 1b). The PA 550 loop (residues 550 to 560)38 forms a β-hairpin that inserts between the ANP32BLRR and the resolved N-terminal section of the ANP32BLCAR (residues 150–159), with the hydrophobic residues A553 and V554 contacting the concave face of the ANP32BLRR. PA R551 forms a salt bridge with ANP32B D159, while PA R559 interacts with ANP32B D119 and D146, in addition to π-stacking with F121 (Fig. 1c). PA R556 forms two salt bridges, with ANP32B E154 and D157, confirming its role in strengthening the interaction with ANP32B.
The PA-C domain of FluPolA not only binds ANP32B, it also interacts with the host RNAP II CTD5, 8 during viral transcription5, 8, 9, 39. Comparison of the ANP32B and RNAP II CTD binding sites suggests that their binding is mutually exclusive (Fig. 1d). We performed competition experiments by immobilizing RNAP II CTD peptides6 on streptavidin resin, followed by incubation with FluPolA in the presence or absence of ANP32B. As expected, FluPolA binds the serine 5 phosphorylated (S5P) version of the CTD, a hallmark of RNAP II transcription initiation40, but in the presence of ANP32B binding is reduced > 6-fold (Fig. 1e). FluPolA pulldown is dependent on ANP32B concentration, while absence of ANP32B signal on the gel confirms that it does not bind CTD peptides itself, i.e. CTD binding is mediated solely by FluPolA (Supplementary Fig. 3). These data suggest that ANP32B can outcompete RNAP II for binding to FluPolA, in a concentration-dependent manner. Altogether, our data suggest that newly synthesised, RNA-free FluPolA binds ANP32B as it enters the cell nucleus, and that this interaction prevents binding to the RNAP II CTD.
Dimeric H5N1 FluPolA forms a replication platform with human ANP32B
We next determined the structure of the Tky05 replication platform, comprising an asymmetric dimer of FluPolA and ANP32B, to a final resolution of 3.2 Å (Fig. 2a, Supplementary Fig. 2, Supplementary Table 1, Supplementary Movie 1). In this complex, the active site of the FluPolR PAEndo domain is buried and thus cannot perform cleavage of capped host RNA, i.e. FluPolR is transcription incompetent. Its orientation is stabilised by the arrangement of the PB2-C domains: the PB2 mid-link (PB2Mid − link) and cap binding domain (PB2CBD) pack against the PB1 palm subdomain (PB1Palm) to form the product exit channel (Supplementary Fig. 1c). The PB2627 domain sits at the dimer interface, while the PB2 nuclear localisation signal (PB2NLS) domain packs against the PB2 lid (PB2Lid) and PAEndo domains, locking the overall FluPolR conformation (Fig. 2b, Supplementary Fig. 1b).
The PB2-C domains of FluPolE are arranged differently from published transcriptase (FluPolT) or replicase (FluPolR) conformations14, 16, 41. In those previous structures, the PB2627 and PB2NLS domains pack against the PA-C domain (Supplementary Fig. 1b, Supplementary Movie 1), while in this study the PB2Mid − link and PB2CBD of FluPolE pack against the PB2 N2 (PB2N2) domain (Fig. 2b, Supplementary Fig. 1b). In the transcriptase and replicase conformations, the PB2Lid domain packs against the PB1 thumb subdomain (PB1Thumb) and the C-terminal region to form the template exit channel (Supplementary Fig. 1b and 1c). In contrast, in FluPolE the PB2Lid flips over to mediate the stacking between PAEndo and PB2CBD via its C-terminal α-helix (Fig. 2b, Supplementary Fig. 1b, Supplementary Fig. 1d). This position of the PB2Lid and PAEndo domains is unique – the direct interaction between the PB2CBD and the PAEndo or PB2Lid domains has not been observed before. To assume this configuration, PAEndo rotates by 70° so that its loop51–72 stacks against the PB2CBD domain, sandwiching the PB2Lid between the PAEndo α-helix84–98 and the PB2CBD domains (Supplementary Fig. 1d). The interface is stabilized through extensive hydrophobic and hydrogen bonding interactions (Fig. 2c) and secures the overall FluPolE conformation. This explains why the PA loop51–72 was found to be essential for viral replication but not transcription42. Without the loop the FluPolE configuration cannot form, and without FluPolE the replication platform cannot form.
The interaction between FluPolR and FluPolE is predominantly mediated by their respective PB2 and PA subunits, forming a large interface. The PA-C domain and PB1 β-hairpin of FluPolE stack against the PB2N2 and PB2Mid − link domains of FluPolR (Fig. 2d, e), while the PB2627 domain of FluPolR interacts with the PA-C and the PB2627 − NLS domains of FluPolE (Fig. 2f). The conformation of ANP32B in the replication platform is practically identical to that in the ANP32B-bound monomeric FluPolA structure (root-mean-square deviation (RMSD) = 1.056 Å). Both structures have clear density for the ANP32BLRR (amino acid 1-149) and the N-terminal region of the ANP32BLCAR (residue 150–159). However, in the replication platform, ANP32B interacts with the FluPolE PB2627domain, while we observe no direct contacts with FluPolR. Of note, the canonical mammalian adaptive residue PB2 K627 of FluPolE forms a salt bridge with ANP32B E151 (Fig. 2g). PB2 R630 forms a hydrogen bond with the main chain of ANP32B E154, as well as π-stacking with ANP32B P156 (Fig. 2g). A proline-to-serine substitution at position 156 of swine ANP32A allowed avian FluPolA to replicate to some extent in porcine cells23, 43, lending credence to the idea of pigs as intermediary hosts for avian influenza viruses. Our structure suggests that an ANP32 P156S mutation would allow additional hydrogen bonding with FluPolE. Mutagenesis of PB2 residues 627 and 630, and ANP32B residues 151 and 156 leads to significantly reduced FluPolA activity in vRNP reconstitution assays, suggesting this interface is biologically relevant (Supplementary Fig. 4).
PB2 K627 of FluPolR is situated in a positively charged groove that may be a binding site for part of the unresolved section of the ANP32B LCAR domain (residues 160–251) (Fig. 2h). The ubiquitous PB2 627E-to-K switch, observed when avian influenza viruses spill over into mammals, likely allows a region of the fully acidic mammalian LCAR domain (for example, residues 176-DEEDEDD-183 in human ANP32B; acidic residues underlined) to interact with FluPolA, as suggested by Carrique et al (2020). Avian ANP32A contains acidic as well as basic residues in this region (176-VLSLVKDR-183; acidic and basic residues underlined), allowing interaction with either glutamate or lysine14 (Supplementary Fig. 5).
Comparison of the IAV and IVC replication platforms
The marked similarity of the ANP32A and ANP32B LRR domains44 allows direct comparison between the IAV (with ANP32B) and ICV (with ANP32A) replication platforms, despite the fact that influenza A and C viruses, although closely related, represent different influenza types that diverged around 8,000 years ago45. FluPolA and FluPolC are only about 20–40% identical at the amino acid level46, so differences in their interactions with ANP32 proteins of their respective hosts (humans and swine only in the case of ICV; a wide variety of birds and mammals for IAV) are to be expected. Nevertheless, the replication platforms share a similar overall arrangement, albeit exhibiting some striking differences (Supplementary Movie 1).
Notably, in the IAV replication platform, only the C-terminal region of the ANP32BLRR binds to the complex, exclusively interacting with FluPolE (Fig. 3a). This reinforces the essential role of ANP32 residues N129 and D130 in supporting replication. Unlike the ICV complex, no direct interaction is observed between the N-terminal region of the ANP32BLRR and FluPolR, suggesting that IAV has evolved to optimise the binding of ANP32 exclusively to FluPolE. We speculate that a single, optimised binding interface may allow IAV to target a wide range of host animals.
Insertions into regions of the ICV P3-C domain (P3 being the ICV equivalent of PA), in particular the loops between residues 490 and 500 and the 550 loop (Fig. 3c), lead to ANP32B binding to the IAV FluPolE PA-C in an alternative orientation (Fig. 3b). In ICV, the P3 550 loop is positioned between the PB2627 − NLS domain and the ANP32ALCAR, preventing their interaction (Fig. 3d). In IAV, the shorter PA 550 loop is inserted between the ANP32BLRR and the ANP32BLCAR and, since the PA490–500 loop is also shorter, the PB2627 − NLS is rotated towards the ANP32BLCAR, promoting direct interactions (Fig. 3d). These differences also affect how the remaining PB2-C domains are arranged: in IAV, the stacking of the PAEndo domain against the PB2CBD is mediated by PA loop51–72, which is not conserved in ICV (Fig. 3c).
In summary, like their interaction with the C-terminal domain of host RNAP II39, differences in ANP32 protein binding between IAV and ICV likely reflects their evolutionary distance, and is informative with regard to key amino acid interactions.
Molecular basis of mammalian adaptations in avian FluPolA
Mutations in the viral polymerase (Fig. 4a) are vital to the process of avian IAV adaptation to mammalian hosts47. PB2 E627K is the hallmark of mammalian adaptation; all 20th century influenza viruses circulating in humans had this signature. In contrast, the 2009 pandemic H1N1 viruses had PB2 627E – their replication in mammals is attributed predominantly to a glutamine-to-arginine mutation at position 591 of PB248, 49. Our structure shows that K627 and Q591 are adjacent in FluPolE and could substitute each other to form a salt bridge with ANP32 E151 upon mutation into a basic residue (Fig. 4b). The more recently identified adaptive mutation PB2 M631L is in the same FluPolE cluster (Fig. 4b). In combination with PA E349K, this mutation allowed an avian H9N2 FluPolA to replicate using the normally non-functional chicken ANP32B and ANP32E (as opposed to the proviral chicken ANP32A), as well as human ANP32 proteins36. Other mammalian adaptations in the PB2 subunit of avian FluPolA – including 2009 pH1N1 viruses – are T271A and D701N50, 51. It has been suggested that PB2 Q591R, E627K and D701N specifically adapt avian FluPolA to mammalian ANP32 proteins52, while we hypothesise that T271A stabilises the FluPolE conformation through interaction with the 424 loop of PB2CBD 53. Of note, on FluPolR Q591, K627 and M631 also form a cluster (Fig. 4a), on the interface with the PB2627 domain of FluPolE, which is the presumed location of the LCAR domain of ANP32. Thus some residues may fulfil a double function of strengthening FluPolE (via ANP32 residue 151) as well as FluPolR (through LCAR residues) binding to ANP32. Nevertheless, it is not entirely clear from the current structure how the 33 amino acid insertion in avian ANP32A allows both PB2 627E and 627K FluPolA activity.
Considering FluPolA adaptations that occur in nature when avian IAVs, including the highly pathogenic H5Nx and H7Nx subtypes, spill over into mammals (Fig. 4a and Supplementary Table 2), it appears that they fall in one of two broad categories. Substitutions like E627K in PB2 and Q556R in PA strengthen interactions between mammalian ANP32 proteins and FluPolE. Adaptations including PA E349K and PB2 K526R may shift the FluPolA oligomeric equilibrium (in particular, asymmetric versus symmetric dimers) in favour of the asymmetric replication platform, thus promoting replication.
Besides the PB2 591/627/631 clusters, we observe a potential mutational hotspot at the PA-PA interface of the platform. Arai and colleagues54 identify a pair of PA substitutions – S388R and A448E – from an Egyptian H5N1 strain that replicates to high titres in human cells. These residues sit on opposite sides of the FluPolE-FluPolR interface and may enhance affinity between the molecules by forming an additional hydrogen bond (Fig. 4c). One of the mammalian adaptations that is already present in wildtype Tky05 FluPolA, PA N383D, maps a short distance across from K605 of the FluPolR PA subunit. Here a hydrogen bond can form between the FluPolE D383 and the FluPolR K605 (Fig. 4c). Clustered mutational hotspots are potential targets for antiviral intervention.