SymProFold can predict the structural organization of higher symmetrical assemblies, such as S-layers, as present in their native state in assembled form at the cell surfaces. As only sequential information is required as an input, this approach enables numerous avenues for investigating symmetrical formations, even those that pose significant experimental challenges, such as high toxicity, pathogenicity, extreme growth conditions, and substantial costs for the experimental investigation. Furthermore, the self-assembly property and significant sequence variation of S-layers presents a challenge for structural characterization using techniques such as X-ray crystallography and electron microscopy.
Due to the considerable size diversity of SLPs, some outreach 1000 amino acids 6, predictions of tetramers or hexamers can easily exceed the computing power needed to get highly confident models with AlphaFold. Nevertheless, the rapid improvements of AlphaFold in predicting large protein complexes will increase the size limit of SymProFold calculations. Calculated models can potentially be improved by manual pre-processing of input sequences according to prior knowledge. Regions like signal sequences or domains not involved in the assembly, such as cell wall anchors, can be removed before the calculation, thereby reducing the size of the input sequence. The SymProFold method has several stringent restrictions that lead to early termination of the pipeline when sequences of non-assembling proteins are used. In this case, it would be highly unlikely to find more than one defined symmetry complex. A superposition and assembly are highly unlikely even if two symmetry complexes are found due to a prediction artifact.
In case SymProFold fails to detect more than one strong intermolecular interaction, the construction of a fully assembled layer is not possible and results in a partially assembled model. This limitation causes a loss of information regarding weak but critical interactions, which may be required for a complete assembly. S-layers and viral capsids, which consist of multiple proteins, also represent a challenge for SymProFold predictions. In such cases, prior knowledge may be needed as the identification of the interactions important for the initiation of the assembly could be difficult via the automatic pipeline. Nevertheless, manual intervention and/or an extension of the software could overcome this limitation and enable the prediction of assemblies composed of more than one protein. Further adaptations in SymProFold are also needed for viral capsid automated predictions to address potential challenges like pseudosymmetry, higher triangulation number and variations in assembly curvature.
Many biological functions of S-layers depend on the completeness of the cell coverage as well as the structural and physicochemical repetitive uniformity, down to the subnanometer scale. Obtained structural data of assembled S-layers reveal properties of the exposed surface regions (Figure S31), internal pores, and anchoring domains essential for cell attachment and highlight interaction interfaces within the layer. Understanding these aspects is crucial for elucidation of the specific function of a particular S-layer, depending on the microorganism. This is especially important in pathogenic bacteria since S-layers play a role in surface adhesion and in interactions with the host's immune system. Furthermore, numerous studies on the in vivo and in vitro morphogenesis of S-layers demonstrated that lattice growth on growing cells is a highly dynamic process 6,7,11,41–45. Approximately 500 subunits per second must be synthesized at high growth rates, translocated to the cell surface, and incorporated in a defined orientation to the existing lattice while maintaining an equilibrium of the lowest free energy 6,44–47. The adaptable lattice of S-layers on growing and dividing bacterial and archaeal cells represents an advanced evolutionary stage in morphogenesis 44. At these specific sites, bonds must swiftly open and re-form. The dynamics is also present in the assembly of the capsid during the viral life cycle, including disassembly upon infection and reassembly during viral packaging. To gain insights into these dynamic processes, the SymProFold predicted assemblies can serve as a starting point to identify the required interactions within the lattice and perform molecular dynamic studies.
Structural insights into the S-layer assembly offer a vast opportunity for developing new technologies that can mimic the unique properties of these versatile and adaptable structures. Rational engineering of artificial structures with different properties could include developing new materials with advanced self-assembly properties or creating new drug delivery systems and biosensors 48. SymProFold predictions enable the development of computational models to simulate S-layers' behavior under different conditions. Since the S-layers of pathogens serve as favorable regions for drug targeting 34,49, understanding of anchoring mechanisms and/or interactions within the self-assembled S-layer provides the ideal basis for rational drug design and opens new strategies for weakening S-layer protective function. Furthermore, structural information now allows for analyzing the unique antifouling properties of S-layer lattices6 and offers a foundation for mimicking these structures in polymer technologies 50. The precise biophysical characterization of the pores allows for custom alterations of S-layer permeability, which was previously possible only through chemical modifications 11. S-layers have also shown great potential as a platform for drug delivery because of their biocompatibility, stability, and regular pore structure. Lipidic nanoformulations such as liposomes, solid lipid nanoparticles (SLNs), or emulsomes as drug delivery systems show better resistance to oxidative stress and membrane damage when covered with a crystalline S-layer 51,52. Uptake studies with emulsomes coated with S-layer proteins did not show significant cytotoxicity by human liver carcinoma cells (HepG2). The capacity to recrystallize S-layer proteins in lipidic formulations allows extremely precise targeted delivery of specific antibodies in high concentrations53,54 or drug-loaded particles, especially poorly water-soluble targets such as antimicrobial peptides or easily degradable biologicals such as enzymes used for the enzyme replacement therapy 6,51–53,55. New functionalities, such as cellular targeting and generation of fusion proteins with specific properties56 can be introduced. The S-layers of bacteria and archaea in the human microbiome are perfect candidates for these biotechnological applications. Altogether, this opens a new chapter in rational engineering and adaptation of many S-layers for applications in medicine and diagnostics, like treating fungal and viral infections, dermal conditions, cancer, immune deficiency, or rare genetic disorders.