Protein expression and purification
The SEM2(49-107) of Homo sapiens semenogelin 2 gene was amplified from human genomic DNA. Sites of Ndel and Xhol restriction endonucleases as well as stop codon TGA were introduced by constructing following forward and reverse primers SEMG2_59_fwd (ttttttttcatatgTTTGGACAAAAAGACCAACAACATACTAAATCCAAA) and SEMG2_rev (ttttttttctcGAGCAGTTGTTGACTTCCACCTAG) which were purchased by Evrogen (Moscow, Russia) To construct SEM2(49-107) peptide with additional 6×his-tag at the С-terminus, the structural gene sem2.59 was cloned into pGS21a vector. The expression vector was multiplied by transformation into E. coli DH5α strain. Then E. coli strain BL21 star (DE3) pLysS was transformed with SEMG2(59)::pGS21a vector for heterologous expression. Protein expression was performed on selective synthetic minimal medium (M9) containing antibiotics (ampicillin - 100μg/mL; chloramphenicol, 25 μg/mL) with isotope labeled glucose (13С) and ammonium chloride (15N) (Cambridge Isotope Laboratories, UK) as a source of carbon and nitrogen (Usachev 2018). Primarily the cells were grown in LB rich nutrient medium at 37 °C with shaking at the rate of 180 rpm. At optical density OD600 equal to 0.9 cells were harvested in sterile environment, washed and transferred into M9 medium. Then we grew the cells with same conditions for 40 min.
The expression of the sem2.59 was induced by adding isopropy-β-D-1-thiogalactopyranoside (IPTG OD600=0.6–0.8) to a final concentration of 1 mM. Expression was carried out for 16h at the temperature of 37 °C and shaking rate of 180 rpm. Then cells were collected by centrifugation (5000 rpm, 10 min, 4 °C) using Avanti JXN-26, rotor JLA-9.1000 (Beckman, USA), the pellet was frozen in a liquid nitrogen and stored at –24 °C.
The cells were disrupted at 4 °C in buffer 1 (50 mM Tris-HCl, pH 7.6, 0.3 M NaCl) with a HD2070 ultrasonic homogenizer (Bandelin, Germany) in the presence of protease inhibitor cocktail (Mini Protease Inhibitor Cocktail (Roche, Switzerland)). Cell debris was pelleted by centrifugation at 100,000g for 1 h at 4°C using Optima XPN (45Ti rotor) centrifuge (Beckman Coulter, USA). The supernatant was discarded, and the precipitate with the target protein in the inclusion bodies was used for refolding. For further extraction of SEM2.59 protein, the precipitate was resuspended in buffer 2 (50 mM Tris-HCl, pH 7.6, 0.3 M NaCl, 1M Urea) using an ultrasonic homogenizer, centrifuged at 27000g for 40 min at 4°C, the supernatant was drained, and the precipitate was repeated twice more. After the third centrifugation, the precipitate was left to dissolve overnight on a magnetic stirrer at 4°C in a solution of 8M Urea. The resulting mixture was applied to a NiNTA resin column equilibrated with 8M Urea solution. Chelate affinity chromatography (IMAC) was performed on NiNTA resin followed by elution of SEM2.59 protein with buffer 3 (8M Urea; 300 mM imidazole), fractions of 0.5 mL each.Gel filtration was performed using an NGS Discover chromatographic system and Enrich SEC70 column (BioRad, USA) in buffer 4 (25 mM Bis-Tris pH 6.0, 0.2 % SDS).
NMR spectroscopy
SEM2(49-107) is a peptide of 59 amino acid residues corresponding to 49-107 amino acid residues of the human semenogelin-2 protein. Amino acid sequence of SEM2(49-107) peptide: MFGQKDQQHTKSKGSFSIQHTYHVDINDHDWTRKSQQYDLNALHKATKSKQHLGGSQQLLEHHHHH. The SEM2(49-107) construct used in this study comprises 59 amino acids including 6 histidine residues (6 × his tag) at the C-terminus to aid in protein purification using affinity chromatography. The 1D (1H), 2D (1H-15N) and 3D (1H-13C-15N) NMR spectra of the SEM2 (49-107) peptide (C=0.46mM, H2O+D2O/ 90%+10%) were carried out on NMR spectrometer with a 700 MHz frequency (Bruker, AVANCE III 700) equipped with a quadruple resonance (1H, 13C, 15N, 31P) CryoProbe at temperature of 293K. Data processing was performed using the Bruker Topspin 3.6 software. All spectra were analyzed in the CCP-NMR program (Vranken, W.F. 2005). During the experiments, no changes were observed in the 1H NMR spectra of the peptide SEM2 (49-107). Based on this, it is assumed that there have been no structural changes in the peptide.
The 2D 1H-15N HSQC experiment was used to determine the chemical shifts of the nitrogen 15N signals and assign their spin systems.
The following 3D (1H-13C-15N) NMR experiments were performed to reconstruct the amino acid sequences: HNCO, HN(CO)CA, HNCA, HN(CA)CO, HN(CO)CACB, HNCACB, HN(CO)CC, 15N-1H HSQC-TOCSY (0.12s mixing time) and 15N-1H HSQC-NOESY (0.1 s mixing time), to find the HN, N, CO signals of the main chain, and the Ca, Cb, Ha, Hb and others signals of the side chains. The list of recorded NMR spectra is provided in Table 1. The obtained values of chemical shifts for various amino acids were compared to the literature data (https://bmrb.io/ref_info/csstats.php). Spectra for the measurement of longitudinal (R1) and transverse (R2) relaxation rates of the 15N amide nuclei and 15N-1H heteronuclear NOE values were collected on NMR spectrometer with a 700 MHz frequency at 293K.
Assignments and data deposition
The 2D 1H–15N HSQC NMR spectrum of SEM2(49-107) peptide with resonance assignments are presented in Fig. 1. This experiment was used to determine the chemical shifts of the nitrogen 15N signals and assign their spin systems. We showed the labeled 1H-15N HSQC spectrum for SEM2(49-107) peptide in Fig. 1, while the residues in His-tag were not assigned in this figure. The unassigned/unpicked resonances in Fig. 1 were from NH2 region and 6 histidine residues (6 × his tag) at the C-terminus of amino acids.
Backbone and side-chain resonance assignments of SEM2(49-107) peptide were obtained based on the conventional multi-dimensional experiments. The backbone data was obtained from HNCO, HN(CA)CO, HN(CO)CA, HNCA, HN(CO)CACB, HNCACB spectra. Side-chain resonance assignment of SEM2(49-107) was carried out using HNCACB, HBHACONH, 15N-1H HSQC-TOCSY, 15N-1H HSQC-NOESY and HC(CO)NH experiments. We assigned most of backbone and side-chain resonances of the SEM2(49-107) peptide, including 82.8% of HN, 82.8% of 15N, 81.4% of 13Cα, 76.3% of 13Cβ, 76.3% of 1Hα, 62.7% of 1Hβ and 83.1% of 13C chemical shifts.
Standard triple resonance assignment of protein backbone is based on HNCACB and HN(CO)CACB spectra (Schmieder 2006). The idea is that the CBCANH correlates each NH group with the Cα and Cβ chemical shifts of its own residue (strongly) and of the previous residue (weakly) (Sanchugova 2021). By superimposing the HNCACB spectrum on HN(CO)CACB, signals belonging to Cαi-1 and Cβi-1 of the previous residue can be identified. The HNCA and HN(CO)CA experiments give the same information as the HNCACB and HN(CO)CACB expect for the absence of Cβ resonance.
The 1H, 15N, and 13C chemical shift values for SEM2(49-107) have been deposited at the BioMagResBank (http://www.bmrb.wisc.edu/) under BMRB accession number 52356. Most of the unassigned residues were in the central region of SEM2(49-107) and showed severe overlapping, broadening or were otherwise difficult to distinguish from noise signals in the spectra. The spectra did not contain signals of amide proton HN in the center of the chain between 72 (Asp) and 78 (Thr) amino acids, 86 (Asp) and 88 (Asn). This may be due to their high conformational mobility and rapid chemical exchange of amide protons HN and deuterium of the solvent for these residues.
Table 1. Acquisition parameters used in the assignment experiments
№
|
Experiment
|
Time domain
data size (points)
|
Spectral width
(ppm)
|
ns
|
Delay time
[s]
|
T1
|
T2
|
T3
|
F1
|
F2
|
F3
|
|
|
1
|
1H-15N HSQC
|
512
|
2048
|
-
|
12 (1H)
|
40 (15N)
|
-
|
2
|
1.6
|
2
|
HNCO
|
88
|
48
|
2048
|
22 (13С)
|
36 (15N)
|
15 (1H)
|
8
|
1
|
3
|
HN(CA)CO
|
128
|
48
|
2048
|
22 (13C)
|
36 (15N)
|
15 (1H)
|
16
|
1
|
4
|
HNCA
|
128
|
52
|
2048
|
50 (13C)
|
35 (15N)
|
13 (1H)
|
8
|
1
|
5
|
HNCACB
|
128
|
48
|
2048
|
75 (13C)
|
35 (15N)
|
15 (1H)
|
16
|
1
|
6
|
HN(CO)CACB
|
128
|
64
|
2048
|
80 (13C)
|
35 (15N)
|
14 (1H)
|
8
|
1
|
7
|
HN(CO)HCC
|
128
|
56
|
2048
|
12 (1H)
|
30 (15N)
|
12 (1H)
|
16
|
1.2
|
8
|
HN(CO)HAHB
|
128
|
80
|
2048
|
14 (1H)
|
31 (15N)
|
14 (1H)
|
12
|
1.4
|
9
|
HNHAHB
|
128
|
64
|
2048
|
14 (1H)
|
31 (15N)
|
14 (1H)
|
12
|
1.6
|
10
|
HNHA
|
64
|
96
|
2048
|
12 (1H)
|
12 (15N)
|
34 (1H)
|
16
|
1.2
|
11
|
13C-NOESY-HSQC
|
256
|
48
|
2048
|
12 (1H)
|
70 (13C)
|
12 (1H)
|
8
|
1.2
|
12
|
15N-HSQC-TOCSY
|
164
|
48
|
2048
|
12 (1H)
|
34 (15N)
|
12 (1H)
|
8
|
1.2
|
13
|
1H-15N HSQC for T2
|
512
|
2048
|
-
|
12 (1H)
|
34 (15N)
|
-
|
2
|
2.0
|
14
|
1H-15N HSQC for T1
|
512
|
2048
|
-
|
12 (1H)
|
34 (15N)
|
-
|
2
|
2.0
|
15
|
HetNOE
(1H-15N HSQC-NOE)
|
512
|
2048
|
-
|
12 (1H)
|
34 (15N)
|
12
|
8
|
3.5
|
The secondary structure of SEM2(49-107) peptide was valued by using two approaches, secondary chemical shifts analysis (CSI) (Wishart, D. S.1992) and TALOS-N (Shen, Y. 2009) prediction (Fig. 3 and Fig. 4(a) and 4(b). Secondary chemical shifts are obtained by calculating the differences between the assigned chemical shifts and the theoretical random coil chemical shift (Fig 2). According to the CSI (chemical shift index) values, a sequence of three or more consecutive "-1" is referred to as an α-helical element. A sequence consisting of three or more consecutive "1", on the other hand, is termed a β-sheet.
To calculate secondary chemical shifts, one typically compares the assigned chemical shifts of a protein to the corresponding theoretical chemical shifts expected for a random coil conformation. The value of these secondary chemical shifts for Hα, Cα, Cβ and C’ (Δδ13Cα, Δδ13Cβ, Δδ13C’, Δδ13Hα is related to the secondary structure of the individual residues. Positive values of Δδ13 Cα and Δδ13 correspond to a a-helical conformation whereas negative values correspond to β - sheet structure. For Δδ13Cβ positive values reflect β-sheet structure and negative values α-helical conformation (Jacobsen 2022) (Fig.3).
By using the consensus chemical shift index from all spectra mentioned above we found that SEM2 (49-107) peptide contains alpha-helixes fragments at the C-terminus.
The secondary structure elements were predicted by the TALOS-N webserver (Shen 2013) using Cα, Cβ, CO, N and HN chemical shift values of SEM2(49-107) peptide. Figure 4a represents dihedral angles of SEM2(49-107) peptide and Fig. 4b depicts the α-helices and random coil probability predicted by TALOS-N (Maksimov 2021).
Relaxation measurements and analysis
In NMR experiments we determined the relaxation parameters of 15 N nuclei at a temperature of 293 K. We determined the rates of longitudinal relaxation (R1) and transverse relaxation (R2) and the magnitude of heteronuclear Overhauser effects 15N{1H}.
To assess the mobility of the SEM2 (49-107) peptide, a series of NMR relaxation experiments were performed. Specifically, a phase-sensitive 2D 1H-15N HSQC experiment was used to measure relaxation times T1 and T2 at various delay times. The relaxation delays to measure R1 were 0.01, 0.05, 0.1, 0.2, 0.4, 0.6, 0.8, 1.2, 1.6, and 2.0 s; and to measure R2 the values were 0, 16, 32, 48, 64, 96, 128, 160, 192 and 256 ms. Based on these experiments, values for the longitudinal and transverse relaxation rates R1 and R2 were determined (Figure 5).
The observed difference in the R2/R1 rate values suggests that there is a variation in mobility and conformational dynamics along the peptide sequence. Specifically, it indicates that the C-terminus of the peptide has less mobility compared to the N-terminus. In addition, the R2/R1 rate values indicate that there is secondary structure in the form of an alpha helix within the amino acid range of 89 (Ala) to 98 (Gln). This suggests that this region of the peptide adopts a relatively constrained conformation, which is consistent with the results of the secondary structure analysis.
The correlation time (τc) is used to estimate the dynamics of the SEM2(49-107) peptide. This parameter of τc shows the time it takes for a protein molecule to rotate by one radian in solution (Rossi et al., 2010). For the SEM2 (49-107) peptide, an average correlation time τc=9.1 ns was calculated. This value may indicate that the protein exists in a monomeric state. The results of the hetNOE 15N{1H} experiment is consistent with the NMR experiments for measuring the rate of R1 and R2 relaxation. In the NMR experiments, the average hetNOE value was approximately 0.5-0.6.