Sample collection
The materials were collected from the Shanxi Province of China in March 2023 (Fig. 1). The materials were directly picked up from stones using knives and tweezers and transferred to the laboratory subsequently. Water quality parameters, including Water temperature (WT), pH, salinity, Secchi depth (SD), dissolved oxygen (DO), electronic conductivity (EC), and total dissolved solids (TDS), were measured using handheld meters (YSI Professional Plus Multiparameter Water Quality Instrument 19E102487, YSI Incorporated, Brannum Lane Yellow Springs, Ohio, USA). COD, ammonium (NH4+), total nitrogen (TN), and total phosphorus (TP) were determined by dichromate method, Nessler’s reagent spectrophotometry, ultraviolet spectrophotometry, and ammonium molybdate spectrophotometric, respectively (State Environmental Protection Administration 2002). The samples were washed with sterile water several times to remove impurities. Voucher specimens were preserved in 4 % formaldehyde. Voucher specimens were deposited in the Herbarium of Shanxi University (SXU), Shanxi University, Taiyuan, Shanxi Province, China (Voucher number: SXU-SX230328).
Morphological observations
Morphological characters of the specimens were observed under an Olympus BX-51 microscope (Olympus, Tokyo, Japan) equipped with a digital camera for photographing (DP72 Olympus, Tokyo, Japan).
DNA extraction, amplification, and sequencing
Total DNA was extracted from the fresh thalli collected from Shanxi province using a plant DNA extraction kit (Sangon Biotech, Shanghai, China). The four genes (SSU, LSU, ITS, and rbcL) were amplified using the paired primers listed in Table 1. The polymerase chain reaction (PCR) amplifications were conducted in 50 μL volumes containing 37.75 μL ddH2O, 5.0 μL 10 × buffer, 4.0 μL 2.5 mM dNTPs, 0.25 μL Taq DNA polymerase (Sangon Biotech, Shanghai, China), 1.0 μL of each primer (10 mM) and 1.0 μL of genomic DNA. The amplifications were performed using the following programs: 94 ℃ for 5 min, 35 cycles of 94 ℃ for 30–60 s, 46.5–59 ℃ for 30–60 s, and 72 ℃ for 2 min, and final 72 ℃ for 10 min. The reaction was undertaken in a MyCycler thermal cycler (Bio-Rad, Hercules, CA, USA). In each experiment, three replicates were used. The sequencing was performed on an ABI 3730XL sequencer. The DNA sequences generated in this study have been deposited in GenBank under accession numbers (OR230247, OR336050, OR284295, and PP025381).
Table 1 Primers for amplifying and sequencing of the nuclear SSU, LSU, and rbcL rDNA
Designation
|
Sequence (5’–3’)
|
Reference
|
SSU
|
|
|
16s
|
CCGAATTCGTCGACAACCTGGTTGATCCTGCCAGT
|
(Medlin et al., 1988)
|
16f
|
CCCGGGATCCAAGCTTGATCCTTCTGCAGGTTCACCTAC
|
LSU
|
|
|
5.8SF
|
CGATGAAGAACGCAGCGAAATGCGAT
|
(Riisberg et al., 2009)
|
LSU 4256R
|
GGAWTATGACTGAACGCCTCTAAGTCAGA
|
28S_25F
|
ACCCGCTGAATTTAAGCATATA
|
(Jo et al., 2011)
|
28S_861R
|
GTTCGATTAGTCTTTCGCCCCT
|
28S_736F
|
CCCGAAAGATGGTGAACTC
|
28S_1440R
|
TGCTGTTCACATGGAACCTTTC
|
28S_1228F
|
CCTGAAAATGGATGGCGC
|
28S_2160R
|
CCGCGCTTGGTTGAATTC
|
28S_2038F
|
GACAAGGGGAATCCGACT
|
28S_2812R
|
GATAGGAAGAGCCGACATCGAA
|
rbcL
|
|
|
Chryso_rbcL_F4
|
TGGACDGAYTTATTAACDGC
|
(Pusztai and Škaloud, 2019)
|
Chryso_rbcL_R7
|
CCWCCACCRAAYTGTARWA
|
ITS
|
|
|
ITS4
|
TCCTCCGCTTATTGATATGC
|
(White et al., 1990)
|
KN1.1
|
CAAGGTTTCCGTAGGTGAACC
|
(Wee et al., 2001)
|
Sequence alignment and phylogenetic analysis
Newly obtained sequences in this study and downloaded sequence data from GenBank (listed in Table S1) were aligned using MAFFT version 7 (Katoh et al. 2019). The sequences of SSU, LSU, and rbcL were concatenated based on the methods of Zhang et al. (2020). Pairwise genetic P-distances of concatenated sequences were calculated in MEGA 5.0 (Tamura et al. 2011). Synchroma grande and Nannochloropsis limnetica were used as outgroups in the phylogenetic tree. The appropriate model was built using the software PartitionFinder 2, with all algorithm and AIC criterion (for BI: Subset (1)(2)(3) = GTR + I + G; for ML: Subset (1)(2)(3) = GTR + I + G) (Lanfear et al. 2017). IQ-TREE was used to construct maximum likelihood (ML) trees with 5000 ultrafast bootstraps repetitions (Nguyen et al. 2015). Bayesian inference (BI) phylogenies were inferred using MrBayes 3.2.6, and the BI analysis was run for 3,000,000 generations (Ronquist et al. 2012). The resulting phylogenetic trees were edited using Figtree 1.4.2 (http://tree.bio.ed.ac.uk/software/figtree/). Adobe Illustrator CS5 (Adobe System, San Jose, CA, USA) was used to optimize the graphics of all trees.
Molecular clock analyses
We employed a Bayesian inference method with a relaxed clock model using Beast v2.6.6 (Bouckaert et al. 2014) to conduct phylogeny and simultaneously estimate branch divergence times. We used the uncorrelated lognormal model to estimate variation rates across all branches. We used fossil calibrations as probabilistic priors. The lognormal priors were used for spits between the species Mallomonas asmundiae and M. striata var. serrata, and between M. elevata and M. mangofera var. foveata. Both calibrations were based on an offset of 38 Ma, a mean of 0.5 Ma, and a standard deviation of 1.0, which represents a minimal age estimate for the majority of fossils of Mallomonas species from the Gtraffe Pipe locality (Creaser et al. 2004; Doria et al. 2011; Siver et al. 2015). A generalized time reversible (GTR) + gamma site model was applied to the three-gene concatenated data set and a Yule tree prior was used as a speciation model. The analysis was run for 50 million generations with the chain sampled every 1000 generations. Convergence of parameter estimates and estimation of burn-in was checked using the program Tracer version 1.7 (Rambaut et al. 2018). The initial 5,000,000 (10%) were removed, and the rest were retained to construct the final chronogram with 95% posterior probabilities (PP) and age estimates for all nodes. The resulting phylogenetic trees were edited using Figtree 1.4.2 (http://tree.bio.ed.ac.uk/software/figtree/) and optimized using Adobe Illustrator CS5 (Adobe System, San Jose, CA, USA).
ITS2 secondary structures.
The ITS2 sequences of the genera Mallomonas and Synura were downloaded from GenBank and aligned with the sequence of H. foetidus obtained in this study using MAFFT version 7 (Katoh et al. 2019). The ITS2 secondary structure of H. foetidus was constructed using the mfold computer program (Zuker 2003) and VARNA (Darty et al. 2009).