3.1. Genome structural features and organization
Total sizes of S. arundinetorum and M. pictum mitogenomes are 15,282 bp and 14,352 bp, respectively (GenBank accessions OP289102 and OP311642) (Table 1), which are similar with the sizes of other stylommatophoran species that range from 13,798 bp for Camaena poyuensis to 16,323 bp for Achatinella mustelina [24]. The two mitochondrial genomes show similar structure, including the circular molecules encoding 13 protein-coding genes, 2 rRNAs, and 22 tRNAs, as well as one typical control region characterized by most of the land snails. Furthermore, 9 PCGs (cob, cox1, cox2, nad1, nad2, nad4, nad4 L, nad5, nad6), 12 tRNA genes (trnA, trnC, trnD, trnF-trnL1, trnP, trnS L, trnV), one rRNA gene (rrnL), and the control region are located on the light strand (L-strand). The other mitochondrial genes are located on the H-strand in same orientation, excepting for trnW and trnY in S. arundinetorum that are located on the heavy strand (H-strand) in reversed orientation (Fig. 1).
Similar with other stylommatophoran species, the mitogenomes of M. pictum and S. arundinetorum showed the high A + T contents, which reach to 72.1% and 76.78%, respectively (Suppl. material 3) [40]. Moreover, the AT skew values are − 0.12 (S. arundinetorum) and − 0.094 (M. pictum), and GC skew values are 0.047 (S. arundinetorum) and 0.019 (M. pictum). This result indicated the higher ratios of T or G than A and C nucleotides (Table 2 and Suppl. material 3), which is consistent with the previous studies [25]. Additionally, 13 genes in S. arundinetorum and 18 genes in M. pictum showed a certain extent of overlaps with nearby genes, with overlapped length ranging from 7 bp to 29 bp and 1 bp to 40 bp, respectively. Totally, we identified 105 bp and 172 bp intergenic sequences in two mitogenomes, with length ranging from 1 bp to 46 bp and 1 bp to 70 bp, respectively (Table 1).
3.2. Protein-Coding Genes, rRNA Genes, and Codon Usage
Among the 13 PCGs, the largest gene is nad5 (1674 bp in S. arundinetorum and 1626 bp in M. pictum) and the smallest gene is atp8 (147 bp and 159 bp), which is consistent with the previous studies [40]. The AT skew values of 13 PCGs were almost negative, while their GC skew values were almost positive. ‘ATG’, ‘ATA’, ‘ATT’, and ‘TTG’ were used as the start codons except the nad4l in S. arundinetorum that displayed incomplete start codon and the atp8 in M. pictum that started with ‘GTG’. Most of the PCGs stop with TAG or TAA, excepting for the cox2, cox3, nad2 in M. pictum and the nad2, nad4 in S. arundinetorum that used a single T– or TA- as stop codons (Table 1).
Then, we performed Ka/Ks analysis for 12 PCGs to estimate the evolutionary-selection constraints on two species (no value was calculated for the atp8). The Ka/Ks ratios of all PCGs are less than 1, indicating that these genes were evolving primarily under purifying selection. The Ka/Ks ratios of the nad2, nad3, nad4, and nad6 in M. pictum and the nad4l in S. arundinetorum are greater than 0.5 (Suppl. material 7, Suppl. material 2), suggesting that these genes received less pressure from purification and evolved relatively more quickly. The cox1 has the lowest Ka/Ks ratio in both two species and only minor alteration in protein sequence was observed. This gene is also widely used as a molecular marker for species identification and phylogenetic analysis [41]. Most of the PCGs (except the atp8) in both mitogenomes have negative AT-skew values. As for their GC skew values, seven PCGs (cox1, cox2, nad1-nad3, nad4 L, nad6) have positive GC skew values and two genes (atp6 and atp8) show negative values.
The lengths of mitochondrial tRNA genes are similar in the two mitogenomes. The nucleotide composition of tRNA genes are biased toward A and T (Suppl. material 8, Table 2). Consistent with other gastropodous species, majority of the tRNAs have negative AT skew values and positive GC skew values (Suppl. material 8) [23, 40]. The highest frequency of amino acids usage by PCGs were Leu2, Phe, Gly, Ala Val, and Ile (Fig. 3) and the lowest frequency of amino acids usage was Arg, which is similar with other gastropodous species [25, 40]. Moreover, the most frequently used codons were TTA (Leu), ATT (Lie), TTT (Phe), and ATA (Met) that are composed of A/T nucleotides (Fig. 3 and Suppl. material 4).
3.3. The Interspersed Repeat and Control Region
Four types of interspersed repeat, including forward, palindromic, reverse, and complement, were identified in the two mitogenomes. We noticed that the contents of interspersed repeat in S. arundinetorum is significantly higher than M. pictum (Suppl. material 9). The mitochondrial control region (CR) of S. arundinetorum and M. pictum were located between trnK (Lys) and trnV (Val), and trnQ (Gln) and trnC (Cys), respectively (Fig. 1). The length of CR in S. arundinetorum is obviously longer than M. pictum (1,243 bp versus 411 bp), suggesting that the former has a more complicated mitochondrial CR. Similarly, the nucleotide composition of CR was biased toward A and T in both mitogenomes, with proportions of 78.92% in S. arundinetorum and 76.16% in M. pictum.
3.4. Phylogenetic Analysis and Divergence Times
To phylogenetically analyze the two systellommatophoran species, we selected 21 additional gastropod species with published mitogenomes for construction of phylogenetic tree. Among them, Rhopalocaulis grandidieri (Systellommatophora) and Auriculastra duplicate (Basommatophora) were selected as the outgroup. Then, we performed Maximum likelihood (ML) and Bayesian inference (BI) analyses based on the 13 PCGs of these species, which produced almost identical topologies with strong bootstrap and posterior probability values (Fig. 4, Suppl. material 10, and Suppl. material 11). According to the phylogenetic tree, the stylommatophoran species were clustered on the same branch and can be divided into five major subclades. Consistent with previous studies, family Philomycidae, Arionidae, and Oreohelicidae were phylogenetically closed, and the M. pictum and Philomycus bilineatus were clustered together [25, 42]. S. arundinetorum, S. putris, and O. wujiaquensis belonging to Succineidae were grouped together, and S. arundinetorum and O. wujiaquensis are phylogenetically more closed.
The time-calibrated phylogeny indicated that Stylommatophora diverged from Systellommatophora and Basommatophora approximately 317.17 million years ago (Mya) (Fig. 4), which is complying with the previous study [43]. Succineidae diverged from other Stylommatophora species about 258.87 Mya ago, and S. arundinetorum and O. wujiaquensis were differentiated about 13.20 Mya ago. Philomycidae originated approximately 198.08 Mya ago, and M. pictum and P. bilineatus were divergent about 15.39 Mya ago. Based on the time-calibrated phylogenetic tree, Stylommatophora was divergent properly around 200 to 300 Mya ago that match the Triassic Period in the Earth. During this period, the climate was more appropriate and the Earth underwent outbreak of speciation, including cephalopods, crustaceans, fish, amphibians, reptiles, speculations, terrestrial snail and so on [44].
3.5. Mitochondrial gene order and rearrangements
According to the previous report, mitochondrial gene rearrangements were commonly occurred in gastropods species [45]. To study the order of mitochondrial genes in M. pictum and S. arundinetorum, we designated the gene order in Achatina fulica mitogenome as the ancestral pattern of Stylommatophora because it displayed same gene order with the outgroups (Rhopalocaulis grandidieri and Auriculastra duplicate). Overall, we observed 4 unique PCG orders in the mitogenomes of the 23 species analyzed in this study (Fig. 5). Noticeably, the order of mitochondrial PCGs in M. pictum is identical with Philomycus bilineatus while displays great differences with the other species. We speculated that the mitochondrial PCGs in M. pictum had experienced four successive inversion arrangements (nad6-nad5-nad1-nad4L vs. cob-cox2-atp8-atp6, cox3 vs. nad4, cox2 vs. atp8-atp6, and cox2 vs. cox3) (Fig. 6).
When considered all types of genes, we identified 15 types of gene orders in the mitogenomes of the 23 species, indicating the higher frequency of gene rearrangements for the tRNAs. Compared with the ancestral species, three tRNA inversions occurred in S. arundinetorum (trnD-trnC vs. trnF, trnG-trnH-trnQ-trnl2-atp8-trnN-atp6-trnR-trnE-12S-trnM-nad3-trnS2 vs. trnY-trnW, and trnL1-trnA vs. trnP). Furthermore, we also observed three tRNA inversions in Punctum randolphii (trnS1-nad4 vs. trnT-cox3, trnG-trnH vs. trnQ-trnL2-atp8-trnN-atp6-trnR-trnE-12S-trnM-nad3-trnS2-trnT-cox3, trnP vs. trnA) and Arion ater (trnL2-atp8-trnN-atp6-trnR vs. trnE-12S-trnM, trnA vs. trnP, trnE vs. trnQ) (Fig. 6).
Compared to other stylommatophoran species, M. pictum displays higher frequency of both PCG and tRNA rearrangements. For example, compared with A. fulica, five large-scale gene rearrangement events (trnP-nad6-nad5-nad1-nad4 L vs. cob-trnD-trnC-trnF-cox2-trnY-trnW-trnG-trnH-trnQ-trnL2-atp8-trnN-atp6-trnR-trnE-12S-trnM, trnL2-ATP8-trnN-ATP6-trnR-trnE-12S-trnM vs. cox2-trnY-trnW-trnG-trnH-trnQ, trnS2-trnS1-nad4 vs. trnT-cox3, cox2-trnY-trnW-trnG-trnH-trnQ vs. trnP-nad6-nad5-nad1-nad4 L-nad3, trnL1 vs. 16S) were observed (Fig. 6) [24, 42]. In addition, some gene clusters, such as nad6-nad5-nad1-nad4 L, trnY-trnW, trnG-trnH, and trnI-nad2-trnK, are conserved in these mitochondrial genomes, which can be served as a synapomorphy for the terrestrial molluska species.