Identification and physicochemical property analysis of MYC genes
A total of 40 MYC genes were identified in the genomes of four Cucurbitaceae crops. 10 in C. sativus, 8 in C. melo, 12 in C. lanatus, and 10 in B. hispida. All the MYC genes were unequally distributed on each chromosome. For example, in C. sativus L., there are 6 MYC genes on chromosome 3, while there are none on chromosome 1, 2, 4, and 5. In C. melo L., except for chromosomes 2, 3, 5, 7, 8, 9, 10, and 12 where there is no distribution, the other chromosomes each have 1, 2, or 3 MYC genes (Fig. 1). Sequence analysis revealed that the amino acid lengths encoded by CsMYC genes varied from 431aa (CsMYC8) to 694aa (CsMYC5), by CmMYC genes varied from 433aa (CmMYC8) to 745aa (CmMYC1), by ClMYC genes varied from 423aa (ClMYC3) to 969aa (ClMYC1), and by BhMYC genes varied from 501aa (BhMYC4) to 968aa (BhMYC3). In Cucurbitaceae crops, except for the CsMYC1, CmMYC5, ClMYC10 and BhMYC1 were acidic proteins, the others are alkaline proteins. The most proteins are unstable (instability index greater than 40), while the CsMYC5, CmMYC1, ClMYC12, BhMYC7, and BhMYC9. The average hydrophilicity values of all are less than 0, indicating that all the proteins are hydrophobic. Subcellular localization prediction results revealed that the most MYC proteins are localized in the cell nucleus, followed by chloroplasts. (Table 2).
Table 2
Protein information of MYC gene family members in Cucurbitaceae crops.
Species | Gene ID | Name | Number of amino acids (aa) | Molecular weight (D) | pI | Instability index | Aliphatic index | Average of hydropathicity | Prediction of subcellular location |
Cucumis sativus L. | CsaV3_3G000850.1 | CsMYC1 | 447 | 49398.41 | 8.65 | 43.61 | 82.37 | -0.416 | Nucleus |
CsaV3_3G001710.1 | CsMYC2 | 642 | 69911.19 | 6.21 | 50.44 | 64.14 | -0.578 | Nucleus |
CsaV3_3G007980.1 | CsMYC3 | 649 | 71943.89 | 5.83 | 49.00 | 77.66 | -0.490 | Nucleus |
CsaV3_3G022420.1 | CsMYC4 | 501 | 55301.14 | 5.70 | 44.81 | 78.60 | -0.445 | Nucleus |
CsaV3_3G034600.1 | CsMYC5 | 694 | 78486.44 | 5.24 | 38.28 | 84.84 | -0.370 | Chloroplast\Nucleus |
CsaV3_3G049150.1 | CsMYC6 | 688 | 75666.20 | 5.11 | 58.15 | 69.71 | -0.617 | Nucleus |
CsaV3_6G000530.1 | CsMYC7 | 644 | 72769.50 | 5.51 | 43.25 | 78.57 | -0.500 | Nucleus |
CsaV3_6G008940.1 | CsMYC8 | 431 | 48394.52 | 5.42 | 47.63 | 76.43 | -0.429 | Nucleus |
CsaV3_6G037080.1 | CsMYC9 | 650 | 71869.01 | 5.91 | 48.30 | 83.31 | -0.347 | Nucleus |
CsaV3_7G027460.1 | CsMYC10 | 691 | 76190.07 | 5.66 | 41.31 | 76.58 | -0.372 | Nucleus |
Cucumis melo L. | MELO3C015748.2.1 | CmMYC1 | 745 | 82003.24 | 5.70 | 37.16 | 81.25 | -0.269 | Nucleus |
MELO3C003412.2.1 | CmMYC2 | 723 | 79597.55 | 5.32 | 56.07 | 68.62 | -0.628 | Nucleus |
MELO3C024041.2.1 | CmMYC3 | 501 | 55262.12 | 6.16 | 48.09 | 78.06 | -0.486 | Nucleus |
MELO3C006016.2.1 | CmMYC4 | 586 | 65067.91 | 5.55 | 47.76 | 80.00 | -0.535 | Nucleus |
MELO3C013772.2.1 | CmMYC5 | 442 | 48984.93 | 7.68 | 48.03 | 81.52 | -0.439 | Nucleus |
MELO3C013851.2.1 | CmMYC6 | 662 | 72281.88 | 6.03 | 49.58 | 64.24 | -0.570 | Nucleus |
MELO3C021212.2.1 | CmMYC7 | 656 | 74227.22 | 5.61 | 42.86 | 79.07 | -0.473 | Nucleus |
MELO3C022250.2.1 | CmMYC8 | 433 | 48684.93 | 5.59 | 47.11 | 76.07 | -0.442 | Nucleus |
Citrullus lanatus | Cla97C03G062520.1 | ClMYC1 | 969 | 105839.97 | 6.15 | 47.45 | 78.86 | -0.415 | Nucleus |
Cla97C05G080890.1 | ClMYC2 | 618 | 68589.08 | 5.85 | 46.81 | 76.99 | -0.547 | Nucleus |
Cla97C06G112130.1 | ClMYC3 | 423 | 47569.45 | 5.14 | 52.02 | 71.70 | -0.513 | Nucleus |
Cla97C06G112140.1 | ClMYC4 | 427 | 47692.74 | 5.09 | 52.44 | 52.44 | -0.424 | Nucleus |
Cla97C06G113160.1 | ClMYC5 | 645 | 72922.74 | 5.78 | 46.22 | 79.64 | -0.469 | Nucleus |
Cla97C07G128490.1 | ClMYC6 | 637 | 70287.21 | 6.36 | 45.59 | 81.93 | -0.368 | Nucleus |
Cla97C07G129080.1 | ClMYC7 | 501 | 55179.91 | 5.96 | 49.98 | 76.47 | -0.504 | Nucleus |
Cla97C09G170270.1 | ClMYC8 | 690 | 76046.16 | 5.87 | 40.82 | 80.36 | -0.345 | Nucleus |
Cla97C09G174730.1 | ClMYC9 | 680 | 74878.38 | 5.24 | 53.48 | 70.68 | -0.610 | Nucleus |
Cla97C10G185380.1 | ClMYC10 | 463 | 51021.92 | 8.77 | 53.53 | 79.74 | -0.471 | Nucleus |
Cla97C10G186220.1 | ClMYC11 | 694 | 76681.99 | 6.35 | 49.20 | 64.81 | -0.573 | Nucleus |
Cla97C10G204640.1 | ClMYC12 | 695 | 78085.01 | 4.97 | 38.55 | 83.87 | -0.326 | Chloroplast\Nucleus |
Benincasa hispida | BhiUN179M27 | BhMYC1 | 453 | 49798.89 | 8.19 | 47.72 | 83.43 | -0.371 | Nucleus |
Bhi01M000362 | BhMYC2 | 617 | 68250.67 | 5.77 | 44.99 | 78.36 | -0.520 | Nucleus |
Bhi02M001191 | BhMYC3 | 968 | 105559.53 | 6.12 | 44.22 | 79.96 | -0.421 | Nucleus |
Bhi05M000179 | BhMYC4 | 501 | 55182.03 | 6.02 | 45.65 | 79.20 | -0.464 | Nucleus |
Bhi05M000251 | BhMYC5 | 647 | 71471.31 | 5.91 | 43.64 | 80.06 | -0.370 | Nucleus |
Bhi05M000336 | BhMYC6 | 682 | 75237.76 | 5.22 | 48.52 | 71.17 | -0.632 | Nucleus |
Bhi09M000958 | BhMYC7 | 604 | 66789.67 | 6.01 | 39.23 | 80.83 | -0.350 | Nucleus |
Bhi11M000137 | BhMYC8 | 660 | 72104.63 | 5.99 | 47.48 | 65.35 | -0.570 | Nucleus |
Bhi11M001900 | BhMYC9 | 697 | 78638.85 | 5.07 | 39.16 | 84.61 | -0.314 | Chloroplast\Nucleus |
Bhi12M001654 | BhMYC10 | 643 | 72375.09 | 5.60 | 47.35 | 80.34 | -0.451 | Nucleus |
Structure and phylogenetic analysis of Cucurbitaceae crops MYC gene
Motif prediction analysis on the protein sequences of the MYC family members showed that when the number of motifs is limited to 10, no MYC family member contains all the motifs (Fig. 2). And the motif 1, 2, 5, 7, and 8 were relatively conservative, common to all MYC genes. Gene structure analysis revealed that the number of exons in MYC genes ranges from 1 to 15 (Fig. 2). The gene structures of most MYC genes within the same lineage are similar, further indicating the conservation of protein motifs and gene structures within the same evolutionary branch of MYCs.
According to the alignment of bHLH domains in MYC proteins, there was a basic amino acid region (Basic) composed of approximately 12 amino acids. This region contains a highly conserved H4-V5-E6-E8-R9-R11-R12 sequence, which is essential for the binding of bHLH to target genes. This region also includes two helical structures, comprising approximately 37 amino acids. Notably, the 22th and 38th leucine (Leu) amino acids in the HLH domain are highly conserved, indicating their necessity for dimer formation (Fig. 3).
To clarify the evolutionary relationships among the MYC gene family in Cucurbitaceae crops, Zea mays, Brachypodium distachyon, and Oryza sativa, we constructed a phylogenetic tree. As the result showed that All MYC proteins could be divided into five subgroups, labeled as I to Ⅴ. In each subgroup, there are both monocotyledonous and dicotyledonous plants, indicating that the MYC genes were relatively conserved during the evolutionary processes of both monocots and dicots. Group Ⅳ is the largest subgroup, consisting of 4, 4, 6, 3, 2, 1, and 1 MYC proteins from C. sativus L., C. melo L., C. lanatus, B. hispida, Z.mays, B. distachyon, and O. sativa, respectively. while the group I was the smallest, only with 4 MYC genes (Fig. 4). In group Ⅱ, there were 7 MYC genes, while only 1 belongs to monocotyledonous plants.
Collinearity analysis of MYC gene among Cucurbitaceae crops
To infer the evolution of MYC genes, synteny analysis was carried out among the four Cucurbitaceae species (Fig. 5; Table 3). A total of 36 MYC genes (9 in C. sativus L., 7 in C. melo L., 11 in C. lanatus, and 9 in B. hispida) were located within synteny blocks of the four Cucurbitaceae genomes. We found five orthologous gene pairs which exist among all four species. The result showed that these MYC genes were conserved during the evolution of all the four Cucurbitaceae species, suggesting the conserved roles in Cucurbitaceae species. Furthermore, it was also observed that some MYC genes were lost in some species. For instance, certain MYC genes were found in C. sativus L., C. melo L., and B. hispida, but were lost in B. hispida, such as the CsaV3_3G000850/MELO3C013772.2/Cla97C10G185380 collinear gene pair. The result showed that the specific traits in different Cucurbitaceae species during the evolution and the amplification of genome. In addition, there were two collinear gene pairs only existed in C. lanatus, and B. hispida.
Table 3
The collinear gene pairs existed in four Cucurbitaceae genomes.
Number | Cucumber | Melon | Watermelon | Waxgourp |
1 | CsaV3_3G000850 | MELO3C013772.2 | Cla97C10G185380 | - |
2 | CsaV3_7G027460 | MELO3C015748.2 | Cla97C09G170270 | Bhi09M000958 |
3 | CsaV3_6G037080 | - | Cla97C07G128490 | Bhi05M000251 |
4 | CsaV3_6G008940 | MELO3C022250.2 | Cla97C06G112130 | - |
5 | CsaV3_6G000530 | MELO3C021212.2 | Cla97C06G113160 | Bhi12M001654 |
6 | CsaV3_3G049150 CsaV3_3G001710 | MELO3C003412.2 MELO3C013851.2 | Cla97C09G174730 Cla97C10G186220 | Bhi05M000336 Bhi11M000137 |
7 |
8 | CsaV3_3G007980 | MELO3C006016.2 | Cla97C05G080890 | Bhi01M000362 |
9 | CsaV3_3G034600 | - | Cla97C10G204640 | Bhi11M001900 |
10 | - | - | Cla97C03G062520 | Bhi02M001191 |
11 | - | - | Cla97C07G129080 | Bhi05M000179 |
The regulatory TFs of MYC genes
The 1.5-kb upstream sequences of MYC genes were selected to predicate theirs regulatory TFs. As the results showed that three types of cis-elements related to development, hormone stress, and abiotic stresses were identified (Fig. 6). Among the cis-elements related to development, the number of G-box (CACGTC) elements is the highest, which is a light-responsive element. For example, genes of MELO3C021212.2.1, MELO3C003412.2.1, and Cla97C10G186220.1 contain 9 G-box elements, indicating that they might be regulated by the light environment. For the cis-elements related to hormone stress, the abscisic acid (ABA)-responsive element ABRE (ACGTG) occupies a relatively large proportion, 8 in genes of MELO3C021212.2.1, MELO3C003412.2.1, and Cla97C10G186220.1, 6 in genes of Bhi02M001191, CsaV3_3G001710.1, and Bhi11M000137. Among the cis-elements related to abiotic stresses, anaerobic induction ARE (AAACCA) were detected in a series of members, such as 6 in Cla97C07G128490.1 and CsaV3_3G007980.1, 5 in Cla97C05G080890.1.
Tissue-specific expression analysis of MYC genes in C. sativus
To investigate the expression profiles of the MYC gene family in different tissues, using cucumber as a representative, we conducted the transcriptome analysis based on publicly available cucumber transcriptome sequencing data among various tissues (PRJNA80169). We utilized the cucumber ChineseLong_V3 genome information for this reanalysis, focusing on the expression levels of the cucumber MYC gene family in 10 different tissues or organs, including root, stem, leave, tendril, male flower, female flower, ovary, ovary unfertilized, ovary fertilized and tendrils base. As the result showed that there was significant expression variation of the MYC gene family across different tissues (Fig. 7). Such as, While the CsaV3_3G001710 gene showed a relatively high level of expression across all tissues or organs, three genes (CsaV3_3G000850, CsaV3_6G008940, and CsaV3_6G037080), on the other hand, exhibited relatively low expression levels across all tissues or organs. Some genes exhibited significant tissue-specific expression patterns. For example, the CsaV3_3G049150 gene showed relatively high expression in root and fertilized ovary, while demonstrating low expression in other tissues or organs. Similarly, compared to other tissues or organs, the CsaV3_7G027460 gene displayed higher expression in root. The above results indicate that the cucumber MYC family genes played distinct roles in the development of tissues or organs, contributing to various functions in the growth and development of the plant.
Expression analysis of cucumber MYC genes under different stress conditions.
Based on publicly available transcriptome data from the NCBI SRA database, we conducted a analysis of the expression levels of the cucumber MYC genes under both biotic and abiotic (high temperature, low temperature, salt and silicon stress, powdery mildew, and southern root-knot nematode) stress conditions.
Under high-temperature stress, most MYC genes did not show significant differential expression (Fig. 8). For instance, genes of CsaV3_6G037080, CsaV3_3G000850, and CsaV3_6G008940 exhibited no change in expression levels under high-temperature stress, and their expression levels were relatively low. However, gene CsaV3_3G001710 demonstrated a significant upregulation at 6 hours post high-temperature treatment (6hph), while gene CsaV3_3G007980 showed high expression levels at both 3hph and 6 hph. These results suggest that gene CsaV3_3G007980 is likely involved in the response of cucumber to high-temperature stress.
Under low-temperature stress, the expression levels of four genes (CsaV3_6G037080, CsaV3_3G000850, CsaV3_6G008940, and CsaV3_6G000530) showed no significant change and remained at relatively low levels. Two genes exhibited relatively high expression levels during low-temperature treatment, with CsaV3_3G007980 gene showing a significant upregulation at 6hph. The expression levels of the other genes remained unchanged before and after low-temperature treatment. Additionally, CsaV3_3G049150 showed a significant downregulation in expression both at 6hph and 12hph. Worth noting is that the expression levels of all genes did not undergo significant changes at 3hph. These results suggest that CsaV3_3G007980 and CsaV3_3G049150 genes play a key role in the response of cucumber to prolonged low-temperature stress, and CsaV3_3G007980 was positively regulated, while CsaV3_3G049150 was negatively regulated (Fig. 9).
Under salt and silicon stress, most genes did not show differential expression after NaCl and Silicon treatments. One gene (CsaV3_3G000850) exhibited significant downregulation after NaCl treatment and show significant upregulation after Silicon treatment. However, when treated simultaneously with NaCl and Silicon, it displayed a higher degree of downregulation (Fig. 10). Despite the significant differential expression observed for CsaV3_3G00850 gene after treatment, its expression level remained relatively low under both control and stress.
Similarly, we analyzed the response of the MYC gene to biological stress. After inoculation with powdery mildew for 48 hours, most genes showed no significant difference in expression in both resistant (SSL508-28) and susceptible (D8) material (Fig. 11). However, certain MYC genes exhibited differential expression patterns between SSL508-28 and D8. For instance, CsaV3_3G000850, treated with powdery mildew, demonstrated a significantly higher level of upregulation in D8, compared to a relatively lower fold-change in SSL508-28. Interestingly, post-inoculation, the absolute expression level of CsaV3_3G000850 in D8 was much lower than its expression in SSL508-28. In addition, after inoculation with powdery mildew, CsaV3_3G049150 showed a significantly downregulated expression in D8 and a certain degree of upregulation in SSL508-28.
After inoculation with root-knot nematode (Meloidogyne incognita), the expression levels of most genes in both resistant (IL10-1) and susceptible (CC3) material generally exhibited similar trends, such as CsaV3_3G000850 and CsaV3_3G034600 (Fig. 12). However, there were two genes, CsaV3_6G000530 and CsaV3_6G037080, that showed an upregulation trend in resistant materials and a downregulation trend in susceptible materials. Nevertheless, the degree of differential expression for these genes is relatively low, whether in resistant or susceptible materials.
The responding of eight genes of four Cucurbitaceae crops under temperature stress
These eight genes in four Cucurbitaceae crops (CsaV3_3G007980, CsaV3_3G001710, MELO3C006016, MELO3C013851, Bhi01M000362, Bhi11M000137, Cla97C10G186220, Cla97C05G080890) were selected for analysis in response to temperature stress. As the result showed (Fig. 13), in Cucumis sativus, that genes CsaV3_3G007980 and CsaV3_3G001710 showed upregulated expression under both low and high temperature stress, which is relatively consistent with the transcriptome results, indicating their involvement in cucumber response to temperature stress. This expression pattern also appeared in Cucumis melo and Citrullus lanatus. The homologous genes MELO3C006016 and Cla97C05G080890 of CsaV3_3G007980 showed mainly upregulated expression under high temperature stress. However, under low temperature stress, The homologous gene MELO3C013851 exhibited significant downregulation at 3 and 6 hours of low temperature treatment, and then showed upregulated expression at 12 hours of low temperature treatment. While, in Benincasa hispida, these two homologous genes Bhi01M000362 and Bhi11M000137 show significantly downregulated expression in all time periods of both low and high temperature treatments.