Identification of AAAP genes in pepper
To explore the AAAP protein family in pepper, we used one domain (PF01490) searche of Pepper Genome Database2 (http://peppersequence.genomics.cn/page/); the HMM profile was used as a query and each putative AAAP protein sequences was verified by SMART, CDD and Pfam analyses. A total of 53 AAAP genes were identified and renamed in pepper according to their affinities within gene subfamilies; CaGAT1, CaANL4 and CaVAAT1, were not anchored to chromosomes (Table 1). Gene lengths ranged from 669 (CaLHT4) to 2,532 bp (CaAAP4), the molecular weight varies from 24.43 kDa (CaLHT4) to 93.22 kDa (CaAAP4). Thus, the gene length span of the CaAAAP gene family is large and correlated with molecular weight. The isoelectric points (pIs) of CaAAAP proteins ranged from 4.27(CaVAAT5) to 10.06(CaANT5); the majority of proteins (83%) had pIs more than 7.0, which indicates that AAAP proteins in pepper may represent a class of basic protein.
Table 1
The general information and sequence characterization of 53 CaAAAP genes.
S.N. | Genea | Locusb | Locationc | ORF(bp)d | Exone | Proteinf | TM regiong |
Size (aa) | MW(d) | pI |
| AAP group | | | | | | | | |
1 | CaAAP1 | Capana07g002429 | Chr07:220179435–220181692 | 1335 | 7 | 444 | 49460.8 | 8.72 | 11 |
2 | CaAAP2 | Capana07g002430 | Chr07:220188828–220192330 | 1869 | 10 | 622 | 68584.1 | 8.46 | 14 |
3 | CaAAP3 | Capana07g002431 | Chr07:220195003–220198004 | 1410 | 7 | 469 | 51763.2 | 8.45 | 10 |
4 | CaAAP4 | Capana07g002432 | Chr07:220225817:220233681 | 2532 | 13 | 843 | 93224.4 | 8.88 | 17 |
5 | CaAAP5 | Capana04g000780 | Chr04:14469803:14475148 | 1446 | 7 | 481 | 52757.7 | 8.81 | 10 |
6 | CaAAP6 | Capana12g000826 | Chr12:27187513:27194231 | 1467 | 7 | 488 | 53825.3 | 8.94 | 9 |
7 | CaAAP7 | Capana08g002210 | Chr08:143014796:143019992 | 1419 | 7 | 472 | 51747.8 | 9.12 | 11 |
8 | CaAAP8 | Capana04g001588 | Chr04:67204663:67207202 | 1434 | 6 | 477 | 52413.8 | 8.27 | 10 |
9 | CaAAP9 | Capana06g001752 | Chr06:50038242:50040303 | 1419 | 6 | 472 | 51681.9 | 7.84 | 10 |
10 | CaAAP10 | Capana05g001770 | Chr05:174328892:174330262 | 1020 | 3 | 339 | 37207.9 | 6.86 | 7 |
| LHT group | | | | | | | | |
11 | CaLHT1 | Capana02g003614 | Chr02:162887482:162890584 | 1350 | 8 | 449 | 50428 | 8.6 | 9 |
12 | CaLHT2 | Capana02g003615 | Chr02:162905774:162912940 | 1266 | 7 | 421 | 47443 | 8.24 | 9 |
13 | CaLHT3 | Capana02g003616 | Chr02:162914284:162921151 | 1332 | 8 | 443 | 49858.6 | 9.08 | 11 |
14 | CaLHT4 | Capana04g002888 | Chr04:215599914:215604069 | 1227 | 9 | 408 | 46113.8 | 8.27 | 7 |
15 | CaLHT5 | Capana04g001881 | Chr04:130533897:130536648 | 1329 | 6 | 442 | 49917.6 | 8.06 | 10 |
16 | CaLHT6 | Capana11g000230 | Chr11:5761051:5762379 | 1329 | 1 | 442 | 49944.5 | 9.1 | 10 |
17 | CaLHT7 | Capana03g001379 | Chr03:25005836:25008812 | 1329 | 7 | 442 | 49012.6 | 9.4 | 11 |
18 | CaLHT8 | Capana05g000336 | Chr05:7406911:7414486 | 1065 | 7 | 354 | 39858.5 | 9.42 | 7 |
19 | CaLHT9 | Capana11g002248 | Chr11:216341951:216346445 | 1311 | 8 | 436 | 48573.9 | 9.03 | 8 |
20 | CaLHT10 | Capana04g000478 | Chr04:7738487:7744000 | 1581 | 5 | 526 | 57977.4 | 9.61 | 9 |
21 | CaLHT11 | Capana04g000098 | Chr04:1109665:1112218 | 1338 | 5 | 445 | 49110.6 | 8.68 | 10 |
22 | CaLHT12 | Capana08g002793 | Chr08:152269921:152272976 | 1713 | 5 | 570 | 61963 | 9.55 | 9 |
23 | CaLHT13 | Capana11g000398 | Chr11:11019799:11021033 | 708 | | 235 | 25865.9 | 9.01 | 2 |
24 | CaLHT14 | Capana04g000106 | Chr04:1178475:1183677 | 669 | | 222 | 24427.3 | 8.47 | 3 |
| GAT group | | | | | | | | |
25 | CaGAT1 | Capana00g003418 | Chr00:545297054:545303475 | 1365 | 7 | 454 | 49950.5 | 8.68 | 10 |
26 | CaGAT2 | Capana11g000210 | Chr11:5435275:5440152 | 1092 | 6 | 363 | 39923.8 | 9.98 | 9 |
| ProT group | | | | | | | | |
27 | CaProT1 | Capana05g001989 | Chr05:191409867:191415970 | 1320 | 7 | 439 | 47836.8 | 9.73 | 12 |
28 | CaProT2 | Capana05g001990 | Chr05:191424542:191430181 | 1347 | 7 | 448 | 49162.1 | 9.4 | 11 |
29 | CaProT3 | Capana03g002827 | Chr03:118029421:118036334 | 1344 | 7 | 447 | 49190.2 | 9.61 | 12 |
| AUX group | | | | | | | | |
30 | CaAUX1 | Capana09g001555 | Chr09:181029189:181033262 | 1467 | 7 | 488 | 54841.3 | 8.15 | 10 |
31 | CaAUX2 | Capana10g001370 | Chr10:147549183:147556929 | 1467 | 7 | 488 | 54912.8 | 8.56 | 10 |
32 | CaAUX3 | Capana04g001744 | Chr04:99262090:99266939 | 1317 | 8 | 438 | 49663.3 | 8.25 | 9 |
33 | CaAUX4 | Capana08g002704 | Chr08:150979738:150984984 | 1482 | 8 | 493 | 55541.5 | 8.75 | 10 |
| ANT group | | | | | | | | |
34 | CaANT1 | Capana02g002432 | Chr02:144978448:144979728 | 1281 | 1 | 426 | 46665.9 | 7.92 | 11 |
35 | CaANT2 | Capana02g002433 | Chr02:144981268:144982602 | 1335 | 1 | 444 | 48548.7 | 4.74 | 11 |
36 | CaANT3 | Capana02g002434 | Chr02:144983909:144985192 | 1284 | 1 | 427 | 46457.5 | 4.82 | 11 |
37 | CaANT4 | Capana04g002414 | Chr04:201839016:201840293 | 1278 | 1 | 425 | 46811.9 | 7.45 | 11 |
38 | CaANT5 | Capana03g004210 | Chr03:248829547:248830964 | 930 | 2 | 309 | 33786.3 | 10.06 | 10 |
| ATLa group | | | | | | | | |
39 | CaATL1 | Capana06g001998 | Chr06:75940086:75942122 | 846 | 3 | 281 | 30550.9 | 4.94 | 3 |
40 | CaATL2 | Capana03g000522 | Chr03:7178172:7179590 | 1419 | 1 | 472 | 51153.5 | 5.43 | 10 |
41 | CaATL3 | Capana05g002081 | Chr05:197860240:197862960 | 1302 | 6 | 433 | 47459.1 | 8.35 | 11 |
42 | CaATL4 | Capana00g004937 | Chr00:676629079:676631743 | 1320 | 5 | 439 | 47898.6 | 8.55 | 11 |
43 | CaATL5 | Capana04g000715 | Chr04:12477359:12484737 | 1344 | 5 | 447 | 48657.5 | 8.36 | 11 |
44 | CaATL6 | Capana02g000804 | Chr02:93929776:93933801 | 1407 | 5 | 468 | 50795.2 | 8.78 | 11 |
45 | CaATL7 | Capana02g003206 | Chr02:157224580:157228911 | 1383 | 5 | 460 | 49954.5 | 8.55 | 11 |
| ATLb group | | | | | | | | |
46 | CaVAAT1 | Capana00g004212 | Chr00:618994856:618996151 | 1296 | 1 | 431 | 46618.7 | 7.96 | 8 |
47 | CaVAAT2 | Capana04g001726 | Chr04:93008498:93010474 | 1281 | 3 | 426 | 46958.9 | 7.71 | 10 |
48 | CaVAAT3 | Capana12g002556 | Chr12:222093246:222094767 | 1017 | 2 | 338 | 36992 | 7.91 | 9 |
49 | CaVAAT4 | Capana05g002349 | Chr05:207916251:207920239 | 1140 | 3 | 379 | 41865.5 | 9.04 | 9 |
50 | CaVAAT5 | Capana03g003057 | Chr03:162840327:162847744 | 1395 | 9 | 464 | 51484.7 | 4.27 | 7 |
51 | CaVAAT6 | Capana10g001696 | Chr10:173666186:173669984 | 1608 | 11 | 535 | 57908.7 | 5.19 | 10 |
52 | CaVAAT7 | Capana03g002859 | Chr03:127734852:127743213 | 1338 | 7 | 445 | 48690.8 | 4.98 | 5 |
53 | CaVAAT8 | Capana12g002523 | Chr12:220748120:220761121 | 1989 | 15 | 662 | 73214.4 | 5.85 | 8 |
a Systematic designation given to pepper AAAPs in this study. |
b Locus identity number of AtAATs assigned by Pepper Genome Database2 (http://peppersequence.genomics.cn/page/). |
c Chromosomal localization of pepper AAAP genes.. |
d Length of the open reading frame. |
e Number of extrons obtained from GSDS by comparing sequences between transcript and genome (Gene Structure Display Server; http://gsds.cbi.pku.edu.cn/ ).. |
f Protein characterization of CaAAAPs obtained from EXPASY server (http://web.expasy.org/protparam/ ). |
g Number of transmembrane segments possessed by CaAAAPs, predicted by the TMHMM Server v2.0. |
S.N., serial number; ORF, open reading frame; bp, base pair; aa, amino acids; MW, molecular weight; pI, isoelectric point; TM, transmembrane; NA, not available. |
We studied the exon/intron arrangement of the coding sequences of CaAAAP genes in their genome sequences and found that 13.21% (7/53) of pepper AAAP genes contained a single exon, 3.77% (2/53) had a single intron, and 83.02% had 1 to 14 introns (Fig. 1).Prediction of TM regions showed that most CaAAAPs (77.36%) had 8–11. Similar numbers of TMs regions were found in several subfamilies (e.g., 10 TMs in the AUX subfamily and 11 TMs in the ANT and ATLa subfamilies; Table 1 and Additional file 1:Figure S1). Thus, members of the same subfamily have a conserved structure.
Phylogenetic and structural analyses of AAAP proteins in pepper
To further understand the homology between the AAAP gene families of pepper and other plant species (Table 2), we constructed an unrooted phylogenetic tree of full-length AAAPs from pepper, potato, rice and Arabidopsis was constructed (Fig. 2). We found that the genes CaAAAP, StAAAP, OsAAAP and AtAAAP were divided into eight distinct subfamilies, which indicates that the AAAP gene family has eight subfamilies in angiosperms. In pepper, the LHT subfamily was the largest (26.42%; 14 genes), whereas the GAT subfamily comprised only two genes. The numbers of genes in the subgroups ProT and ANT were the same as or similar to those in potato, rice, and Arabidopsis, which indicates that CaAAAPs of these two subgroups have not changed greatly in the pepper genome.
Table 2
Comparative analysis of Amino acid/auxin permease (AAAP) proteins between Capsicum and other plant species.
Specie | AAAP subfamily | # of AAAP proteins | # of Proteins | # % of AAAP proteins | Reference |
AAP | LHT | GAT | ProT | AUX | ANT | ATLa | ATLb |
A. thaliana | 8 | 10 | 2 | 3 | 4 | 4 | 5 | 10 | 46 | 25,498 | 0.18 | 6 |
P.edulis | 16 | 8 | 6 | 3 | 7 | 2 | 6 | 7 | 55 | 31,987 | 0.17 | 11 |
O. sativa | 19 | 6 | 4 | 3 | 5 | 4 | 7 | 10 | 58 | 35,825 | 0.16 | 7 |
Z. mays | 15 | 24 | 2 | 2 | 5 | 3 | 6 | 14 | 71 | 39,591 | 0.18 | 8 |
M.truncatula | 26 | 18 | 4 | 3 | 5 | 3 | 13 | 14 | 86 | 44,623 | 0.19 | 12 |
P. trichocarpa | 17 | 13 | 7 | 3 | 8 | 4 | 8 | 11 | 71 | 45,000 | 0.16 | 9 |
S.tuberosum | 8 | 11 | 3 | 4 | 5 | 5 | 8 | 8 | 52 | 39,031 | 0.13 | 10 |
C.annuum | 10 | 14 | 2 | 3 | 4 | 5 | 7 | 8 | 53 | 34,476 | 0.15 | |
AAP: amino acid permease; LHT: lysine and histidine transporter; GAT: g -aminobutyric acid transporter; ProT: proline transporter;AUX: auxin transporter; ANT: aromatic and neutral amino acid transporter; ATL: amino acid transporter-like |
Conserved domains of pepper AAAP proteins were analyzed with the MEME server and a total of 20 conserved motifs were identified (Fig. 1, Additional file 3: Table S1). Motifs 1 (44/53), 2 (42/53), and 7 (49/53) were widespread among members of the CaAAAP family. Some subfamilies included several specific motifs. For example, the LHT and GAT subfamilies contained motifs 3, 12, 13, and 14, whereas motif 5 was only found in the LHT, AAP, GAT, and ProT subfamilies. Motifs 9, 10, and 17 were only present in the AUX subfamily; motifs 15 and 18 were only present in the ANT subfamily; motifs 16 and 19 were only present in the ATLa subfamily. Similar numbers of motifs were found in the ProT and AUX subfamilies (Fig. 1), which suggests that the structures of these subfamilies are highly conserved.
Chromosomal location and duplication analyses
We used Mapchart 2.30 mapping to identify the chromosomal location of AAAP genes in the pepper genome (Fig. 3). In addition to CaGAT1, CaANL4 and CaVATT1, the remaining 50 genes were unevenly distributed on 11 of 12 chromosomes; no genes were mapped to chromosomes 1 (Fig. 3, Table 1). Most of the genes were mapped to the bottom of chromosomes 2, 5, 7 and 8, whereas the genes on chromosome 11 were mostly mapped to the top. A total of 58.5% (31/53) of genes were mapped to chromosome 2, 3, 4 and 5, which contained 8, 6, 11 and 6 genes, respectively. Only one gene was located on chromosome 9, and two to four genes were mapped to the remaining chromosomes (Fig. 3).
To identify the duplication events of AAAP genes in pepper, we analyzed the 53 full-length AAAP protein sequences using MCScanX. According to the defined criterion of separation five or fewer genes with more than 50% similarity at protein level, 33.96% (18 of 53) originated from the duplication events (Fig. 3).Twelve genes (22.64%) were arranged in tandem duplication and organized into four groups. Two pairs of tandem duplicate genes were identified on chromosome 2; chromosomes 5 and 7 each contained one pair (Fig. 3). Three segmental duplication blocks were located on chromosomes 2, 4 and 12, representing 11.32% of all CaAAAP genes (6/53) (Fig. 3, Additional file 2: Figure S2). Therefore, compared to segmental duplications, tandem duplication events predominated in the expansion of AAAP genes in pepper. Furthermore, high-sequence similarity occurred in duplicated genes: CaAAP1 and CaAAP3, which originated via tandem duplication, were 94.28% similar, whereas CaANT1 and CaANT4, which were a result of segmental duplication, exhibited 81.79% similarity.
We further estimated nonsynonymous (Ka) and synonymous (Ks) nucleotide substitution rates in the coding sequences of paralog pairs to explore the selective pressures and duplication time of AAAP gene family members in pepper (Table 3). In general, Ka/Ks ratios less than 1 indicate purifying selection, and Ka/Ks ratios greater than 1 indicate positive selection [43]. The Ka/Ks ratios of all seven paralog pairs were < 1.0, which indicates that CaAAAP genes evolved under purifying selection (Table 3). We also estimated the dates of duplication events of paralog pairs using the formula T = Ks/2λ (assuming a clock-like rate (λ) of 6.96 × 10 − 9 synonymous substitutions per years [44]); duplication events were estimated to have occurred 8.53 to 68.69 million years ago (Mya), with an average duplication time of 43.61 Mya. In addition, the two segmental duplications of CaAAAPs occurred from 54 to 58.87 Mya, and five tandem duplications occurred from 41.43 to 8.53 Mya.
Table 3
Ka-Ks calculation for each pair of AAAP paralogs in pepper. .
Paralog pairs | S-sites | N-sites | Ka | Ks | Ka/Ks | Selection pressure | Duplication type | Duplication time (Mya) |
CaANT1-CaANT2 | 304.25 | 970.75 | 0.16 | 0.57 | 0.29 | Purifying selection | Tandem | 40.96 |
CaANT2-CaANT3 | 305.00 | 976.00 | 0.05 | 0.12 | 0.40 | Purifying selection | Tandem | 8.53 |
CaANT1-CaANT3 | 304.83 | 970.17 | 0.15 | 0.57 | 0.26 | Purifying selection | Tandem | 40.59 |
CaLHT1-CaLHT3 | 313.75 | 1015.25 | 0.13 | 0.58 | 0.22 | Purifying selection | Tandem | 41.43 |
CaAAP1-CaAAP3 | 316.08 | 1015.92 | 0.07 | 0.14 | 0.50 | Purifying selection | Tandem | 10.37 |
CaANT1-CaANT4 | 303.58 | 971.42 | 0.11 | 0.82 | 0.14 | Purifying selection | Segmental | 58.87 |
CaAAP5-CaAAP6 | 351.08 | 1091.92 | 0.11 | 0.75 | 0.15 | Purifying selection | Segmental | 54.00 |
S-Sites, number of synonymous sites; N-Sites, number of non-synonymous sites; Ka, non-synonymous substitution rate; Ks, synonymous substitution; Mya, million years ago. |
Expression patterns of CaAAAP genes in various tissues
We investigated the expression profiles of all CaAAAP genes in roots, stems, leaves, floral buds, flowers and different developmental stages of fruits (Fig. 4, Additional file 4: Table S2). 48 (90.5%) of the CaAAAP genes were detected in at least one tissue ( RPKM ≥ 1), and 19(35.8%) genes were detected in all tissues tested (RPKM ≥ 1). In particular, approximately half of the CaAAAP genes showed low expression in fruits. By contrast, approximately 80% CaAAAP genes showed high expression in flowers (RPKM ≥ 1). These results indicate that CaAAAPs play an important role in the growth and development of pepper, in particular in the flowers. The CaAAAP genes clustered into three distinct clades based on expression patterns (Fig. 4). Seven genes (CaAAP2, CaAAP3, CaAAP5, CaAAP9, CaATL6, CaATL7, and CaVAAT8) in group I were expressed at relatively high levels in all tissues. In addition to several genes exhibited relatively high expression in specific organs (such as CaLHT3, CaLHT5, CaLHT8, VAAT1 and VAAT6 in buds; CaATL4 in fruits; CaLHT9 and CaGAT2 in roots; CaLHT12 in roots, stems and leaves), the other genes in group II were expressed at relatively low levels in all tested tissues. Group III comprised 20 genes that were expressed at relatively high levels in most organs.
Differential expression profiling of CaAAAP genes in response to abiotic stress
To study whether CaAAPs are involved in responses to hormones and abiotic stresses in pepper, we investigated the expression levels of the CaAAPs in the roots and leaves of 40-day old seedlings in response to cold, heat, salt, osmotic, oxidative, ABA, IAA, GA3, JA and SA treatment (Fig. 5, Additional file 5: Table S3). In addition to CaLHT2, CaLHT5, CaLHT7, CaLHT8, CaLHT13, and CaAAP10, most AAAP genes were induced in at least one of the treatment as compared with the control (Fig. 5). Interestingly, some AAAP genes varied greatly between the leaves and roots in the response to abiotic or hormones stress. For instance, CaAAP4, CaLHT9, CaLHT10, CaATL3, CaATL6, CaATL7, CaAUX3, and CaVAAT7 were found to be upregulated under cold, heat, osomotic, oxidative and salt in the roots, but downregulated in the leaves. There were 28, 10, 20, and 18 CaAAAP genes were also upregulated by ABA, GA3, IAA, and JA treatment in the roots respectively, but downregulated in the leaves. Whereas there were 4, 5, and 7 CaAAAP genes were observed to be upregulated in the leaves but downregulated in the roots under the cold, IAA and salt stress treatment, respectively. In contrast, the highest number of CaAAAP genes were upregulated in the SA response in the leaves and roots (33 genes).There were several stress-responsive cis-elements showing in the promoter regions of these members, such as ABRE, ARE, LTR, MBS, TGACG-motif, CGTCA-motif, TCA-element, GARE-motif, AuxRR-core, and TC-rich repeats (Additional file 6: Table S4). Among the 53 AAAP genes, the CaAAP7 promoter had no these stress-responsive elements, while CaVAAT2 had maximum 14 elements. These results reaveled that a number of CaAAAP genes might involved in regulating abiotic and hormone stress responses.