3.1. Characterization of CD 14 gene of buffalo
The buffalo CD14 coding sequence was of 1122 bp size with 62.3% GC content. The open reading frame of CD14 gene was of 1116 nucleotide, with a natural start codon and TAA stop codon. The total length of Bubaline CD14 molecule was observed to be 373 amino acids with signal peptide of 20 amino acid. Cattle CD14 also contains 373 amino acids and revealed 98% identity with buffalo CD14. The comparative analysis of derived amino acid sequence between cattle and buffalo revealed slightly higher molecular weight (39705.07 Da) of buffalo sequence as compared to cattle (39679.96 Da). The buffalo CD14 peptide sequence was characterized by one extra strongly basic amino acid and had two polar amino acids less than that of cattle. The nucleotide sequence comparison of cattle with buffalo revealed 22 nucleotide substitutions, out of which eleven were synonymous codon without any amino acid change. The peptide sequence comparison of buffalo CD14 with that of cattle revealed eleven amino acid substitution at 14, 62, 131, 134, 139, 143, 154, 209, 235-236, 277, 337th position. In the derived peptide sequence of CD14 of buffalo, six leucine-rich repeats (LRR) were observed (Fig 1), whereas a cattle has ten LRR. Six LRR have been identified and depicted in different colours (Fig 1), amino acid site 114-138 as wheat, 168-176 as pale green, 192-218 as light blue, 244-256 (pale yellow), 266-283 (light pink), 296-309 (blue-white). Buffalo CD14 molecule was predicted to contain 17.2% leucine, which is similar to mouse (17.66%) and higher than that of human (15.5%). Analysis of derived peptide sequence revealed 373 amino acids which is GPI anchored at C-terminus near 353 amino acid position, about 7 amino acid after hydrophobic tail. Four putative N-linked glycosylation sites were observed in bubaline CD14.
3.2. Comparison of CD14 with related genes.
Comparison of CD14 gene with other related genes containing leucine-rich repeats, viz. MD2, TLR2, TLR4, and TLR9 of cattle revealed very low similarity of 22.4, 21.8, 18.8 and 23.4%, respectively. This reveals the distinct nature of CD14 gene from that of related gene sequences.
The molecular interaction of CD14 gene with the related genes are depicted in Fig2. It is evident from the figure that CD14 is related to TLR2, TLR4, TLR6, IL10, TNF, MYD88, LY96 by String analysis (Fig 2). The result was confirmed through analysis of biochemical pathway that these genes are related in TLR signalling pathway through KEGG analysis (Fig 3). Moreover, it is evident that each molecule responsible for innate immunity, either receptor or immune mediator acts through a cascade of mechanisms with interaction with other.
3.3. Identification of CD14 variants with SNPs
SSCP followed by sequencing has identified eight variants of CD14 gene. cDNA of CD14 gene was amplified for different variants from Variant A to H and sequenced. Sequences were designated as CD14 A (MF 092866), CD14 B (MF 092866), CD14 –C (MF 092866), CD14-D (MF 092866), CD14-E (MF 092866), CD14-F (MF 092866), CD14-G (MF 092866) and CD14-H (MF 092866). Variant A of CD14 gene was observed to be the wild type (Fig1 and Fig4). The allelic frequencies for different variants of CD14 as CD14-A, CD14-B, CD14-C, CD14-D, CD14-E, CD14-F, CD14-G and CD14-H were observed to be 0.47, 0.09, 0.08, 0.18, 0.022, 0.13, 0.018 and 0.018 respectively.
The 3D structure of Variant A has been depicted in Fig 2 with disulfide bond as a blue colored sphere, LPS binding site as green, leucine zipper as yellow.
Mutations in CD14 affect the binding ability which in turn affects the biological potentiality. Mutational hotspots were detected in bubaline CD14 with 58 number of non-synonymous SNP, out of which 18 were observed to be deleterious and 34 as thermodynamically unstable. Eight different variants of CD14 gene have been identified in all the four breeds of buffalo with 58 SNPs, revealing a high degree of polymorphism. AA ( genotypic frequency 0.468) as most frequent and AG and AH as the least frequent genotypes were identified as, with very low frequencies (0.0174) for both the genotypes. High degree of variability was observed for alleles B (81.3%), C (97.0%), D (86.0%), E (97.4%) and F (85.8%) compared to A allele of amplified CD14 nucleotide of buffalo, which may be due to coding for leucine-rich repeats, that confers the recognition and binding ability for a wide range of pathogens. However, an absence of any change of amino acid at the sites of glycosylation confirms that the vital functions of the CD14 molecule remained unaltered.
Breed wise differences for allelic frequencies for CD14 gene were observed.
None of the breed of buffalo under current study was observed to be resistant to mastitis. However the differences in susceptibility was observed for Breeds of buffalo. Among the four breeds under consideration, Surti was observed to be most susceptible to mastitis, while Bhadawari was observed to be comparatively better resistant to mastitis under current study. The resistance of buffalo breed as Mehsana and Murrah was observed to be intermediate. Surti was observed to be mostly susceptible to mastitis.
Comparisons were attempted for identification of mutant type of CD14 variant with the wild type.
The various functional domains of CD14 were studied with respect to wild type variant A as follows:
Variant A versus Variant B
A number of SNPs were identified out of which were observed to be synonymous and non-synonymous (Table 1). G210 C was observed to be deleterious mutation by Provean software, leading to altered function. I-mutant software depicts decreased stability at amino acid position G207A, T209A, G210C, 218, 230, 233, 234.
Table 1: Identified SNPs for CD14 variants
Sl
no.
|
SNP
|
Var B
|
Var C
|
Var D
|
Var E
|
Var F
|
Var G
|
Var H
|
Status of mutation
|
1
|
G207A
|
P
|
|
|
|
|
|
|
Unstable
|
2
|
D208Q
|
P
|
|
|
|
|
|
|
Neutral
|
3
|
T209A
|
P
|
|
|
P
|
|
P
|
|
Unstable
|
4
|
T209S
|
|
|
|
|
P
|
|
P
|
Neutral
|
5
|
G210C
|
P
|
|
|
|
|
|
|
Both unstable and deleterious
|
6
|
L215P
|
|
|
P
|
|
|
|
|
Both unstable and deleterious
|
7
|
Y225C
|
|
P
|
|
|
|
|
|
Neutral
|
8
|
A227P
|
P
|
|
|
|
|
|
|
Neutral
|
9
|
L228V
|
P
|
|
|
|
|
|
|
Unstable
|
10
|
N230H
|
P
|
|
|
|
|
|
|
Unstable
|
11
|
A231P
|
P
|
|
|
|
|
|
|
Neutral
|
12
|
M233L
|
P
|
|
|
|
|
|
|
Unstable
|
13
|
E234K
|
P
|
|
|
|
|
|
|
Unstable
|
14
|
E234D
|
|
P
|
|
|
|
|
|
Neutral
|
15
|
T235S
|
|
|
|
|
|
|
P
|
Neutral
|
16
|
R247K
|
|
|
P
|
|
|
|
|
Unstable
|
17
|
V248L
|
|
|
P
|
|
|
|
|
Unstable
|
18
|
Q249P
|
|
|
P
|
|
|
|
|
Neutral
|
19
|
P250A
|
|
|
P
|
|
|
|
|
Unstable
|
20
|
Q251H
|
|
|
P
|
|
|
|
|
Unstable
|
21
|
S252R
|
|
|
P
|
P
|
|
|
|
Neutral
|
22
|
S252T
|
|
|
|
|
P
|
|
|
Neutral
|
23
|
L253R
|
|
|
P
|
|
|
|
|
Both unstable and deleterious
|
24
|
D254L
|
|
|
P
|
|
|
|
|
Both unstable and deleterious
|
25
|
L255H
|
|
|
P
|
|
|
|
|
Both unstable and deleterious
|
26
|
L255P
|
|
|
|
|
P
|
|
|
Both unstable and deleterious
|
27
|
S256N
|
|
|
P
|
|
P
|
|
|
Deleterious
|
28
|
H257R
|
|
|
P
|
|
|
|
|
Neutral
|
29
|
H257P
|
|
|
|
|
|
|
|
Deleterious
|
30
|
N258C
|
|
|
P
|
|
|
|
|
Deleterious
|
31
|
S259P
|
|
|
P
|
|
|
|
|
Deleterious
|
32
|
S259L
|
|
|
|
|
P
|
|
|
Deleterious
|
33
|
L260P
|
|
|
|
|
P
|
|
|
Both unstable and deleterious
|
34
|
R261H
|
|
|
P
|
|
|
|
|
Unstable
|
35
|
R261N
|
|
|
|
|
P
|
|
|
Unstable
|
36
|
V262P
|
|
|
|
|
P
|
|
|
Unstable
|
37
|
T263C
|
|
|
P
|
|
|
|
|
Both unstable and deleterious
|
38
|
T263P
|
|
|
|
|
P
|
|
|
Unstable
|
39
|
A264T
|
|
|
|
|
P
|
|
|
Unstable
|
40
|
G266A
|
|
|
P
|
|
P
|
|
|
Unstable
|
41
|
A267Q
|
|
|
|
|
P
|
|
|
Unstable
|
42
|
R269G
|
|
|
|
|
P
|
|
|
Unstable
|
43
|
C270K
|
|
|
|
|
P
|
|
|
Unstable
|
44
|
V271A
|
|
|
P
|
|
|
|
|
Unstable
|
45
|
W272H
|
|
|
|
|
P
|
|
|
Unstable
|
46
|
P273A
|
|
|
P
|
|
|
|
|
Both unstable and deleterious
|
47
|
S274P
|
|
|
P
|
|
|
|
|
Neutral
|
48
|
A275V
|
|
|
|
|
P
|
|
|
Neutral
|
49
|
R277M
|
|
|
P
|
|
|
|
|
Both unstable and deleterious
|
50
|
R277S
|
|
|
|
|
|
|
P
|
Neutral
|
51
|
N280D
|
|
|
P
|
|
|
|
|
Deleterious
|
52
|
S282A
|
|
|
P
|
|
|
|
|
Both unstable and deleterious
|
53
|
F283L
|
|
|
P
|
P
|
|
P
|
|
Unstable
|
54
|
E287P
|
|
|
|
P
|
|
P
|
|
Neutral
|
55
|
Q288R
|
|
|
|
P
|
|
P
|
|
Neutral
|
56
|
V289L
|
|
|
|
P
|
|
P
|
|
Unstable
|
57
|
K296L
|
|
|
|
P
|
|
P
|
|
Deleterious
|
58
|
R337H
|
|
|
|
P
|
|
P
|
|
Unstable
|
3D structure of variant A and Variant B have been depicted in fig 4 and fig 5. Disulphide bridges in blue, Alpha helix as pink and Beta pleated sheet as red, coil in cyan. The site for 209 as orange, 210 as green, 218 purple-blue.
Variant A versus variant C
Three non-synonymous SNPs were identified out of which T235 K was observed to be deleterious by Provean. I-mutant has predicted Y225C and T235K with reduced thermodynamic stability (Table 1). The 3D structure for CD14 variant C (Fig 6), with the mutations, have been depicted in figure 4. The secondary structure of CD 14 has been depicted in an alpha helix as cyan, beta sheet as red, loop in pink. Disulphide bonds in blue, Y225C in green and E243D and T235K as yellow.
Variant A versus Variant D
24 synonymous mutations were observed out of which 12 were observed to be deleterious (L215P, L253R, D254L, L255H, S256N, N258C, S259P, T263C, P273A, R277M, N280D, S282A, F283L) as depicted in Table 1. This high degree of mutations has an effect on its structure also as revealed by TM-align software. Structural alignment of CD14A with CD14D has revealed a greater degree of variation, RMSD was observed to be 1.87. The structural alignment with the differing domain of CD14 had been highlighted in Fig 7.
Sites for disulphide bond has been depicted as blue coloured sphere. Site for mutation L215P as red. L253R, D254L, L255H, S256N, N258C, S259P as yellow.
T263C Orange, P273A Cyan, R277M Grey, N280D Warm pink, S282A Purple blue.
The domain of CD14 present at site L215P is leucine-rich repeat. It has already been reported that LRR in extracellular domain is responsible for the recognition of pathogens, hence any mutation at this site may lead to defects in coded protein causing impairment of the function of pathogen recognition.
The mutation of CD14 at site 253-256 codes for leucine-rich repeat, may cause a defect in pathogen recognition and the ultimate function may be impaired. It is to be kept in mind that CD14 is a pattern recognition receptor containing leucine-rich repeats (LRR). It is to be taken into consideration that whenever there is any mutation causing alteration of leucine moiety, the function of LRR is effected. The mutations at P273A, R277M, N280D, S282A, F283L site of CD14 gene of buffalo cause alteration of LRR domain.
The 3D structure depicting 3 D structural alignment of Variant A and variant D (Fig 7).
Variant A versus Variant E
The mutations identified in CD14 E are listed in Table 1. Seven synonymous mutations were observed out of which K296 L was predicted to be deleterious by Provean. 3D structure of Variant E of a CD14 molecule (Fig 8). The sites for disulphide bond was predicted in blue spheres. K296L was predicted in red. T209S in green, R277S yellow, V289L orange.
Results in I-mutant revealed a large decrease of thermodynamical stability for T209S, R277S, V289L, R337H. Hence the mutation at these sites will also lead to structural instability and ultimately decreased function.
T209S, R277S moieties are present in LRR region, which in turn is present for pathogen recognition. V289L moiety is present in leucine zipper site. Leucine zipper is responsible for dimerization and binding with TLR4, MD2 and its functioning.R337H is the site for the alpha helix.
Mutation of CD14 buffalo at the site K296L indicates an alteration in the major function of some of the domain as LRR, domain linker, and leucine zipper. This zipper site is very important for dimerization and pathogen binding.
Variant A versus Variant F
This high degree of mutations has an effect on its structure also as revealed by TM-align software. Deleterious mutations as revealed were L255P, S256N, H257P, S259L, L260P, C270K, W272H (Table 1). Structural alignment of CD14A with CD14 F has revealed a greater degree of variation, RMSD was observed to be 1.81. The structural alignment has been observed in Fig 9. The site for mutation as L255P, S256N, H27P had been depicted as red sphere.S259L and L260P had been depicted in magenta. C270K as cyan. W272H as grey. 255 to 257 aa of CD14 Comprises for LRR. Both C270K, W272H sites comprise of LRR domain. Site C270K encodes for a di-sulphide bond. Alteration in amino acid moiety at site 270, impairs di-sulphide bonding, which affects protein folding. Impaired protein folding, in turn, effects its function. Much structural dissimilarity has been observed as RMSD 1.81.
Variant A versus variant G
Seven non-synonymous mutations were observed out of which K296 L was predicted to be deleterious by Provean (Table 1). Structural alignment revealed differences with RMSD was observed to be 0.22 (Fig 10).Variant A as green, G by Cyan. The sites for disulfide bond was predicted in blue spheres. K296L was predicted in red.
Results in I-mutant revealed a large decrease of thermodynamical stability for T209S, R277S, V289L, R337H (Table 1). Hence the mutation at these sites will also lead to structural instability and ultimately decreased function.
T209S, R277S moieties are present in LRR region, which in turn is present for pathogen recognition. V289L moiety is present in leucine zipper site. Leucine zipper is responsible for dimerization and binding with TLR4, MD2 and its functioning. R337H is the site for the alpha helix.
Mutation of CD14 buffalo at the site K296L indicates an alteration in the major function of some of the domain as LRR, domain linker, and leucine zipper. This zipper site is very important for dimerization and pathogen binding. Structural dissimilarity with RMSD 0.22 has also been observed.
Variant A versus variant H
Three non-synonymous mutations were detected as T209S, T235S, R277S (Table 1). However, the mutations were observed to be thermodynamically stable as revealed by I-mutant and none of the mutations were observed to be deleterious.
The alignment of all the variants of CD14 had been depicted in Fig 11, which clearly reveals the mutations, both synonymous and non-synonymous for bubaline CD14.
3.2. Association of SNPs with phenotypic traits related to mastitis
Mutations in CD14 effect the binding ability which in turn affects the biological potentiality. Mutational hotspots were detected in bubaline CD14 with 58 number of non-synonymous SNP, out of which 18 were observed to be deleterious and 34 as thermodynamically unstable. Eight variants of CD14 were identified, where variant A was observed to be the wild type. Eight genotypes AA, AB, AC, AD, AE, AF, AG, AH were designated as Variant A, B, C, D, E, F, G, H respectively. Phenotypic association with different indicator traits for mastitis were identified and listed in Table 2. The breedwise details for the number of buffaloes of different breed affected with sub clinical mastitis is being presented in supplementary Table 1.
Table 2: Association of different variants of CD14 with indicator traits for mastitis
|
Somatic cell count
(105 cells per ml)
|
California mastitis test
|
Rennet coagulation time
(sec)
|
CD14 A
|
0.89 ± 0.23a
|
Negative
|
128.70 ± 3.80 c
|
CD14B
|
1.26±0.34 a
|
Weak
|
168.56± 3.67 c
|
CD14C
|
1.15± 0.25 a
|
Trace
|
135.60 ± 2.89 c
|
CD14D
|
2.17±0.37 b
|
Strong Positive
|
286.07 ± 2.87 d
|
CD14E
|
1.29±0.28 a
|
Weak
|
175.45± 3.24 c
|
CD14F
|
1.8 ±0.38 b
|
Strong Positive
|
265.07± 3.56 d
|
CD14G
|
1.32±0.39 a
|
Distinct Positive
|
189.04± 3.23 c
|
CD14H
|
0.88 ±0.27 a
|
Negative
|
129.20 ± 3.9 c
|
Superscript a and b indicate the differences at P˂ 0.01% of significance within the genotypes for somatic cell count.
Superscript c and d indicate the differences at P˂ 0.01% of significance within the genotypes for rennet coagulation time.
Association of SNP of CD14 with Somatic cell count
Polymorphism or SNPs of CD14 gene were observed to be significantly associated with somatic cell count. Significant differences somatic cell count were observed between wild-type Variant A with variant D and variant F
Association of SNP of CD14 with California mastitis test
Polymorphism or SNPs of CD14 gene were observed to be significantly associated with California mastitis test. Variant D and variant F of CD14 was observed to be strongly positive in contrast to a negative response in wild-type Variant A and variant H. Variant G was also observed to have a distinct positive response for California mastitis test (Table 2). As evident in the Table 2, the data for california mastitis test was observed to be discrete variable. For the analysis, scoring was used for qualitative data, then employed log transformation in order to maintain continuity of data.
Association of SNP of CD14 with rennet coagulation time
Polymorphism or SNPs of CD14 gene were observed to be significantly associated with rennet coagulation time. Significant differences were observed between wild type Variant A with variant D and variant F of CD14 protein (Table 2).
Mutated CD14 with inactive biological function was observed to be responsible for mastitis.