General features of E.coli S9922 isolates from calf encephalitis
ExPEC is a foodborne pathogen that can cause urethritis, infectious pneumonia, neonatal meningitis, cow mastitis and so on, which seriously threatens the health of human and animals. E.coli S9922 was isolated from cerebral effusion in a calf that died of meningitis. However, the character of clinical isolate remains to be determined. We analyzed the isolate in this study by genomic sequencing using the PacBio platform (Novo Zhiyuan Technology, Beijing). The draft genome sequences had been deposited in the NCBI Sequence ReadArchive under the accession number: SRA8740874. The genome of E.coli S9922 comprises a circular chromosome of 4,960,912bp with an average G+C content of 50.7%, and three circular plasmid (Plas1:110,372 bp, Plas2:73,649 bp, Plas3:77,569 bp) with an average G +C content of 49.62% , 52.87% and 51.82%, respectively (Fig.1). General genome information including Genome size, G+C content, Average gene size, Number of contings, Number of rRNA/tRNA/SRNA, Number of GIs were shown in Table 1.
Gene islands (GIs)
GIs comprise a horizontally acquired flexible gene pool that is a major driver in evolution and niche specialization of pathogenic bacteria. We used IslandPath-DIOMB (Version 0.2) to identify putative genomic islands in E.coli S9922. The results showed that there were 23 GIs in the E.coli S9922 genome, 20 of which are located on chromosomes, two on plasmid 1, and one on plasmid 2. The total length of the GIs is 313608 bp, and that of the average length is 13635 bp, and they were shown in Fig. 2.
Virulence and pathogenicity analysis
T3SS of Gram-negative bacteria is commonly used to study pathogens, infection mechanisms, virulence effects at the molecular level. The results showed that there were 5055 encode genes identified in the T3SS of E.coli S9922, including 182 T3SS effector proteins and 4873 non-T3SS effector proteins.
The E.coli S9922 genome had a total of 291 genes annotated in PHI, of which 200 are attenuated virulence genes (Fig.3). Using Diamond software, the amino acid sequence of E. coli S9922 was compared with the VFDB database, and 204 genes related to the virulence were found, mainly including: lipopolysaccharide (LPS) (10 genes), flagella (40 genes), type IV pili (12 genes), type I pili (19 genes), TTSS (Virulence-related factors, such as SPI-1) (eight genes), enterobacteria (14 genes), and VAS cluster (eight genes)
The ARDB database contains the names of drug resistance-related genes which can be found through the annotation of the database. The amino acid sequence of E.coli S9922 was compared with the ARDB database. E.coli S9922 was resistant to adriamycin, erythromycin, vancomycin, teicoplanin, penicillin, bacitracin, aminoglycosides, glycylcycline, macrolides, β-lactams, acridine yellow, tetracycline, streptomycin, isepamicin, netilmicin, tobramycin, kanamycin, sisomicin, dibekacin, chloramphenicol, monoamimycin, cephalosporin, gentamicin, phosphomycin, polymyxin, enrofloxacin, norfloxacin, deoxycholic acid, tigecycline, fluoroquinolone, phosphoamimycin, glycyl, kasugamycin, chloride, puromycin, trimethoprim, and sulphonamide.
Comparative genomics analysis
E.coli S9922 genome was compared to 16 reference E.coli strains, the main features of the above E.coli strains are shown in Table 2. E.coli S9922 was similar to the other E.coli with G+C content between 50.5–50.8%, The majority of ExPEC belong to phylogenetic group B2 including UPEC, APEC and NMEC.
Collinearity refers to the linkage relationship between genes, which is a phenomenon where homologous genes are arranged in the same order within the genomes of different species. The degree of collinearity between two species can be used as a measure of the evolutionary distance between them, helping to understand the relationship between species. In order to further determine the genomic differentiation and similarities between E.coli S9922 and the other strains, we performed the Whole-genome collinearity analysis, these results were displayed in Fig. 4. It can be seen that E.coli S9922 and E.coli 90-9272 had the highest number of collinear fragments, indicating that E.coli S9922 and E.coli 90-9272 had the least evolutionary distance and closest genetic relationship. PCN061 was second most closely related to E.coli S9922, and E.coli S9922 had the least collinear fragment and the most farthest related to UTI89.
Population evolution analysis
Core genes and specific genes are likely to correspond to the commonality and characteristics of E.coli S9922, thus serving as the basis for the study of the functional differences between samples. Compared with the genes of reference strains, E.coli S9922 obtained 2780 core genes, 6586 non-genes, and 669 specific genes by cluster analysis. Among them, by comparing COG, KEGG and GO, core genes homologous to E.coli S9922 were obtained, which were 2570, 2780 and 2188 genes, respectively. The unique genes were classified as follows: 204, 550, 239 genes in COG, KEGG, and GO comparisons, respectively, in shown in Additional file 1.
In genomic evolution, a gene family is a group of genes with a common ancestor. Different genes in a family often have a similar structure and function, which can serve as the basis for identifying unknown gene function and provide clues regarding gene evolution history. Pairwise comparison was made between E.coli S9922 and other strains to obtain gene family clustering results, as shown in Table 3 and Fig. 5.
Among the gene family clustering results obtained by pairwise comparisons between E.coli S9922 and other strains, the number of unclustered genes was the highest in the E.coli S9922 strain. The number of E.coli S9922 specific gene families was 35.
By comparing E.coli S9922 with other strains, the evolutionary tree was obtained as shown in Fig. 6. It can be seen that the evolutionary relationship between E.coli S9922 and 90-9272 is relatively close, while the evolutionary relationship between E.coli S9922 and UTI89 is distant.