We have clearly showed by analyses that oligonucleotides selected at the deletion boundaries of different COVID-19 proteins useful to detect unknown variant after sequencing (Table-1). In truth, submitted sequences have no data regarding specific variant status and we are unable to get such information after down loading the sequence (www.ncbi.nlm.nih.gov/nucleotide). However, the SARS-CoV-2 Database has such information (not always) to get the variant status. Moreover, many sequences deposited without giving expressed protein data (accession numbers starting OV and OX). Thus, we always need an easy method to know correct lineage of the corona virus. So far, US scientists (Howard D et al; Bankers et al.) are giving correct and enough sequencing data to predict type of corona virus infection is rampant [14]. By comparing the few sequences from other country, one can predict type of infection may arrive in the neighbouring countries [9]. Thus, analysis we have presented giving few oligonucleotides for BLAST-2 sequence comparison to get corona virus variant status is logistic and demanding (see, figures-2,3,4). We hope many remote sequencing facilities in the poor countries have no important software for sequence analysis. But our approach using free NCBI Database, free NCBI/BLAST system and free CLUSTAL-Omega software may be a safe haven for scientists and it will be very applicable to Africa, Latin America, Middle East, and South Asian countries.
We analysed the amino acid composition of spike proteins and found no gross changes in omicron amino acid composition as compared to Wuhan and Alpha strains (www.expasy.org/cgi-bin /portparam). Acidic amino acid (Asp + Glu) and basic amino acids (Arg + Lys) were found 110 and 103 in Wuhan virus, 109 and 103 respectively for B.1.1.7 alpha virus where as such amino acids in Omicron corona virus (BA.1) were found 111 for both acidic and basic amino acids [14]. Instability index for Wuhan, alpha and omicron viruses were found 33.01, 32.82 and 34.65 respectively [14]. Alipathic index for Wuhan virus was found 84.67 and for omicron virus 84.95 and for alpha virus 84.65. Hydrophobic plot indicated some differences as omicron virus is more hydrophobic (data not shown). Such study indicated that during evolution different variants were selected by deletion and point mutation but 3-D structures never drastically changed keeping the Spike-ACE-2 interaction intact as well as SARS-CoV-2 life cycle processes unchanged. In truth, increase of infection during D614G and N501Y mutations was important as well as antibody escape due to E69V70 deletion as well as E452R mutation critically enhanced recurrent infectivity. However, rampant other mutations in the RBD domain in omicron corona virus must have role in transmission and immunity. Perhaps, further development of highly transmissible omicron BA.2, BA.4 and BA.5 corona viruses due to mutation (N435K, E484A, P681H, Q949H) needs to be evaluated in the future. In this aspect, BA.2.75 variant was declared variant of concern by WHO. When compared spike protein of BA.2.48 with BA.2.75, we detected 9 mutations (K144E, W149R, F154L, S245Y, G254S, D336H, G443S, N457K, R490Q). We concluded W149R and N457K could be specific for omicron BA.2.75 variant. Interestingly, BA.2.75 spike have 31 mutations as compared to Wuhan virus.
Whether deletions of nsp1 K131S132F133 amino acid sequence as well as nsp6 S3675G3676F3677 amino acid sequence favours omicron BA.4 variant for transmission is not known (figure-10). But acquisition of both N501Y and D614G mutations in omicron corona viruses including other 27 mutations in RBD domain suggested higher transmission than SARS-CoV-2 Alpha and Delta variants. BA.2 is about 1.5 and 4.2 times as contagious than BA.1 and Delta respectively. Further, BA.2 is found to be able to alarmingly re-infect patients originally infected by omicron BA.1 [25]. BA.4 and BA.5 omicron viruses shows reduced neutralization by the serum from individuals vaccinated with triple doses of AstraZeneca or Pfizer vaccine compared with omicron BA.1 and BA.2 [26].
Two serine mutations to S370F and S374F in omicron BA.2/4/5 and a third serine mutation in BA.1 to S372L as well as in BA.2/4/5 to S372P were surely significant in determining enhanced transmission (figure-3). BA.4 and BA.5 also had F491V mutation that differed from BA.1 and BA.2 and such variants were increasingly replacing BA.1.1 and BA.2.9. Spike protein of B.1.1.7, omicron BA.1 and BA.2 variants were 1270 AA although had differential deletions and insertions where as delta variant S protein was 1271 AA long and that of omicron BA.4 and BA.5 variants were 1268 AA. Thus, length of S protein also predicted a specific type of SARS-CoV-2 variant (figure-2; Table-1B).
However, we admit that many sequencing errors present in the database. As for example, several higher molecular weight ORF1ab proteins (7108 − 7099) were available in the database (UEP73521, UPH81172, UEE28211, UES61213, UFC36273) but might be sequencing error due to duplication or integration of unusual sequence in same reading frame. Further, 1/3 of the SARS-CoV-2 sequences deposited had no protein prediction. Such sequences start with OV and OX (OV316200, OV315626, OX197261, OX129695 etc) and UK origin. One multi-alignment using BA.2 variants predicted 21 mutant oligonucleotides but BLAST search found no such sequences in the specific variant (data available on request). Particularly, CvsT silent mutations in the 3rd codon noticed in most sequences. However, BA2.3-CvsT-8991 oligonucleotide (5’-aga gta cac tga ctt tgt aac atc agc gtg cag t-3’) appeared specific for omicron BA.2.3 variant. Thus, deletions found in proteins of COVID-19 variants were more authentic and such oligonucleotides gave good analyses and reproducible. Thus, our method is reliable to find specific variant type after new sequencing of SARS-CoV-2 (figures-12A/12B).
Question arises if such hyper-variable difference of omicron spike protein could be involved in the vaccine failure for the corona vaccine made from S gene of Wuhan corona virus! Study indicated that only 20% and 24% of BNT162b2 vaccine recipients had detectable neutralizing antibody against the omicron variant HKU691 and HKU344-R346K, respectively, while none of the Coronavac recipients had detectable neutralizing antibody titre against either omicron isolate. Omicron variant escapes neutralizing antibodies elicited by BNT162b2 or Coronavac [27, 28]. Further study suggested that E484K mutation evaded antibody neutralization elicited by infection or vaccination and further enhanced by K417N and N501Y mutations [29–32]. Another study indicated that neutralizing antibodies elicited by inactivated corona virus vaccine and RBD-subunit spike protein vaccine against B.1.617 and B.1.1.7 variants enhanced viral entry and membrane fusion, as well as more resistant to antibody neutralization [33–35]. L452R mutation may be important in BA.4 and BA.5 omicron variants for immunity to neutralization antibody but such mutation not present in BA.1 and BA.2 variants (figure-3). Thus, new omicron vaccine may be needed although in India third booster dose with Covishild vaccine is ongoing. BA.1/2/4/5 omicron corona viruses were found to be highly resistant to the antisera elicited by the mRNA-1273, ChAdOx1, and BNT162b2 vaccines [26, 33]. Surely, oligonucleotides from the hypervariable region could differentiate BA.1 to BA.2. When we aligned different BA.4 and BA.5 variants, we found S gene has 2–3 mutations but we suspect authenticity of such data. As for example, few BA.4.6 variant or BA.5.6 variant have mutation in S gene sequence but not in other designated as same variant! (data available on request). It was reported that one amino acid differences between the BA.4 and BA.5 lineages in OFR1a, ORF6, ORF7b, and Nucleocapsid (N) proteins but no amino acid differences in proteins such as ORF1b, Spike, Envelope (E), Membrane (M), ORF8, and ORF10. Furthermore, only one amino acid in ORF10 differs between the BA.5 and BA.5.1 sub-lineages. Similarly, OFR1a-DEL141/143, ORF7b-D61L, and N-P151S mutations are specific to BA.4, and ORF6-D3N mutation is specific to BA.5 and BA.5.1 lineages [35]. However, we found ORF7b-D61L will be ORF6-D61L because ORF7b is only 43 amino acids and such mutation is prominent in BA.2 lineages (accession nos. ON957923, ON999338). N-P151S mutation as described above was correct for BA.4 lineages but N-S413R was found not present in BA.1 lineage but BA.2/4/5 as well as BE.x and BF.x lineages. We also found N-P13L mutation for all omicron whereas in one BF-10 lineage (accession no. OP257537) N-P13F mutation was noticed. Moreover, BA.4 and BA.5 have been found to be 4–20 times more resistant to monoclonal antibodies such as cilgavimab and Evusheld than BA.2. We also found that Q493R spike mutation located in omicron BA.1 and BA.2 lineages but not detected in BA.4 and BA.5. Our analysis however detected many mutations in spike having contradictory to Desingu & Nagarajan (2022) in sub-variants of BA.4 as well as BA.5 indicating rapid spread of such variant replacing omicron BA.2 variant [35]. As for example, we found V3G, T341R and S653N spike mutations in BA.4 lineages including one K142E (protein id. OPO51195) and one R185M mutation (protein id. OPO51217). Analysis also detected L5F, G176E, S879F, Q1196L mutations in one BA.4.5 variant (accession no. OP257738) as well as one T1268L mutation in BA.4.6 (accession no. OP258078). Similarly, BA.5 lineages have some T71I as well as Q949R/H mutation in many omicron BA.5.5 lineages, one S1165F mutation in BA.5.2 (protein id. UTN60872) and N435K mutation in many omicron BA.5.2.1lineages.
In recent omicron variant BA.2.75 we found new spike mutations K144E, W149R/H, G254S and N458K mutations but roles of such mutations remain elusive (figure-2; figure-13). As for example, N435K mutation in spike was found very much prominent in recent omicron isolates (BA.1.1, BA.2.75, BA.5.5 and BA.5.2.1). Thus, the oligonucleotide > MutS435-omicron (5’-gga att cta aca agc ttg att cta agg-3’) would be important to track such omicron transmission whereas N435K mutation was not present in Alpha and Delta variants. Further, oligonucleotide MutS341-BA.4.6 (5’-gtt ttt aac gcc acc aca ttt gca tct gt-3’) appeared very specific for omicron BA.4.6 which was spreading now. Thus, mutation might be there but specificity was not found when we analysed BlastN seq-2 between oligonucleotide and genomic sequences (Accession number) of BA.1, BA.2, BA.4, BA.5 omicron viruses as well as Alpha, Beta, Delta, Zeta, Gamma and Wuhan variant. Finally, our method was based on amino specific acid change which we had done by BlastX search of 60nt mutant oligonucleotide. Definitely we searched a lot but the major emphasis was given to omicron viruses which lately transmitted in huge population. Omicron viruses now combined with Alpha and Delta virusesSARS-CoV-2 producing many chimera human corona viruses. As for example, we detected a deletion in ORF7a in some variants (accession numbers: ON836740/ON944596/OP257615). Analysis of ON944596 produced 31LPP and 157FR deletions in spike protein as well as FSTQ deletion (5’-TTT AGC ACT AAT-3’) in ORF7a protein near 27506nt suggesting it would be a Delta-BA.2 chimera. It had 65 mutations when aligned with BA.4.0 variant (accession no. ON907393) but had only 30 mutations with Delta variant (accession no. OM542166). Similarly, another FSTQ deleted SARS-CoV-2 had no KSF deletion at nsp1 but 31LPP deletion in spike and likely a BA.5-like (99.82% similarity; 54 mutations) human corona virus although no 69HV deletion was found. Truly, the sequence 100% homology to MutORF8-64-BA.5.0 oligonucleotide confirming BA.5-like variant.