The COVID-19 pandemic first emerged in Malaysia in Jan 2020. As of 12th Sept 2021, 1,979,698 COVID-19 cases that occurred over three major epidemic waves were confirmed. The virus contributing to the three epidemic waves has not been well-studied. We sequenced the genome of 22 SARS-CoV-2 strains detected in Malaysia during the second and the ongoing third wave of the COVID-19 epidemic. Detailed phylogenetic and genetic variation analyses of the SARS-CoV-2 isolate genomes were performed using these newly determined sequences and all other available sequences. Results from the analyses suggested multiple independent introductions of SARS-CoV-2 into Malaysia. A new B.1.524(G) lineage with S-D614G mutation was detected in Sabah, East Malaysia and Selangor, Peninsular Malaysia on 7th October 2020 and 14th October 2020, respectively. This new B.1.524(G) group was not the direct descendant of any of the previously detected lineages. The new B.1.524(G) carried a set of genetic variations, including A701V (position variant frequency = 0.0007) in Spike protein and a novel G114T mutation at the 5’UTR. The biological importance of the specific mutations remained unknown. The sequential appearance of the mutations, however, suggests that the spread of the new B.1.524(G) lineages likely begun in Sabah and then spread to Selangor. The findings presented here support the importance of SARS-CoV-2 full genome sequencing as a tool to establish an epidemiological link between cases or clusters of COVID-19 worldwide.
Various methods have been developed for rapid and high throughput full genome sequencing of SARS-CoV-2. Here, we described a protocol for targeted multiplex full genome sequencing of SARS-CoV-2 genomic RNA directly extracted from human nasopharyngeal swabs using the Ion Personal Genome Machine (PGM). This protocol involves concomitant amplification of 237 gene fragments encompassing the SARS-CoV-2 genome to increase the abundance and yield of viral specific sequencing reads. Five complete and one near-complete genome sequences of SARS-CoV-2 were generated with a single Ion PGM sequencing run. The sequence coverage analysis revealed two amplicons (positions 13751-13965 and 23941-24106), which consistently gave low sequencing read coverage in all isolates except 4Apr20-64-Hu. We analyzed the potential primer binding sites within these low covered regions and noted that the 4Apr20-64-Hu possess C at positions 13730 and 23929, whereas the other isolates possess T at these positions. The genetic variations observed suggest that the naturally occurring genome variations present in the actively circulating SARS-CoV-2 strains affected the performance of the target enrichment panel of the Ion AmpliSeq™ SARS‑CoV‑2 Research Panel. The possible impact of other genetic sequence variations warrants further investigation, and an improved version of the Ion AmpliSeq™ SARS‑CoV‑2 Research Panel, hence, should be considered.
Various methods have been developed for rapid and high throughput full genome sequencing of SARS-CoV-2. Here, we described a protocol for targeted multiplex full genome sequencing of SARS-CoV-2 genomic RNA directly extracted from human nasopharyngeal swabs using the Ion Personal Genome Machine (PGM). This protocol involves concomitant amplification of 237 gene fragments encompassing the SARS-CoV-2 genome to increase the abundance and yield of viral specific sequencing reads. Five complete and one near-complete genome sequences of SARS-CoV-2 were generated with a single Ion PGM sequencing run. The sequence coverage analysis revealed two amplicons (positions 13 751-13 965 and 23 941-24 106), which consistently gave low sequencing read coverage in all isolates except 4Apr20-64-Hu. We analyzed the potential primer binding sites within these low covered regions and noted that the 4Apr20-64-Hu possess C at positions 13 730 and 23 929, whereas the other isolates possess T at these positions. The genome nucleotide variations observed suggest that the naturally occurring variations present in the actively circulating SARS-CoV-2 strains affected the performance of the target enrichment panel of the Ion AmpliSeq™ SARS CoV 2 Research Panel. The possible impact of other genome nucleotide variations warrants further investigation, and an improved version of the Ion AmpliSeq™ SARS CoV 2 Research Panel, hence, should be considered.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.