“…Sequences with a 99.7% identity to HPIV2 strain t146a293_HPIV2 (MH892406.1) were identified in SRA datasets from multiple different animal species sequenced by He el al. (2022) (Jones et al 2022a). That the same HPIV2 strain so closely matched a human strain, was found in multiple animal species, was associated with human genomic contamination and was found to comprise 28% of sample MJ-ZJ-MO-3 almost certainly indicates the virus was not related to a natural infection of pangolins as proposed by He et al (2022), but stems from laboratory contamination (Jones et al 2022a).…”
Section: Discussionmentioning
confidence: 86%
“…(2022) (Jones et al 2022a). That the same HPIV2 strain so closely matched a human strain, was found in multiple animal species, was associated with human genomic contamination and was found to comprise 28% of sample MJ-ZJ-MO-3 almost certainly indicates the virus was not related to a natural infection of pangolins as proposed by He et al (2022), but stems from laboratory contamination (Jones et al 2022a). Alternatively it is also possible the synthetic vectors identified could have been sourced via contamination from unrelated research.…”
Section: Discussionmentioning
confidence: 88%
“…Human hosted viruses are found in high count numbers and coverage including Human papillomaviruses (HPVs) and Human endogenous retrovirus (HERV) K113. Human orthorubulavirus 2 (Human parainfluenza virus 2 (HPIV2)) strain t146a293_HPIV2 (Accession MH892406.1) is found at extremely high levels in pangolin samples MJ-ZJ-MO-4 and MJ-ZJ-MO-3 at 5.91% and 27.66% of the metagenome respectively and the same strain was identified in Marmota himalayana (Himalayan marmot) sample MH-HeB-NA-1 (Jones et al 2022a). Using a Spearman correlation matrix, the presence of HPIV2 is moderately correlated with HPV and HERV K113 abundances (Supp.…”
Section: Human and Murine Hosted Virusesmentioning
confidence: 99%
“…He et al note "no viruses closely related to either SARS-CoV or SARS-CoV-2 (or other sarbecoviruses) were detected in any of animals examined", and no detection of SARSr-CoVs is mentioned anywhere in virus species classification of the samples. However, Jones et al (2022a) unexpectedly identified a novel SARS2r-CoV in 6 pangolin (Manis javanica and Manis pentadactyla) and 3 Malayan porcupine (Hystrix brachyura) metatranscriptomic datasets. The SARS2r-CoV partial genome has high homology to the GX pangolin CoVs in the NSP 4 N and C terminal regions but is more closely related to bat-SL-CoVZC45 in the RdRp coding region.…”
Section: Introductionmentioning
confidence: 99%
“…Here we expand on our previous analysis to find GX_ZC45r-CoV, a novel bat-SL-CoVZC45-related coronavirus, in two coypu (Myocastor coypus), two Malayan porcupine (Hystrix brachyura), one each of Asian badger (Meles leucurus), Masked palm civet (Paguma larvata) and Hoary bamboo rat (Rhizomys pruinosus) datasets sequenced by He at al. (2022), in addition to the 6 pangolin (Manis javanica and Manis pentadactyla) and three Malayan porcupine datasets described in Jones et al (2022a). We undertake more extensive mitochondrial alignments, human mitochondrial haplogroup analysis and more detailed phylogenetic analysis of genomic regions covered by the recovered SARS2r-CoV sequence and propose that the fragmentary nature of the genome indicates genetic manipulation.…”
Pangolins are the only animals other than bats proposed to have been infected with SARS-CoV-2 related coronaviruses (SARS2r-CoVs) prior to the COVID-19 pandemic. Here we examine the novel SARS2r-CoV we previously identified in game animal metatranscriptomic datasets sequenced by He et al. (2022) and find that sections of the partial genome phylogenetically group with Guangxi (GX) pangolin CoVs (GX PCoVs) while the full RdRp sequence groups with bat-SL-CoVZC45. While the novel SARS2r-CoV is found in 6 pangolin datasets, the same CoV is also found in 10 additional NGS datasets from 5 separate mammalian species and is likely related to contamination by a laboratory researched virus. We find that NGS datasets containing the novel SARS2r-CoV are contaminated with significant Homo sapiens genetic material, and numerous viruses not associated with the host animals sampled. We further identify the dominant human haplogroup of the contaminating H.sapiens genetic material to be F1c1a1, which is of East Asian provenance. The association of this novel CoV with both bat CoV and the Guangxi pangolin CoV (GX PCoV) clades is an important step towards identifying the origin of the GX PCoVs.
“…Sequences with a 99.7% identity to HPIV2 strain t146a293_HPIV2 (MH892406.1) were identified in SRA datasets from multiple different animal species sequenced by He el al. (2022) (Jones et al 2022a). That the same HPIV2 strain so closely matched a human strain, was found in multiple animal species, was associated with human genomic contamination and was found to comprise 28% of sample MJ-ZJ-MO-3 almost certainly indicates the virus was not related to a natural infection of pangolins as proposed by He et al (2022), but stems from laboratory contamination (Jones et al 2022a).…”
Section: Discussionmentioning
confidence: 86%
“…(2022) (Jones et al 2022a). That the same HPIV2 strain so closely matched a human strain, was found in multiple animal species, was associated with human genomic contamination and was found to comprise 28% of sample MJ-ZJ-MO-3 almost certainly indicates the virus was not related to a natural infection of pangolins as proposed by He et al (2022), but stems from laboratory contamination (Jones et al 2022a). Alternatively it is also possible the synthetic vectors identified could have been sourced via contamination from unrelated research.…”
Section: Discussionmentioning
confidence: 88%
“…Human hosted viruses are found in high count numbers and coverage including Human papillomaviruses (HPVs) and Human endogenous retrovirus (HERV) K113. Human orthorubulavirus 2 (Human parainfluenza virus 2 (HPIV2)) strain t146a293_HPIV2 (Accession MH892406.1) is found at extremely high levels in pangolin samples MJ-ZJ-MO-4 and MJ-ZJ-MO-3 at 5.91% and 27.66% of the metagenome respectively and the same strain was identified in Marmota himalayana (Himalayan marmot) sample MH-HeB-NA-1 (Jones et al 2022a). Using a Spearman correlation matrix, the presence of HPIV2 is moderately correlated with HPV and HERV K113 abundances (Supp.…”
Section: Human and Murine Hosted Virusesmentioning
confidence: 99%
“…He et al note "no viruses closely related to either SARS-CoV or SARS-CoV-2 (or other sarbecoviruses) were detected in any of animals examined", and no detection of SARSr-CoVs is mentioned anywhere in virus species classification of the samples. However, Jones et al (2022a) unexpectedly identified a novel SARS2r-CoV in 6 pangolin (Manis javanica and Manis pentadactyla) and 3 Malayan porcupine (Hystrix brachyura) metatranscriptomic datasets. The SARS2r-CoV partial genome has high homology to the GX pangolin CoVs in the NSP 4 N and C terminal regions but is more closely related to bat-SL-CoVZC45 in the RdRp coding region.…”
Section: Introductionmentioning
confidence: 99%
“…Here we expand on our previous analysis to find GX_ZC45r-CoV, a novel bat-SL-CoVZC45-related coronavirus, in two coypu (Myocastor coypus), two Malayan porcupine (Hystrix brachyura), one each of Asian badger (Meles leucurus), Masked palm civet (Paguma larvata) and Hoary bamboo rat (Rhizomys pruinosus) datasets sequenced by He at al. (2022), in addition to the 6 pangolin (Manis javanica and Manis pentadactyla) and three Malayan porcupine datasets described in Jones et al (2022a). We undertake more extensive mitochondrial alignments, human mitochondrial haplogroup analysis and more detailed phylogenetic analysis of genomic regions covered by the recovered SARS2r-CoV sequence and propose that the fragmentary nature of the genome indicates genetic manipulation.…”
Pangolins are the only animals other than bats proposed to have been infected with SARS-CoV-2 related coronaviruses (SARS2r-CoVs) prior to the COVID-19 pandemic. Here we examine the novel SARS2r-CoV we previously identified in game animal metatranscriptomic datasets sequenced by He et al. (2022) and find that sections of the partial genome phylogenetically group with Guangxi (GX) pangolin CoVs (GX PCoVs) while the full RdRp sequence groups with bat-SL-CoVZC45. While the novel SARS2r-CoV is found in 6 pangolin datasets, the same CoV is also found in 10 additional NGS datasets from 5 separate mammalian species and is likely related to contamination by a laboratory researched virus. We find that NGS datasets containing the novel SARS2r-CoV are contaminated with significant Homo sapiens genetic material, and numerous viruses not associated with the host animals sampled. We further identify the dominant human haplogroup of the contaminating H.sapiens genetic material to be F1c1a1, which is of East Asian provenance. The association of this novel CoV with both bat CoV and the Guangxi pangolin CoV (GX PCoV) clades is an important step towards identifying the origin of the GX PCoVs.
HKU4-related coronaviruses (CoVs) are merbecoviruses related to Middle Eastern Respiratory Syndrome coronavirus (MERS-CoV). In 2022 and 2023, two HKU4-related CoV strains were discovered in Manis javanica (Malayan pangolin) metagenomic datasets derived from organ samples: HKU4-P251T and MjHKU4r-CoV-1. Together with the Tylonycteris robustula bat CoV 162275, which was discovered in 2022, pangolin CoVs HKU4-P251T and MjHKU4r-CoV-1 form a novel phylogenetic clade distinct from all previously documented HKU4-related CoVs. In this study, we identified a novel HKU4-related CoV in a pangolin single-cell sequencing dataset generated by BGI-Shenzhen in Shenzhen, Guangdong, China in 2020. The CoV phylogenetically belongs to the same newly identified clade. The single cell datasets were reported as generated from organ samples of a single pangolin that died of natural causes. 98% of the HKU4-related CoV reads were found in only one of the seven single cell datasets -- a large intestine cell dataset, cells of which exhibit low expression of DPP4. Bacterial contamination was found to be moderately correlated with HKU4-related CoV presence. We further identified with high confidence that the RNA-Seq dataset supporting one of four near identical variants of MjHKU4r-CoV-1 is a Sus scrofa (wild pig) metagenomic dataset, with only a trace level of Manis javanica genomic content. The presence of HKU4-related CoV reads in the dataset are almost certainly laboratory research-related and not from a premortal pangolin or pig infection. Our findings raise concerns about the provenance of the novel HKU4-related CoV we identify here, MjHKU4r-CoV-1 and its four near-identical variants.
Pangolins are the only animals other than bats proposed to have been infected with SARS-CoV-2related coronaviruses (SARS2r-CoVs) prior to the COVID-19 pandemic. Here we examine thenovel SARS2r-CoV we previously identified in game animal metatranscriptomic datasetssequenced by He et al. (2022) and find that sections of the partial genome phylogenetically groupwith Guangxi (GX) pangolin CoVs (GX PCoVs), while the full RdRp sequence groups with bat-SL-CoVZC45. While the novel SARS2r-CoV is found in 6 pangolin datasets, the same CoV isalso found in 10 additional NGS datasets from 5 separate mammalian species and is likelyrelated to contamination by a laboratory researched virus. Absence of bat mitochondrialsequences from the datasets, the fragmentary nature of the virus sequence and the presence of apartial sequence of a cloning vector attached to a SARS2r-CoV read suggests that it has beencloned. We find that NGS datasets containing the novel SARS2r-CoV are contaminated withsignificant Homo sapiens genetic material, and numerous viruses not associated with the hostanimals sampled. We further identify the dominant human haplogroup of the contaminatingH.sapiens genetic material to be F1c1a1, which is of East Asian provenance. The association ofthis novel CoV with both bat CoV and the Guangxi pangolin CoV (GX PCoV) clades is animportant step towards identifying the origin of the GX PCoVs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.