“…For example, hg19 and hg38-expressed CFHR-Factor H complex genes CFHR1 and CFHR3 are linked with atypical hemolytic uremic syndrome 36, 37 and fall within a region harboring population-specific copy number variations. The absence of these genes in CHM13 could be due to the reliance on a single cell line, especially in contrast to the genetic diversity from multiple cell lines underlying hg38 38 . We detected quantification of CFHR1 in fibroblast and muscle, and CFHR3 in iPS and iPS neural progenitor cells for hg19 and hg38.…”
Section: Resultsmentioning
confidence: 99%
“…We detected quantification of CFHR1 in fibroblast and muscle, and CFHR3 in iPS and iPS neural progenitor cells for hg19 and hg38. One study reports that detection of the disease-causing structural variants was not possible when aligning to CHM13, even with long-read sequencing 38 , suggesting CFHR-Factor H complex disorders should not be evaluated using CHM13v2. The absence of these genes in CHM13v2 likely influences mapping in CHM13 of other CFHR-Factor H complex genes – CFHR4 (also linked to atypical hemolytic uremic syndrome) is detected as quantified only in CHM13 in iPSC NPC with a median TPM of 4.4.…”
Transcriptomics is a powerful tool for unraveling the molecular effects of genetic variants and disease diagnosis. Prior studies have demonstrated that choice of genome build impacts variant interpretation and diagnostic yield for genomic analyses. To identify the extent genome build also impacts transcriptomics analyses, we studied the effect of the hg19, hg38, and CHM13 genome builds on expression quantification and outlier detection in 386 rare disease and familial control samples from both the Undiagnosed Diseases Network (UDN) and Genomics Research to Elucidate the Genetics of Rare Disease (GREGoR) Consortium. We identified 2,800 genes with build-dependent quantification across six routinely-collected biospecimens, including 1,391 protein-coding genes and 341 known rare disease genes. We further observed multiple genes that only have detectable expression in a subset of genome builds. Finally, we characterized how genome build impacts the detection of outlier transcriptomic events. Combined, we provide a database of genes impacted by build choice, and recommend that transcriptomics-guided analyses and diagnoses are cross-referenced with these data for robustness.
“…For example, hg19 and hg38-expressed CFHR-Factor H complex genes CFHR1 and CFHR3 are linked with atypical hemolytic uremic syndrome 36, 37 and fall within a region harboring population-specific copy number variations. The absence of these genes in CHM13 could be due to the reliance on a single cell line, especially in contrast to the genetic diversity from multiple cell lines underlying hg38 38 . We detected quantification of CFHR1 in fibroblast and muscle, and CFHR3 in iPS and iPS neural progenitor cells for hg19 and hg38.…”
Section: Resultsmentioning
confidence: 99%
“…We detected quantification of CFHR1 in fibroblast and muscle, and CFHR3 in iPS and iPS neural progenitor cells for hg19 and hg38. One study reports that detection of the disease-causing structural variants was not possible when aligning to CHM13, even with long-read sequencing 38 , suggesting CFHR-Factor H complex disorders should not be evaluated using CHM13v2. The absence of these genes in CHM13v2 likely influences mapping in CHM13 of other CFHR-Factor H complex genes – CFHR4 (also linked to atypical hemolytic uremic syndrome) is detected as quantified only in CHM13 in iPSC NPC with a median TPM of 4.4.…”
Transcriptomics is a powerful tool for unraveling the molecular effects of genetic variants and disease diagnosis. Prior studies have demonstrated that choice of genome build impacts variant interpretation and diagnostic yield for genomic analyses. To identify the extent genome build also impacts transcriptomics analyses, we studied the effect of the hg19, hg38, and CHM13 genome builds on expression quantification and outlier detection in 386 rare disease and familial control samples from both the Undiagnosed Diseases Network (UDN) and Genomics Research to Elucidate the Genetics of Rare Disease (GREGoR) Consortium. We identified 2,800 genes with build-dependent quantification across six routinely-collected biospecimens, including 1,391 protein-coding genes and 341 known rare disease genes. We further observed multiple genes that only have detectable expression in a subset of genome builds. Finally, we characterized how genome build impacts the detection of outlier transcriptomic events. Combined, we provide a database of genes impacted by build choice, and recommend that transcriptomics-guided analyses and diagnoses are cross-referenced with these data for robustness.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.