2023
DOI: 10.1038/s41422-023-00849-5
|View full text |Cite
|
Sign up to set email alerts
|

The complete and fully-phased diploid genome of a male Han Chinese

Abstract: Since the release of the complete human genome, the priority of human genomic study has now been shifting towards closing gaps in ethnic diversity. Here, we present a fully phased and well-annotated diploid human genome from a Han Chinese male individual (CN1), in which the assemblies of both haploids achieve the telomere-to-telomere (T2T) level. Comparison of this diploid genome with the CHM13 haploid T2T genome revealed significant variations in the centromere. Outside the centromere, we discovered 11,413 st… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
10
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 15 publications
(11 citation statements)
references
References 128 publications
(143 reference statements)
1
10
0
Order By: Relevance
“…2b) and haploid KOREF 20 (Fig. S6) genome assemblies of three individuals places the prevalent ancestry based on highest composition (EUR:54%, EAS:87%, EAS:87%, respectively), consistent with both highest GAI scores, and the published European 18 , Chinese 19 and Korean 20 ancestry for each (Tables S6-S7). Further, ntRoot correctly infers EAS and EUR as the two most prevalent super-population ancestries from a synthetic diploid genome admix of HuRef and CN1 (Fig.…”
Section: Human Ancestry Inference At Scale From Genomic Datasupporting
confidence: 54%
See 1 more Smart Citation
“…2b) and haploid KOREF 20 (Fig. S6) genome assemblies of three individuals places the prevalent ancestry based on highest composition (EUR:54%, EAS:87%, EAS:87%, respectively), consistent with both highest GAI scores, and the published European 18 , Chinese 19 and Korean 20 ancestry for each (Tables S6-S7). Further, ntRoot correctly infers EAS and EUR as the two most prevalent super-population ancestries from a synthetic diploid genome admix of HuRef and CN1 (Fig.…”
Section: Human Ancestry Inference At Scale From Genomic Datasupporting
confidence: 54%
“…5 ntRoot LAI inferences using the chromosome-scale diploid HuRef 18 (Fig. 2a), CN1 19 (Fig. 2b) and haploid KOREF 20 (Fig.…”
Section: Human Ancestry Inference At Scale From Genomic Datamentioning
confidence: 99%
“…Over the past two decades, the human reference genome employed in genetic studies has been meticulously crafted from genomic segments sourced from thousands of individuals (Lander et al 2001; Schneider et al 2017). Despite efforts to assemble high-quality, gapless genomes such as T2T-CHM13 (Nurk et al 2022), T2T-YAO (He et al 2023), CN1 (Yang et al 2023), I002C (https://github.com/LHG-GG/I002C) or HG002 (https://github.com/marbl/HG002), such references raise concerns regarding their abilities to represent genetic variations across diverse human populations accurately. The prevailing consensus is that no singular reference sequence can adequately encapsulate the complex genomic diversity across global populations (Yang et al 2019).…”
Section: Introductionmentioning
confidence: 99%
“…Currently, haplotype-resolved genome assemblies can be generated with trio-based, Hi-C-based, or Strand-seq-based approaches [21][22][23] . These assemblies can then be utilized to construct high-quality pangenome graphs, enhancing our understanding of genetic structure and the evolutionary history of complex genomic regions [24][25][26] . However, generating haplotype-resolved diploid T2T genome assemblies remains challenging, primarily due to the complexities associated with phasing centromeres, subtelomeres, and SDRs 22 .…”
Section: Introductionmentioning
confidence: 99%