The first Korean genome sequence and analysis: Full genome sequencing for a socio-ethnic group

Ahn, Sung‐Min; Kim, Tae-Hyung; Lee, Sung-Hoon; Kim, Deokhoon; Ghang, Ho; Kim, Dae-Soo; Kim, Byoung-Chul; Kim, Sang Yoon; Kim, Woo-Yeon; Kim, Chulhong; Park, Daeui; Lee, Yong Seok; Kim, Sang Soo; Reja, Rohit; Jho, Sungwoong; Kim, Chang Geun; Cha, Ji-Young; Kim, Kyung‐Hee; Lee, Bong-Hee; Bhak, Jong; Kim, Seong‐Jin

doi:10.1101/gr.092197.109

Cited by 291 publications

(243 citation statements)

References 37 publications

(36 reference statements)

Supporting

Mentioning

229

Contrasting

Unclassified

Order By: Relevance

“…Another commonly used approach is to apply quality filters that are aimed at selectively removing errors. Every whole-genome sequence reported so far has used filtering to some extent: the most commonly used filters being those that remove sequences with a too-low coverage depth, discard variants with a low-confidence score or eliminate variants located within a cluster of variants 3,7,[10][11][12][13][14][15][16][17][18][19][20][21][22][23][24][25] . Surprisingly, there is little consensus with respect to which filters should be used and at which threshold they should be applied.…”

mentioning

confidence: 99%

“…As a result, each reported study developed its own heterogeneous set of filters and applied them at various (suboptimal) thresholds. For instance, in the case of the coverage depth filter, thresholds removing sequences with a coverage depth <4×, <10× or <11× were applied 6,10,11,16,17,23,24 . Additionally, it has not been assessed to which extent filters discard true variants, and how each filter can be optimized in terms of sensitivity and specificity.…”

mentioning

confidence: 99%

See 1 more Smart Citation

Optimized filtering reduces the error rate in detecting genomic variants by short-read sequencing

et al. 2011

View full text Add to dashboard Cite

mentioning

confidence: 99%

mentioning

confidence: 99%

Optimized filtering reduces the error rate in detecting genomic variants by short-read sequencing

et al. 2011

View full text Add to dashboard Cite

“…So far, seven human diploid genomes have been fully sequenced, and some important insights have been gained from these resequencing studies. [145][146][147][148][149][150][151] The most prominent finding from these studies is that, besides SNPs, other genetic variants are also abundant in the human genome. These studies found that in addition to the 3-4 million SNPs, several hundred-thousand of short indels (for example, sizes defined as 3 and 16 bp or less in the Bentley et al 147 and Wang et al 148 study, respectively) are also present in each individual human genome.…”

Section: Human Genetic Variationmentioning

confidence: 99%

The pursuit of genome-wide association studies: where are we now?

et al. 2010

View full text Add to dashboard Cite

It is now 5 years since the first genome-wide association studies (GWAS), published in 2005, identified a common risk allele with large effect size for age-related macular degeneration in a small sample set. Following this exciting finding, researchers have become optimistic about the prospect of the genome-wide association approach. However, most of the risk alleles identified in the subsequent GWAS for various complex diseases are common with small effect sizes (odds ratio o1.5). So far, more than 450 GWAS have been published and the associations of greater than 2000 single nucleotide polymorphisms (SNPs) or genetic loci were reported. The aim of this review paper is to give an overview of the evolving field of GWAS, discuss the progress that has been made by GWAS and some of the interesting findings, and summarize what we have learned over the past 5 years about the genetic basis of human complex diseases. This review will focus on GWAS of SNPs association for complex diseases but not studies of copy number variations.

show abstract

“…As such, the remaining several hundred thousands of indels in the range of several nucleotides to tens of nucleotides, which were identified in the recent whole genome resequencing experiments, do not currently have their own category. [27][28][29][30][31][32][33] For example, Wang et al (2008) 29 found B140 000 indels within 1-3 bp in the Han Chinese YH genome, and B400 000 indels defined from 1 to 16 bp were also detected in the African NA18507 genome by Bentley et al (2008). 30 Perhaps a new category such as 'short indels' needs to be created to fit them in, and those indels between 100 bp to 1 kb should probably be renamed as 'intermediate indels' (Figures 1c and d).…”

Section: Categories Of Genetic Variationsmentioning

confidence: 99%

“…2 However, the richness of genetic variations in the human genome has recently been further corroborated by the several whole genome resequencing studies, revealing plenty of new SNPs, indels, CNVs and other structural variations. [27][28][29][30][31][32][33] The technological developments have facilitated and accelerated the process of identifying genetic variations, especially with the arrival of next generation sequencing technologies, which have made whole genome resequencing and the 1000 Genomes Project feasible. [53][54][55] In recent years, many studies have been done to directly examine the associations of CNVs with complex diseases using SNP genotyping arrays.…”

Section: The Evolution Of Genetic Markers In Disease Gene Mappingmentioning

confidence: 99%

The discovery of human genetic variations and their use as disease markers: past, present and future

Loy

Salim

et al. 2010

J Hum Genet

View full text Add to dashboard Cite

The field of human genetic variations has progressed rapidly over the past few years. It has added much information and deepened our knowledge and understanding of the diversity of genetic variations in the human genome. This significant progress has been driven mainly by the developments of microarray and next generation sequencing technologies. The array-based methods have been widely used for large-scale copy number variation (CNV) detection in the human genome. The arrival of next generation sequencing technologies, which enabled the completion of several whole genome resequencing studies, has also resulted in a massive discovery of genetic variations. These studies have identified several hundred thousand short indels and a total of thousands of CNVs and other structural variations in the human genome. The discovery of these 'newer' types of genetic variations, indels, CNVs and copy neutral variations (inversions and translocations) has also widened the scope of genetic markers in human genetic and disease gene mapping studies. The aim of this review article is to summarize the latest developments in the discovery of human genetic variations and address the issue of inadequate coverage of genetic variations in the current genome-wide association studies, which mainly focuses on common SNPs. Finally, we also discuss the future directions in the field and their impacts on next generation genome-wide association studies.

show abstract

The first Korean genome sequence and analysis: Full genome sequencing for a socio-ethnic group

Cited by 291 publications

References 37 publications

Optimized filtering reduces the error rate in detecting genomic variants by short-read sequencing

Optimized filtering reduces the error rate in detecting genomic variants by short-read sequencing

The pursuit of genome-wide association studies: where are we now?

The discovery of human genetic variations and their use as disease markers: past, present and future

Contact Info

Product

Resources

About