2017
DOI: 10.1101/192344
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Comprehensive, integrated, and phased whole-genome analysis of the primary ENCODE cell line K562

Abstract: K562 is one of the most widely used human cell lines in biomedical research. It is one of three tier-one cell lines of ENCODE, and one of the cell lines most commonly used for large-scale CRISPR/Cas9 gene-editing screens. Although the functional genomic and epigenomic characteristics of K562 are extensively studied, its genome sequence has never been comprehensively analyzed and higher-order structural features of its genome beyond its karyotype were only cursorily known. The high degree of aneuploidy in K562 … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
34
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
5

Relationship

1
9

Authors

Journals

citations
Cited by 29 publications
(37 citation statements)
references
References 113 publications
3
34
0
Order By: Relevance
“…Additionally, the models were also tested with the data from chromosomes 7 and 8 of the cell line K562 5 . To increase the sensitivity of the models with mutations from patients, DNA sequences of boundaries were mutated with mutations from corresponding cell lines GM12878 22 and K562 23 for training, validating and testing. All datasets have approximately the same numbers of positive and negative samples and we used area under the receiver operating characteristic curve (AUROC) to measure the performance of the models.…”
Section: Resultsmentioning
confidence: 99%
“…Additionally, the models were also tested with the data from chromosomes 7 and 8 of the cell line K562 5 . To increase the sensitivity of the models with mutations from patients, DNA sequences of boundaries were mutated with mutations from corresponding cell lines GM12878 22 and K562 23 for training, validating and testing. All datasets have approximately the same numbers of positive and negative samples and we used area under the receiver operating characteristic curve (AUROC) to measure the performance of the models.…”
Section: Resultsmentioning
confidence: 99%
“…In iPSCs, we observed an average 1.6-fold increase in estimated cell number for gRNAs targeting X, and a 3.2-fold increase in cell number for non-targeting gRNAs ( Figure 3C) compared to non-essential gene targeting guides. The effect in K562 cells was substantially smaller (1.5-fold increase in cells with non-targeting gRNAs, Figure 3D) likely reflecting the complex karyotype (Zhou et al 2019) and DNA repair activity of the K562 cell line. We further performed assays to compare survival and metabolic activity of cells five days post infection for the single gRNA Yusa 1.0 library, and the minimized double gRNA library.…”
Section: Double Grna Library Enables Screens In Ipscsmentioning
confidence: 97%
“…Furthermore, we also compared our distance based model for predicting contact frequencies between genomic pairs crossing several validated somatic deletions in K562 sample [50] and compared against the experimentally Hi-C matrix of the same sample [31]. The results of K562 comparison can be seen in Supplementary Figure S2 and Supplementary Table S2.…”
Section: Evaluating the Hi-c Contact Frequency Predictionmentioning
confidence: 99%