2022
DOI: 10.1093/bib/bbac157
|View full text |Cite
|
Sign up to set email alerts
|

A systematic evaluation of data processing and problem formulation of CRISPR off-target site prediction

Abstract: CRISPR/Cas9 system is widely used in a broad range of gene-editing applications. While this editing technique is quite accurate in the target region, there may be many unplanned off-target sites (OTSs). Consequently, a plethora of computational methods have been developed to predict off-target cleavage sites given a guide RNA and a reference genome. However, these methods are based on small-scale datasets (only tens to hundreds of OTSs) produced by experimental techniques to detect OTSs with a low signal-to-no… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(18 citation statements)
references
References 46 publications
(25 reference statements)
1
9
0
Order By: Relevance
“…We observed opposite effects of class weighting on the XGBoost and neural network models. Consistent with our previous study (35), the classification and regression XGBoost models benefit from class weighting in all model variations. However, when examining the neural network models, we found no such improvement and even a decrease in the average AUPR for most models.…”
Section: Resultssupporting
confidence: 87%
See 3 more Smart Citations
“…We observed opposite effects of class weighting on the XGBoost and neural network models. Consistent with our previous study (35), the classification and regression XGBoost models benefit from class weighting in all model variations. However, when examining the neural network models, we found no such improvement and even a decrease in the average AUPR for most models.…”
Section: Resultssupporting
confidence: 87%
“…In our deep neural networks, the prediction of off-target activity is based on aligned sgRNA and off-target sequences, which include the PAM. To accommodate aligned sequences with bulges, we extend the one-hot-encoding approach utilized in our previous study (35), which could only handle mismatches, to handle bulges. Given an aligned sgRNA sequence S t and an aligned off-target sequence S o , we encode the pair of aligned nucleotides or gaps (represented by -) at position i in the aligned sequences into a 5 × 5 matrix C i ( j t , j o ) using the following scheme: where 1 ≤ j t , j o ≤ 5 represent the four possible nucleotides and a gap.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…This technique relies on the delivery of double-stranded oligonucleotides (dsODNs) with known sequences, which can integrate into DSBs during NHEJ (non-homologous end joining). The integrated dsODNs provide templates for targeted PCR amplification and sequencing of the tagged DNA fragments ( Tsai et al, 2015 ; Malinin et al, 2021 ; Yaish et al, 2022 ) ( Figure 1B ). GUIDE-seq can detect off-target sites with indel frequencies as low as 0.03% ( Tsai et al, 2015 ).…”
Section: Experimental Detectionmentioning
confidence: 99%