2017
DOI: 10.1038/s41598-017-12825-2
|View full text |Cite
|
Sign up to set email alerts
|

Large-scale DNA Barcode Library Generation for Biomolecule Identification in High-throughput Screens

Abstract: High-throughput screens allow for the identification of specific biomolecules with characteristics of interest. In barcoded screens, DNA barcodes are linked to target biomolecules in a manner allowing for the target molecules making up a library to be identified by sequencing the DNA barcodes using Next Generation Sequencing. To be useful in experimental settings, the DNA barcodes in a library must satisfy certain constraints related to GC content, homopolymer length, Hamming distance, and blacklisted subseque… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
13
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(13 citation statements)
references
References 22 publications
(13 reference statements)
0
13
0
Order By: Relevance
“…Current state-of-the art error-correcting DNA barcoding applications often use Hamming or Levenshtein error correction strategies ( 20 , 23 ). Hamming codes only correct substitutions, and are thus insufficient for any DNA barcode applications with indels ( 26 ).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Current state-of-the art error-correcting DNA barcoding applications often use Hamming or Levenshtein error correction strategies ( 20 , 23 ). Hamming codes only correct substitutions, and are thus insufficient for any DNA barcode applications with indels ( 26 ).…”
Section: Resultsmentioning
confidence: 99%
“…Error-correcting barcodes must efficiently detect and correct all DNA sequencing and synthesis errors. Many current DNA barcode strategies repurpose error-correcting codes developed for computers ( 18 , 19 ), such as Hamming or Reed–Solomon codes, to DNA applications ( 20 , 21 ). Hamming distance (i.e., the number of substitutions between two sequences of equal length) is possibly the most used due to its simplicity.…”
mentioning
confidence: 99%
“…Among these are guanine-cytosine (GC) content, homopolymer length, and certain sequences that must be avoided because of their natural presence in a sample or their recognition by a restriction enzyme. 38 These limitations imply that most techniques based on random synthesis of a DNA barcode greatly overestimate the number of useful barcodes that can be generated when groups calculate the theoretical number as an exponential function (4 number of bases ). Taking into account these considerations, Lyons et al 38 provide a framework for generating billions of acceptable DNA barcodes.…”
Section: Barcoding Antibodies For Transcriptomics and Proteomicsmentioning
confidence: 99%
“…(Mayr and Bojanic, 2009) The size and possible structures attainable in small-molecule space endow them with unique features to study and modulate biology. (Lyons et al, 2017) Recently, these libraries are important trends for synthesized molecules over this time period. In their analysis, they found that average molecular weights increased by 100 Da from the 1950s to the 2000s, leading to so-called "molecular obesity."…”
Section: Introductionmentioning
confidence: 99%