Taehoon Ryu scite author profile

DNA-based data storage has emerged as a promising method to satisfy the exponentially increasing demand for information storage. However, practical implementation of DNA-based data storage remains a challenge because of the high cost of data writing through DNA synthesis. Here, we propose the use of degenerate bases as encoding characters in addition to A, C, G, and T, which augments the amount of data that can be stored per length of DNA sequence designed (information capacity) and lowering the amount of DNA synthesis per storing unit data. Using the proposed method, we experimentally achieved an information capacity of 3.37 bits/character. The demonstrated information capacity is more than twice when compared to the highest information capacity previously achieved. The proposed method can be integrated with synthetic technologies in the future to reduce the cost of DNA-based data storage by 50%.

show abstract

DNA Micro‐Disks for the Management of DNA‐Based Data Storage with Index and Write‐Once–Read‐Many (WORM) Memory Features

Choi

Bae

Lee

et al. 2020

Advanced Materials

View full text Add to dashboard Cite

DNA‐based data storage has attracted attention because of its higher physical density of the data and longer retention time than those of conventional digital data storage. However, previous DNA‐based data storage lacked index features and the data quality of storage after a single access was not preserved, obstructing its industrial use. Here, DNA micro‐disks, QR‐coded micro‐sized disks that harbor data‐encoded DNA molecules for the efficient management of DNA‐based data storage, are proposed. The two major features that previous DNA‐based data‐storage studies could not achieve are demonstrated. One feature is accessing data items efficiently by indexing the data‐encoded DNA library. Another is achieving write‐once–read‐many (WORM) memory through the immobilization of DNA molecules on the disk and their enrichment through in situ DNA production. Through these features, the reliability of DNA‐based data storage is increased by allowing selective and multiple accession of data‐encoded DNA with lower data loss than previous DNA‐based data storage methods.

show abstract

A high-throughput optomechanical retrieval method for sequence-verified clonal DNA from the NGS platform

Lee

Kim²,

Kim

et al. 2015

Nat Commun

View full text Add to dashboard Cite

Writing DNA plays a significant role in the fields of synthetic biology, functional genomics and bioengineering. DNA clones on next-generation sequencing (NGS) platforms have the potential to be a rich and cost-effective source of sequence-verified DNAs as a precursor for DNA writing. However, it is still very challenging to retrieve target clonal DNA from high-density NGS platforms. Here we propose an enabling technology called ‘Sniper Cloning’ that enables the precise mapping of target clone features on NGS platforms and non-contact rapid retrieval of targets for the full utilization of DNA clones. By merging the three cutting-edge technologies of NGS, DNA microarray and our pulse laser retrieval system, Sniper Cloning is a week-long process that produces 5,188 error-free synthetic DNAs in a single run of NGS with a single microarray DNA pool. We believe that this technology has potential as a universal tool for DNA writing in biological sciences.

show abstract

A comparison of the nutrient composition and statistical profile in red pepper fruits (Capsicums annuum L.) based on genetic and environmental factors

et al. 2019

View full text Add to dashboard Cite

Red peppers are a remarkable source of nutrients in the human diet. However, comprehensive studies have not reported on the effects of genotype, cultivation region, and year on pepper fruit characteristics. To address this, 12 commercial pepper varieties were grown at two locations in South Korea, during 2016 and 2017, representing four environments, and concentrations of proximate, minerals, amino acids, fatty acids, capsaicinoids, and free sugars in pepper pericarps were determined. Variation in most nutrients was observed among the 12 varieties grown within each location in each year, indicating a significant genotype effect. Statistical analysis of combined data showed significant differences among varieties, locations, and years for the measured components. The % variability analysis demonstrated that environment (location and year) and genotype-environment interaction contributed more to the nutritional contents than genotype alone. Particularly, variation in many amino acids, capsaicinoids, free sugars, and myristic acid was attributed to location. Year effect was significant for palmitoleic acid, ash, tryptophan, copper, linolenic acid, crude fiber, and tyrosine. Insoluble dietary fiber, soluble dietary fiber, sodium, sulfate, linoleic acid, and alanine were primarily varied by genotype–environment interaction. Palmitic acid was the trait the most highly affected by genotype. Cultivation and the genotype–environment interaction have a major role in determining the composition of 12 pepper varieties across four environments. The data from this study could explain the natural variation in the compositional data of peppers by genotypes and environments.

show abstract

Purification of multiplex oligonucleotide libraries by synthesis and selection

Choi

et al. 2021

Nat Biotechnol

View full text Add to dashboard Cite

Spatial epitranscriptomics reveals A-to-I editome specific to cancer stem cell microniches

Lee

Choi

et al. 2022

Nat Commun

View full text Add to dashboard Cite

Epitranscriptomic features, such as single-base RNA editing, are sources of transcript diversity in cancer, but little is understood in terms of their spatial context in the tumour microenvironment. Here, we introduce spatial-histopathological examination-linked epitranscriptomics converged to transcriptomics with sequencing (Select-seq), which isolates regions of interest from immunofluorescence-stained tissue and obtains transcriptomic and epitranscriptomic data. With Select-seq, we analyse the cancer stem cell-like microniches in relation to the tumour microenvironment of triple-negative breast cancer patients. We identify alternative splice variants, perform complementarity-determining region analysis of infiltrating T cells and B cells, and assess adenosine-to-inosine base editing in tumour tissue sections. Especially, in triple-negative breast cancer microniches, adenosine-to-inosine editome specific to different microniche groups is identified.

show abstract

High-throughput retrieval of physical DNA for NGS-identifiable clones in phage display library

Noh

Kim

Jung

et al. 2019

mAbs

View full text Add to dashboard Cite

In antibody discovery, in-depth analysis of an antibody library and high-throughput retrieval of clones in the library are crucial to identifying and exploiting rare clones with different properties. However, existing methods have technical limitations, such as low process throughput from the laborious cloning process and waste of the phenotypic screening capacity from unnecessary repetitive tests on the dominant clones. To overcome the limitations, we developed a new high-throughput platform for the identification and retrieval of clones in the library, TrueRepertoire™. This new platform provides highly accurate sequences of the clones with linkage information between heavy and light chains of the antibody fragment. Additionally, the physical DNA of clones can be retrieved in high throughput based on the sequence information. We validated the high accuracy of the sequences and demonstrated that there is no platform-specific bias. Moreover, the applicability of TrueRepertoire™ was demonstrated by a phage-displayed single-chain variable fragment library targeting human hepatocyte growth factor protein.

show abstract

Barcode-free next-generation sequencing error validation for ultra-rare variant detection

Yeom

Lee

Ryu³

et al. 2019

Nat Commun

View full text Add to dashboard Cite

The advent of next-generation sequencing (NGS) has accelerated biomedical research by enabling the high-throughput analysis of DNA sequences at a very low cost. However, NGS has limitations in detecting rare-frequency variants (< 1%) because of high sequencing errors (> 0.1~1%). NGS errors could be filtered out using molecular barcodes, by comparing read replicates among those with the same barcodes. Accordingly, these barcoding methods require redundant reads of non-target sequences, resulting in high sequencing cost. Here, we present a cost-effective NGS error validation method in a barcode-free manner. By physically extracting and individually amplifying the DNA clones of erroneous reads, we distinguish true variants of frequency > 0.003% from the systematic NGS error and selectively validate NGS error after NGS. We achieve a PCR-induced error rate of 2.5×10 −6 per base per doubling event, using 10 times less sequencing reads compared to those from previous studies.

show abstract

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Taehoon Ryu

High information capacity DNA-based data storage with augmented encoding characters using degenerate bases

DNA Micro‐Disks for the Management of DNA‐Based Data Storage with Index and Write‐Once–Read‐Many (WORM) Memory Features

A high-throughput optomechanical retrieval method for sequence-verified clonal DNA from the NGS platform

A comparison of the nutrient composition and statistical profile in red pepper fruits (Capsicums annuum L.) based on genetic and environmental factors

Purification of multiplex oligonucleotide libraries by synthesis and selection

Spatial epitranscriptomics reveals A-to-I editome specific to cancer stem cell microniches

High-throughput retrieval of physical DNA for NGS-identifiable clones in phage display library

Barcode-free next-generation sequencing error validation for ultra-rare variant detection

Contact Info

Product

Resources

About