With the application and development of high-throughput sequencing technology in life and health sciences, massive multi-omics data brings the problem of efficient management and utilization. Database development and biocuration are the prerequisites for the reuse of these big data. Here, relying on China National GeneBank (CNGB), we present CNGB Sequence Archive (CNSA) for archiving omics data, including raw sequencing data and its further analyzed results which are organized into six objects, namely Project, Sample, Experiment, Run, Assembly and Variation at present. Moreover, CNSA has created a correlation model of living samples, sample information and analytical data on some projects. Both living samples and analytical data are directly correlated with the sample information. From either one, information or data of the other two can be obtained, so that all data can be traced throughout the life cycle from the living sample to the sample information to the analytical data. Complying with the data standards commonly used in the life sciences, CNSA is committed to building a comprehensive and curated data repository for storing, managing and sharing of omics data. We will continue to improve the data standards and provide free access to open-data resources for worldwide scientific communities to support academic research and the bio-industry.
Database URL: https://db.cngb.org/cnsa/.
Insecticidal proteins from Bacillus thuringiensis (Bt) are widely used to control insect pests, but their efficacy is reduced when pests evolve resistance. We report on a novel allele (r16) of the cadherin gene (PgCad1) in pink bollworm (Pectinophora gossypiella) associated with resistance to Bt toxin Cry1Ac, which is produced by transgenic cotton. The r16 allele isolated from a field population in China has 1545 base pairs of a degenerate transposon inserted in exon 20 of PgCad1, which generates a mis-spliced transcript containing a premature stop codon. A strain homozygous for r16 had 300-fold resistance to Cry1Ac, 2.6-fold cross-resistance to Cry2Ab, and completed its life cycle on transgenic Bt cotton producing Cry1Ac. Inheritance of Cry1Ac resistance was recessive and tightly linked with r16. Compared with transfected insect cells expressing wild-type PgCad1, cells expressing r16 were less susceptible to Cry1Ac. Recombinant cadherin protein was transported to the cell membrane in cells transfected with the wild-type PgCad1 allele, but not in cells transfected with r16. Cadherin occurred on brush border membrane vesicles (BBMVs) in the midgut of susceptible larvae, but not resistant larvae. These results imply that the r16 allele mediates Cry1Ac resistance in pink bollworm by interfering with the localization of cadherin.
The authors have withdrawn their manuscript whilst they perform additional experiments to test some of their conclusions further. Therefore, the authors do not wish this work to be cited as reference for the project. If you have any questions, please contact the corresponding author.
BackgroundTargeted next-generation sequencing (NGS) is a cost-effective approach for rapid and accurate detection of genetic mutations in patients with suspected genetic disorders, which can facilitate effective diagnosis.Methodology/Principal FindingsWe designed a capture array to mainly capture all the coding sequence (CDS) of 2,181 genes associated with 561 Mendelian diseases and conducted NGS to detect mutations. The accuracy of NGS was 99.95%, which was obtained by comparing the genotypes of selected loci between our method and SNP Array in four samples from normal human adults. We also tested the stability of the method using a sample from normal human adults. The results showed that an average of 97.79% and 96.72% of single-nucleotide variants (SNVs) in the sample could be detected stably in a batch and different batches respectively. In addition, the method could detect various types of mutations. Some disease-causing mutations were detected in 69 clinical cases, including 62 SNVs, 14 insertions and deletions (Indels), 1 copy number variant (CNV), 1 microdeletion and 2 microduplications of chromosomes, of which 35 mutations were novel. Mutations were confirmed by Sanger sequencing or real-time polymerase chain reaction (PCR).Conclusions/SignificanceResults of the evaluation showed that targeted NGS enabled to detect disease-causing mutations with high accuracy, stability, speed and throughput. Thus, the technology can be used for the clinical diagnosis of 561 Mendelian diseases.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.