Gustavo Arango-Argoty scite author profile

Background Growing concerns about increasing rates of antibiotic resistance call for expanded and comprehensive global monitoring. Advancing methods for monitoring of environmental media (e.g., wastewater, agricultural waste, food, and water) is especially needed for identifying potential resources of novel antibiotic resistance genes (ARGs), hot spots for gene exchange, and as pathways for the spread of ARGs and human exposure. Next-generation sequencing now enables direct access and profiling of the total metagenomic DNA pool, where ARGs are typically identified or predicted based on the “best hits” of sequence searches against existing databases. Unfortunately, this approach produces a high rate of false negatives. To address such limitations, we propose here a deep learning approach, taking into account a dissimilarity matrix created using all known categories of ARGs. Two deep learning models, DeepARG-SS and DeepARG-LS, were constructed for short read sequences and full gene length sequences, respectively. Results Evaluation of the deep learning models over 30 antibiotic resistance categories demonstrates that the DeepARG models can predict ARGs with both high precision (> 0.97) and recall (> 0.90). The models displayed an advantage over the typical best hit approach, yielding consistently lower false negative rates and thus higher overall recall (> 0.9). As more data become available for under-represented ARG categories, the DeepARG models’ performance can be expected to be further enhanced due to the nature of the underlying neural networks. Our newly developed ARG database, DeepARG-DB, encompasses ARGs predicted with a high degree of confidence and extensive manual inspection, greatly expanding current ARG repositories. Conclusions The deep learning models developed here offer more accurate antimicrobial resistance annotation relative to current bioinformatics practice. DeepARG does not require strict cutoffs, which enables identification of a much broader diversity of ARGs. The DeepARG models and database are available as a command line version and as a Web service at http://bench.cs.vt.edu/deeparg.
Electronic supplementary material The online version of this article (10.1186/s40168-018-0401-z) contains supplementary material, which is available to authorized users.
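The abstract reports performance as precision and recall and ties higher recall to fewer false negatives. A minimal sketch of how those two metrics follow from prediction counts is given here; the function name and all counts are hypothetical and invented for illustration, and they are not results from the paper.

```python
def precision_recall(tp, fp, fn):
    """Return (precision, recall) from counts of true positives,
    false positives, and false negatives."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical counts (assumptions, not data from the paper):
# a strict method misses many true ARGs, a more permissive one misses few.
strict = precision_recall(tp=60, fp=1, fn=40)
permissive = precision_recall(tp=92, fp=2, fn=8)

print("strict:     precision=%.3f recall=%.3f" % strict)
print("permissive: precision=%.3f recall=%.3f" % permissive)
```

In this toy case both methods keep precision high, but the strict one has far lower recall because its false negatives dominate the denominator of the recall term.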


An in-depth map of polyadenylation sites in cancer

Lin, Li, Kim et al. 2012

129

147


We present a comprehensive map of over 1 million polyadenylation sites and quantify their usage in major cancers and tumor cell lines using direct RNA sequencing. We built the Expression and Polyadenylation Database to enable the visualization of the polyadenylation maps in various cancers and to facilitate the discovery of novel genes and gene isoforms that are potentially important to tumorigenesis. Analyses of polyadenylation sites indicate that a large fraction (∼30%) of mRNAs contain alternative polyadenylation sites in their 3′ untranslated regions, independent of the cell type. The shortest 3′ untranslated region isoforms are preferentially upregulated in cancer tissues, genome-wide. Candidate targets of alternative polyadenylation-mediated upregulation of short isoforms include POLR2K, and signaling cascades of cell–cell and cell–extracellular matrix contact, particularly involving regulators of Rho GTPases. Polyadenylation maps also helped to improve 3′ untranslated region annotations and identify candidate regulatory marks such as sequence motifs, H3K36Me3 and Pabpc1 that are isoform dependent and occur in a position-specific manner. In summary, these results highlight the need to go beyond monitoring only the cumulative transcript levels for a gene, to separately analysing the expression of its RNA isoforms.


DeepARG: A deep learning approach for predicting antibiotic resistance genes from metagenomic data

Arango-Argoty, Garner, Pruden et al. 2017

Preprint

126


Growing concerns regarding increasing rates of antibiotic resistance call for global monitoring efforts. Monitoring of environmental media (e.g., wastewater, agricultural waste, food, and water) is of particular interest as these media can serve as sources of potential novel antibiotic resistance genes (ARGs), as hot spots for ARG exchange, and as pathways for the spread of ARGs and human exposure. Next-generation sequence-based monitoring has recently enabled direct access and profiling of the total metagenomic DNA pool, where ARGs are identified or predicted based on the "best hits" of homology searches against existing databases. Unfortunately, this approach tends to produce high rates of false negatives. To address such limitations, we propose here a deep learning approach, taking into account a dissimilarity matrix created using all known categories of ARGs. Two models, deepARG-SS and deepARG-LS, were constructed for short read sequences and full gene length sequences, respectively. Performance evaluation of the deep learning models over 30 classes of antibiotics demonstrates that the deepARG models can predict ARGs with both high precision (>0.97) and recall (>0.90) for most of the antibiotic resistance categories. The models show an advantage over the traditional best hit approach by having consistently much lower false negative rates and thus higher overall recall (>0.9). As more data become available for under-represented antibiotic resistance categories, the deepARG models' performance can be expected to be further enhanced due to the nature of the underlying neural networks. The deepARG models are available both in command line version and via a Web server at http://bench.cs.vt.edu/deeparg. Our newly developed ARG database, deepARG-DB, containing predicted ARGs with high confidence and high degree of manual curation, greatly expands the current ARG repository. DeepARG-DB can be downloaded freely to benefit community research and future development of antibiotic resistance-related resources.
bioRxiv preprint first posted online Jun. 12, 2017; doi: http://dx.doi.org/10.1101/149328. The copyright holder for this preprint (which was not peer-reviewed) is the author/funder. It is made available under a CC-BY-NC 4.0 International license.
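The "best hit" approach that the abstract contrasts against can be sketched as follows. This is an illustrative reconstruction, not DeepARG code: the `Hit` record, the 80% identity cutoff, and the example hits are all assumptions made for the example.

```python
from collections import namedtuple

# Hypothetical alignment record; field names are assumptions for this sketch.
Hit = namedtuple("Hit", "query reference category identity bitscore")


def best_hit_annotation(hits, min_identity=80.0):
    """Keep the highest-bitscore hit per query, then annotate the query
    only if that hit's percent identity passes the cutoff."""
    best = {}
    for hit in hits:
        if hit.query not in best or hit.bitscore > best[hit.query].bitscore:
            best[hit.query] = hit
    return {
        query: (hit.category if hit.identity >= min_identity else None)
        for query, hit in best.items()
    }


# Invented example hits (not real alignments).
hits = [
    Hit("read1", "refA", "beta-lactam", 95.0, 180.0),
    Hit("read1", "refB", "tetracycline", 60.0, 70.0),
    Hit("read2", "refC", "aminoglycoside", 55.0, 90.0),
]

strict = best_hit_annotation(hits)                      # read2 is left unannotated
relaxed = best_hit_annotation(hits, min_identity=50.0)  # read2 is annotated
print(strict)
print(relaxed)
```

If `read2` were a divergent but genuine ARG, the strict cutoff would turn it into a false negative, which is the failure mode the abstract describes.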


MetaStorm: A Public Resource for Customizable Metagenomics Annotation

et al. 2016


Metagenomics is a trending research area, calling for the need to analyze large quantities of data generated from next generation DNA sequencing technologies. The need to store, retrieve, analyze, share, and visualize such data challenges current online computational systems. Interpretation and annotation of specific information is especially a challenge for metagenomic data sets derived from environmental samples, because current annotation systems only offer broad classification of microbial diversity and function. Moreover, existing resources are not configured to readily address common questions relevant to environmental systems. Here we developed a new online user-friendly metagenomic analysis server called MetaStorm (http://bench.cs.vt.edu/MetaStorm/), which facilitates customization of computational analysis for metagenomic data sets. Users can upload their own reference databases to tailor the metagenomics annotation to focus on various taxonomic and functional gene markers of interest. MetaStorm offers two major analysis pipelines: an assembly-based annotation pipeline and the standard read annotation pipeline used by existing web servers. These pipelines can be selected individually or together. Overall, MetaStorm provides enhanced interactive visualization to allow researchers to explore and manipulate taxonomy and functional annotation at various levels of resolution.


NanoARG: a web service for detecting and contextualizing antimicrobial resistance genes from nanopore-derived metagenomes

et al. 2019


Background Direct and indirect selection pressures imposed by antibiotics and co-selective agents and horizontal gene transfer are fundamental drivers of the evolution and spread of antibiotic resistance. Therefore, effective environmental monitoring tools should ideally capture not only antibiotic resistance genes (ARGs), but also mobile genetic elements (MGEs) and indicators of co-selective forces, such as metal resistance genes (MRGs). A major challenge towards characterizing the potential human health risk of antibiotic resistance is the ability to identify ARG-carrying microorganisms, of which human pathogens are arguably of greatest risk. Historically, short reads produced by next-generation sequencing technologies have hampered confidence in assemblies for achieving these purposes. Results Here, we introduce NanoARG, an online computational resource that takes advantage of the long reads produced by nanopore sequencing technology. Specifically, long nanopore reads enable identification of ARGs in the context of relevant neighboring genes, thus providing valuable insight into mobility, co-selection, and pathogenicity. NanoARG was applied to study a variety of nanopore sequencing data to demonstrate its functionality. NanoARG was further validated through characterizing its ability to correctly identify ARGs in sequences of varying lengths and a range of sequencing error rates. Conclusions NanoARG allows users to upload sequence data online and provides various means to analyze and visualize the data, including quantitative and simultaneous profiling of ARGs, MRGs, MGEs, and putative pathogens. A user-friendly interface lets users analyze long DNA sequences (including assembled contigs), facilitating data processing, analysis, and visualization. NanoARG is publicly available and freely accessible at https://bench.cs.vt.edu/nanoarg.
Electronic supplementary material The online version of this article (10.1186/s40168-019-0703-9) contains supplementary material, which is available to authorized users.


Effects of Dairy Manure-Based Amendments and Soil Texture on Lettuce- and Radish-Associated Microbiota and Resistomes

Guron, Arango-Argoty, Zhang et al. 2019

mSphere


Dairy cattle are routinely treated with antibiotics, and the resulting manure or composted manure is commonly used as a soil amendment for crop production, raising questions regarding the potential for antibiotic resistance to propagate from “farm to fork.” The objective of this study was to compare the microbiota and “resistomes” (i.e., carriage of antibiotic resistance genes [ARGs]) associated with lettuce leaf and radish taproot surfaces grown in different soils amended with dairy manure, compost, or chemical fertilizer only (control). Manure was collected from antibiotic-free dairy cattle (DC) or antibiotic-treated dairy cattle (DA), with a portion composted for parallel comparison. Amendments were applied to loamy sand or silty clay loam, and lettuce and radishes were cultivated to maturity in a greenhouse. Metagenomes were profiled via shotgun Illumina sequencing. Radishes carried a distinct ARG composition compared to that of lettuce, with greater relative abundance of total ARGs. Taxonomic species richness was also greater for radishes by 1.5-fold. The resistomes of lettuce grown with DC compost were distinct from those grown with DA compost, DC manure, or fertilizer only. Further, compost applied to loamy sand resulted in twofold-greater relative abundance of total ARGs on lettuce than when applied to silty clay loam. The resistomes of radishes grown with biological amendments were distinct from the corresponding fertilizer controls, but effects of composting or antibiotic use were not measurable. Cultivation in loamy sand resulted in higher species richness for both lettuce and radishes than when grown in silty clay loam by 2.2-fold and 1.2-fold, respectively, when amended with compost.
IMPORTANCE A controlled, integrated, and replicated greenhouse study, along with comprehensive metagenomic analysis, revealed that multiple preharvest factors, including antibiotic use during manure collection, composting, biological soil amendment, and soil type, influence vegetable-borne resistomes. Here, radishes, a root vegetable, carried a greater load of ARGs and species richness than lettuce, a leafy vegetable. However, the lettuce resistome was more noticeably influenced by upstream antibiotic use and composting. Network analysis indicated that cooccurring ARGs and mobile genetic elements were almost exclusively associated with conditions receiving raw manure amendments, suggesting that composting could alleviate the mobility of manure-derived resistance traits. Effects of preharvest factors on associated microbiota and resistomes of vegetables eaten raw are worthy of further examination in terms of potential influence on human microbiomes and spread of antibiotic resistance. This research takes a step toward identifying on-farm management practices that can help mitigate the spread of agricultural sources of antibiotic resistance.


Identification of discriminatory antibiotic resistance genes among environmental resistomes using extremely randomized tree algorithm

et al. 2019


Background The interconnectivities of built and natural environments can serve as conduits for the proliferation and dissemination of antibiotic resistance genes (ARGs). Several studies have compared the broad spectrum of ARGs (i.e., “resistomes”) in various environmental compartments, but there is a need to identify unique ARG occurrence patterns (i.e., “discriminatory ARGs”), characteristic of each environment. Such an approach will help to identify factors influencing ARG proliferation, facilitate development of relative comparisons of the ARGs distinguishing various environments, and help pave the way towards ranking environments based on their likelihood of contributing to the spread of clinically relevant antibiotic resistance. Here we formulate and demonstrate an approach using an extremely randomized tree (ERT) algorithm combined with a Bayesian optimization technique to capture ARG variability in environmental samples and identify the discriminatory ARGs. The potential of ERT for identifying discriminatory ARGs was first evaluated using in silico metagenomic datasets (simulated metagenomic Illumina sequencing data) with known variability. The application of ERT was then demonstrated through analyses using publicly available and in-house metagenomic datasets associated with (1) different aquatic habitats (e.g., river, wastewater influent, hospital effluent, and dairy farm effluent) to compare resistomes between distinct environments and (2) different river samples (i.e., Amazon, Kalamas, and Cam Rivers) to compare resistome characteristics of similar environments. Results The approach was found to readily identify discriminatory ARGs in the in silico datasets. Also, it was not found to be biased towards ARGs with high relative abundance, which is a common limitation of feature projection methods, and instead only captured those ARGs that elicited significant profiles. 
Analyses of publicly available metagenomic datasets further demonstrated that the ERT approach can effectively differentiate real-world environmental samples and identify discriminatory ARGs based on pre-defined categorizing schemes. Conclusions Here a new methodology was formulated to characterize and compare variances in ARG profiles between metagenomic data sets derived from similar/dissimilar environments. Specifically, discriminatory ARGs among samples representing various environments can be identified based on factors of interest. The methodology could prove to be a particularly useful tool for ARG surveillance and the assessment of the effectiveness of strategies for mitigating the spread of antibiotic resistance. The python package is hosted in the Git repository: https://github.com/gaarangoa/ExtrARG
Electronic supplementary material The online version of this article (10.1186/s40168-019-0735-1) contains supplementary material, which is available to authorized users.
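The central idea behind extremely randomized trees, drawing split thresholds at random instead of optimizing them and ranking features by the impurity they remove, can be illustrated with a small standard-library sketch. This is a simplified stand-in and not the ExtrARG implementation: it scores single random splits rather than growing trees, omits the Bayesian optimization step, and every ARG name and abundance value is invented.

```python
import random


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))


def random_split_importance(samples, labels, n_draws=200, seed=0):
    """Score each feature by its mean Gini decrease over random thresholds."""
    rng = random.Random(seed)
    parent = gini(labels)
    n = len(labels)
    scores = {}
    for feature in samples[0]:
        values = [s[feature] for s in samples]
        lo, hi = min(values), max(values)
        total = 0.0
        for _ in range(n_draws):
            t = rng.uniform(lo, hi)  # threshold drawn at random, not optimized
            left = [y for v, y in zip(values, labels) if v <= t]
            right = [y for v, y in zip(values, labels) if v > t]
            if not left or not right:
                continue  # degenerate split: counts as zero decrease
            child = len(left) / n * gini(left) + len(right) / n * gini(right)
            total += parent - child
        scores[feature] = total / n_draws
    return scores


# Invented relative abundances: argA differs by environment, argB is
# abundant everywhere, argC is low-level noise.
samples = [
    {"argA": 0.010, "argB": 5.00, "argC": 0.30},
    {"argA": 0.020, "argB": 5.10, "argC": 0.10},
    {"argA": 0.015, "argB": 4.90, "argC": 0.25},
    {"argA": 0.012, "argB": 5.20, "argC": 0.15},
    {"argA": 0.200, "argB": 5.05, "argC": 0.12},
    {"argA": 0.250, "argB": 4.95, "argC": 0.28},
    {"argA": 0.220, "argB": 5.15, "argC": 0.18},
    {"argA": 0.300, "argB": 5.00, "argC": 0.22},
]
labels = ["river"] * 4 + ["wastewater"] * 4

scores = random_split_importance(samples, labels)
top = max(scores, key=scores.get)
print(top, scores)
```

In this toy data the highly abundant `argB` receives a low score because its abundance does not separate the two environments, which mirrors the abstract's point that the approach is not biased towards ARGs with high relative abundance.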


Apigenin Impacts the Growth of the Gut Microbiota and Alters the Gene Expression of Enterococcus

et al. 2017


Apigenin is a major dietary flavonoid with many bioactivities, widely distributed in plants. Apigenin reaches the colon region intact and interacts there with the human gut microbiota; however, there is little research on how apigenin affects the gut bacteria. This study investigated the effect of pure apigenin on human gut bacteria, at both the single strain and community levels. The effect of apigenin on the single gut bacteria strains Bacteroides galacturonicus, Bifidobacterium catenulatum, Lactobacillus rhamnosus GG, and Enterococcus caccae was examined by measuring their anaerobic growth profiles. The effect of apigenin on a gut microbiota community was studied by culturing a fecal inoculum under in vitro conditions simulating the human ascending colon. 16S rRNA gene sequencing and GC-MS analysis quantified changes in the community structure. Single molecule RNA sequencing was used to reveal the response of Enterococcus caccae to apigenin. Enterococcus caccae was effectively inhibited by apigenin when cultured alone; however, the genus Enterococcus was enhanced when tested in a community setting. Single molecule RNA sequencing found that Enterococcus caccae responded to apigenin by up-regulating genes involved in DNA repair, stress response, cell wall synthesis, and protein folding. Taken together, these results demonstrate that apigenin affects both the growth and gene expression of Enterococcus caccae.

