MotivationAnnotation of enzyme function has a broad range of applications, such as metagenomics, industrial biotechnology, and diagnosis of enzyme deficiency-caused diseases. However, the time and resource required make it prohibitively expensive to experimentally determine the function of every enzyme. Therefore, computational enzyme function prediction has become increasingly important. In this paper, we develop such an approach, determining the enzyme function by predicting the Enzyme Commission number.ResultsWe propose an end-to-end feature selection and classification model training approach, as well as an automatic and robust feature dimensionality uniformization method, DEEPre, in the field of enzyme function prediction. Instead of extracting manually crafted features from enzyme sequences, our model takes the raw sequence encoding as inputs, extracting convolutional and sequential features from the raw encoding based on the classification result to directly improve the prediction performance. The thorough cross-fold validation experiments conducted on two large-scale datasets show that DEEPre improves the prediction performance over the previous state-of-the-art methods. In addition, our server outperforms five other servers in determining the main class of enzymes on a separate low-homology dataset. Two case studies demonstrate DEEPre’s ability to capture the functional difference of enzyme isoforms.Availability and implementationThe server could be accessed freely at http://www.cbrc.kaust.edu.sa/DEEPre.Supplementary information
Supplementary data are available at Bioinformatics online.
The maintenance of terminally differentiated cells, especially hepatocytes, in vitro has proven challenging. Here we demonstrated the long-term in vitro maintenance of primary human hepatocytes (PHHs) by modulating cell signaling pathways with a combination of five chemicals (5C). 5C-cultured PHHs showed global gene expression profiles and hepatocyte-specific functions resembling those of freshly isolated counterparts. Furthermore, these cells efficiently recapitulated the entire course of hepatitis B virus (HBV) infection over 4 weeks with the production of infectious viral particles and formation of HBV covalently closed circular DNA. Our study demonstrates that, with a chemical approach, functional maintenance of PHHs supports long-term HBV infection in vitro, providing an efficient platform for investigating HBV cell biology and antiviral drug screening.
A comprehensive reference map of all cell types in the human body is necessary for improving our understanding of fundamental biological processes and in diagnosing and treating disease. Highthroughput single-cell RNA sequencing techniques have emerged as powerful tools to identify and characterize cell types in complex and heterogeneous tissues. However, extracting intact cells from tissues and organs is often technically challenging or impossible, for example in heart or brain tissue. Single-nucleus RNA sequencing provides an alternative way to obtain transcriptome profiles of such tissues. To systematically assess the differences between high-throughput single-cell and single-nuclei RNA-seq approaches, we compared Drop-seq and DroNc-seq, two microfluidic-based 3′ RnA capture technologies that profile total cellular and nuclear RNA, respectively, during a time course experiment of human induced pluripotent stem cells (iPSCs) differentiating into cardiomyocytes. Clustering of timeseries transcriptomes from Drop-seq and DroNc-seq revealed six distinct cell types, five of which were found in both techniques. Furthermore, single-cell trajectories reconstructed from both techniques reproduced expected differentiation dynamics. We then applied DroNc-seq to postmortem heart tissue to test its performance on heterogeneous human tissue samples. Our data confirm that DroNcseq yields similar results to Drop-seq on matched samples and can be successfully used to generate reference maps for the human cell atlas.
Highlights d Ovarian tumors separate into two groups: high T and low T cell infiltration groups d Granulysin-expressing CD4 + T cells are present in the high T infiltration cell group d MKI67-expressing plasmablasts are identified in high T cell infiltration tumors d Correlation of CD8 and Tox in this study with immunoreactive subtype in TCGA data
Direct lineage reprogramming, including with small molecules, has emerged as a promising approach for generating desired cell types. We recently found that during chemical induction of induced pluripotent stem cells (iPSCs) from mouse fibroblasts, cells pass through an extra-embryonic endoderm (XEN)-like state. Here, we show that these chemically induced XEN-like cells can also be induced to directly reprogram into functional neurons, bypassing the pluripotent state. The induced neurons possess neuron-specific expression profiles, form functional synapses in culture, and further mature after transplantation into the adult mouse brain. Using similar principles, we were also able to induce hepatocyte-like cells from the XEN-like cells. Cells in the induced XEN-like state were readily expandable over at least 20 passages and retained genome stability and lineage specification potential. Our study therefore establishes a multifunctional route for chemical lineage reprogramming and may provide a platform for generating a diverse range of cell types via application of this expandable XEN-like state.
We have developed Lynx (http://lynx.ci.uchicago.edu)—a web-based database and a knowledge extraction engine, supporting annotation and analysis of experimental data and generation of weighted hypotheses on molecular mechanisms contributing to human phenotypes and disorders of interest. Its underlying knowledge base (LynxKB) integrates various classes of information from >35 public databases and private collections, as well as manually curated data from our group and collaborators. Lynx provides advanced search capabilities and a variety of algorithms for enrichment analysis and network-based gene prioritization to assist the user in extracting meaningful knowledge from LynxKB and experimental data, whereas its service-oriented architecture provides public access to LynxKB and its analytical tools via user-friendly web services and interfaces.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.