Jihua Feng scite author profile

BackgroundMost genes are not affected when any transcription factor (TF) is knocked out, indicating that they have robust transcriptional regulatory program. Yet the mechanism underlying robust transcriptional regulatory program is less clear.ResultsHere, we studied the cause and effect of robust transcriptional regulatory program. We found that cooperative TFs in the robust transcriptional regulatory program regulate their common target genes in an activity-redundant fashion, and they are able to compensate for each other's loss. As a result, their target genes are insensitive to their single perturbation. We next revealed that the degree of robustness of transcriptional regulatory program influences gene expression variability. Genes with fragile (unrobust) transcriptional regulatory program under normal growth condition could be readily reprogrammed to significantly modulate gene expression upon changing conditions. They also have high evolutionary rates of gene expression. We further showed that the fragile transcriptional regulatory program is a major source of expression variability.ConclusionWe showed that activity-redundant TFs guarantee the robustness of transcriptional regulatory programs, and the fragility of transcriptional regulatory program plays a major role in gene expression variability. These findings reveal the mechanisms underlying robust transcription and expression variability.

show abstract

A Novel Molecular Representation Learning for Molecular Property Prediction with a Multiple SMILES-Based Augmentation

Feng

Liu

et al. 2022

Computational Intelligence and Neuroscience

View full text Add to dashboard Cite

Deep learning has brought a rapid development in the aspect of molecular representation for various tasks, such as molecular property prediction. The prediction of molecular properties is a crucial task in the field of drug discovery for finding specific drugs with good pharmacological activity and pharmacokinetic properties. SMILES string is always used as a kind of character approach in deep neural network models, inspired by natural language processing techniques. However, the deep learning models are hindered by the nonunique nature of the SMILES string. To efficiently learn molecular features along all message paths, in this paper we encode multiple SMILES for every molecule as an automated data augmentation for the prediction of molecular properties, which alleviates the overfitting problem caused by the small amount of data in the datasets of molecular property prediction. As a result, by using the multiple SMILES-based augmentation, we obtained better molecular representation and showed superior performance in the tasks of predicting molecular properties.

show abstract

Cladistic analysis of iridoviruses based on protein and DNA sequences

Wang

Deng

Wang

et al. 2003

Archives of Virology

View full text Add to dashboard Cite

Cladograms of iridoviruses were inferred from bootstrap analysis of molecular data sets comprising all published protein and DNA sequences of the major capsid protein, ATPase and DNA polymerase genes of members of the Iridoviridae family Iridovirus. All data sets yielded cladograms supporting the separation of the Iridovirus, Ranavirus and Lymphocystivirus genera, and the cladogram based on data derived from major capsid proteins further divided both the Iridovirus and Ranavirus genera into two groups. Tests of alternative hypotheses of topological constraints were also performed to further investigate relationships between infectious spleen and kidney necrosis virus (ISKNV), an unclassified fish iridovirus for which the complete genome sequence data is available, and other iridoviruses. Cladograms inferred and results of Shimodaira-Hasegawa tests indicated that ISKNV is more closely related to the Ranavirus genus than it is to the other genera of the family.

show abstract

New insights into two distinct nucleosome distributions: comparison of cross-platform positioning datasets in the yeast genome

et al. 2010

View full text Add to dashboard Cite

BackgroundRecently, a number of high-resolution genome-wide maps of nucleosome locations in S. cerevisiae have been derived experimentally. However, nucleosome positions are determined in vivo by the combined effects of numerous factors. Consequently, nucleosomes are not simple static units, which may explain the discrepancies in reported nucleosome positions as measured by different experiments. In order to more accurately depict the genome-wide nucleosome distribution, we integrated multiple nucleosomal positioning datasets using a multi-angle analysis strategy.ResultsTo evaluate the contribution of chromatin structure to transcription, we used the vast amount of available nucleosome analyzed data. Analysis of this data allowed for the comprehensive identification of the connections between promoter nucleosome positioning patterns and various transcription-dependent properties. Further, we characterised the function of nucleosome destabilisation in the context of transcription regulation. Our results indicate that genes with similar nucleosome occupancy patterns share general transcription attributes. We identified the local regulatory correlation (LRC) regions for two distinct types of nucleosomes and we assessed their regulatory properties. We also estimated the nucleosome reproducibility and measurement accuracy for high-confidence transcripts. We found that by maintaining a distance of ~13 bp between the upstream border of the +1 nucleosome and the transcription start sites (TSSs), the stable +1 nucleosome may form a barrier against the accessibility of the TSS and shape an optimum chromatin conformation for gene regulation. An in-depth analysis of nucleosome positioning in normally growing and heat shock cells suggested that the extent and patterns of nucleosome sliding are associated with gene activation.ConclusionsOur results, which combine different types of data, suggest that cross-platform information, including discrepancy and consistency, reflects the mechanisms of nucleosome packaging in vivo more faithfully than individual studies. Furthermore, nucleosomes can be divided into two classes according to their stable and dynamic characteristics. We found that two different nucleosome-positioning characteristics may significantly impact transcription programs. Besides, some positioned-nucleosomes are involved in the transition from stable state to dynamic state in response to abrupt environmental changes.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Jihua Feng

Missing value imputation for microarray gene expression data using histone acetylation information

Robustness of transcriptional regulatory program influences gene expression variability

A Novel Molecular Representation Learning for Molecular Property Prediction with a Multiple SMILES-Based Augmentation

Cladistic analysis of iridoviruses based on protein and DNA sequences

New insights into two distinct nucleosome distributions: comparison of cross-platform positioning datasets in the yeast genome

Contact Info

Product

Resources

About