Peng Sun scite author profile

The human genome is diploid, and knowledge of the variants on each chromosome is important for the interpretation of genomic information. Here we report the assembly of a haplotype-resolved diploid genome without using a reference genome. Our pipeline relies on fosmid pooling together with whole-genome shotgun strategies, based solely on next-generation sequencing and hierarchical assembly methods. We applied our sequencing method to the genome of an Asian individual and generated a 5.15-Gb assembled genome with a haplotype N50 of 484 kb. Our analysis identified previously undetected indels and 7.49 Mb of novel coding sequences that could not be aligned to the human reference genome, which include at least six predicted genes. This haplotype-resolved genome represents the most complete de novo human genome assembly to date. Application of our approach to identify individual haplotype differences should aid in translating genotypes to phenotypes for the development of personalized medicine.

show abstract

Disrupted spermatogenesis in a metabolic syndrome model: the role of vitamin A metabolism in the gut–testis axis

Zhang

Sun

Geng

et al. 2021

Gut

115

View full text Add to dashboard Cite

ObjectiveEffects of the diet-induced gut microbiota dysbiosis reach far beyond the gut. We aim to uncover the direct evidence involving the gut–testis axis in the aetiology of impaired spermatogenesis.DesignAn excessive-energy diet-induced metabolic syndrome (MetS) sheep model was established. The testicular samples, host metabolomes and gut microbiome were analysed. Faecal microbiota transplantation (FMT) confirmed the linkage between gut microbiota and spermatogenesis.ResultsWe demonstrated that the number of arrested spermatogonia was markedly elevated by using 10× single-cell RNA-seq in the MetS model. Furthermore, through using metabolomics profiling and 16S rDNA-seq, we discovered that the absorption of vitamin A in the gut was abolished due to a notable reduction of bile acid levels, which was significantly associated with reduced abundance of Ruminococcaceae_NK4A214_group. Notably, the abnormal metabolic effects of vitamin A were transferable to the testicular cells through the circulating blood, which contributed to abnormal spermatogenesis, as confirmed by FMT.ConclusionThese findings define a starting point for linking the testicular function and regulation of gut microbiota via host metabolomes and will be of potential value for the treatment of male infertility in MetS.

show abstract

The expanded human disease network combining protein–protein interaction information

Zhang

Jiang

et al. 2011

Eur J Hum Genet

View full text Add to dashboard Cite

The human disease network (HDN) has become a powerful tool for revealing disease-disease associations. Some studies have shown that genes that share similar or same disease phenotypes tend to encode proteins that interact with each other. Therefore, protein-protein interactions (PPIs) may help us to further understand the relationships between diseases with overlapping clinical phenotypes. In this study, we constructed the expanded HDN (eHDN) by combining disease gene information with PPI information, and analyzed its topological features and functional properties. We found that the network is hierarchical and, most diseases are connected to only a few diseases, whereas a small part of diseases are linked to many different diseases. Diseases in a specific disease class tend to cluster together, and genes associated with the same disease are functionally related. Comparing the eHDN with the original HDN (oHDN, constructed using disease gene information) revealed high consistency over all topological and functional properties. This, to some extent, indicates that our eHDN is reliable. In the eHDN, we found some new associations among diseases resulting from the shared genes interacting with disease genes. The new eHDN will provide a valuable reference for clinicians and medical researchers.

show abstract

Prediction of Human Disease-Related Gene Clusters by Clustering Analysis

Sun¹,

Gao²,

Han³

2011

Int. J. Biol. Sci.

View full text Add to dashboard Cite

Since genes associated with similar diseases/disorders show an increased tendency for their protein products to interact with each other through protein-protein interactions (PPI), clustering analysis obviously as an efficient technique can be easily used to predict human disease-related gene clusters/subnetworks. Firstly, we used clustering algorithms, Markov cluster algorithm (MCL), Molecular complex detection (MCODE) and Clique percolation method (CPM) to decompose human PPI network into dense clusters as the candidates of disease-related clusters, and then a log likelihood model that integrates multiple biological evidences was proposed to score these dense clusters. Finally, we identified disease-related clusters using these dense clusters if they had higher scores. The efficiency was evaluated by a leave-one-out cross validation procedure. Our method achieved a success rate with 98.59% and recovered the hidden disease-related clusters in 34.04% cases when removed one known disease gene and all its gene-disease associations. We found that the clusters decomposed by CPM outperformed MCL and MCODE as the candidates of disease-related clusters with well-supported biological significance in biological process, molecular function and cellular component of Gene Ontology (GO) and expression of human tissues. We also found that most of the disease-related clusters consisted of tissue-specific genes that were highly expressed only in one or several tissues, and a few of those were composed of housekeeping genes (maintenance genes) that were ubiquitously expressed in most of all the tissues.

show abstract

Drug repurposing by integrated literature mining and drug–gene–disease triangulation

Sun

Guo

Winnenburg

et al. 2017

Drug Discovery Today

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Peng Sun

De novo assembly of a haplotype-resolved human genome

Disrupted spermatogenesis in a metabolic syndrome model: the role of vitamin A metabolism in the gut–testis axis

The expanded human disease network combining protein–protein interaction information

Prediction of Human Disease-Related Gene Clusters by Clustering Analysis

Drug repurposing by integrated literature mining and drug–gene–disease triangulation

Contact Info

Product

Resources

About