Hantao Chen scite author profile

Cancer class discovery using biomolecular data is one of the most important tasks for cancer diagnosis and treatment. Tumor clustering from gene expression data provides a new way to perform cancer class discovery. Most of the existing research works adopt single-clustering algorithms to perform tumor clustering is from biomolecular data that lack robustness, stability, and accuracy. To further improve the performance of tumor clustering from biomolecular data, we introduce the fuzzy theory into the cluster ensemble framework for tumor clustering from biomolecular data, and propose four kinds of hybrid fuzzy cluster ensemble frameworks (HFCEF), named as HFCEF-I, HFCEF-II, HFCEF-III, and HFCEF-IV, respectively, to identify samples that belong to different types of cancers. The difference between HFCEF-I and HFCEF-II is that they adopt different ensemble generator approaches to generate a set of fuzzy matrices in the ensemble. Specifically, HFCEF-I applies the affinity propagation algorithm (AP) to perform clustering on the sample dimension and generates a set of fuzzy matrices in the ensemble based on the fuzzy membership function and base samples selected by AP. HFCEF-II adopts AP to perform clustering on the attribute dimension, generates a set of subspaces, and obtains a set of fuzzy matrices in the ensemble by performing fuzzy c-means on subspaces. Compared with HFCEF-I and HFCEF-II, HFCEF-III and HFCEF-IV consider the characteristics of HFCEF-I and HFCEF-II. HFCEF-III combines HFCEF-I and HFCEF-II in a serial way, while HFCEF-IV integrates HFCEF-I and HFCEF-II in a concurrent way. HFCEFs adopt suitable consensus functions, such as the fuzzy c-means algorithm or the normalized cut algorithm (Ncut), to summarize generated fuzzy matrices, and obtain the final results. The experiments on real data sets from UCI machine learning repository and cancer gene expression profiles illustrate that 1) the proposed hybrid fuzzy cluster ensemble frameworks work well on real data sets, especially biomolecular data, and 2) the proposed approaches are able to provide more robust, stable, and accurate results when compared with the state-of-the-art single clustering algorithms and traditional cluster ensemble approaches.

show abstract

Electrooxidative Metal‐Free Cyclization of 4‐Arylaminocoumarins with DMF as C1‐Source

Weng

Chen

et al. 2021

Adv Synth Catal

View full text Add to dashboard Cite

An environmentally‐benign electrochemical approach for the construction of quinoline derivatives employing N,N‐dimethylformamide (DMF) as the methine source has been devised by cyclization of 4‐(phenylamino)‐2H‐chromen‐2‐ones. In a user‐friendly undivided cell, 6H‐chromeno[4,3‐b]quinolin‐6‐ones were obtained under chemical oxidant‐free and transition‐metal‐free conditions in 43–92% yields with high functional tolerance.

show abstract

Tandem Electrochemical Oxidative Azidation/Heterocyclization of Tryptophan‐Containing Peptides under Buffer Conditions

Weng

Chen

et al. 2022

Angew Chem Int Ed

View full text Add to dashboard Cite

As the aromatic tryptophan (Trp) side chain plays a pivotal role in influencing the structure and function of peptides and proteins, it has become an attractive target for the late-stage modification of these important biomolecules. Herein, we report an electrochemical approach for late-stage functionalization of peptides containing a Trp side chain through manganese-catalyzed tandem radical azidation/heterocyclization. This electrochemical oxidative strategy provides access to azide-substituted tetrazolo[1,5-a]indole-containing peptides with broad functional group tolerance, high site selectivity, and good yields of products (up to 87 %) under mild buffer conditions. Moreover, the modified Trp-containing peptides bearing an azide functionality are promising building blocks, paving the way for the construction of various derivatives, such as "click" chemistry products.

show abstract

Self-templated formation of hierarchically yolk–shell-structured ZnS/NC dodecahedra with superior lithium storage properties

et al. 2021

View full text Add to dashboard Cite

show abstract

Hybrid <inline-formula> <tex-math notation="LaTeX">$k$ </tex-math> </inline-formula>-Nearest Neighbor Classifier

Chen

Liuxs

et al. 2016

IEEE Trans. Cybern.

104

View full text Add to dashboard Cite

Conventional k -nearest neighbor (KNN) classification approaches have several limitations when dealing with some problems caused by the special datasets, such as the sparse problem, the imbalance problem, and the noise problem. In this paper, we first perform a brief survey on the recent progress of the KNN classification approaches. Then, the hybrid KNN (HBKNN) classification approach, which takes into account the local and global information of the query sample, is designed to address the problems raised from the special datasets. In the following, the random subspace ensemble framework based on HBKNN (RS-HBKNN) classifier is proposed to perform classification on the datasets with noisy attributes in the high-dimensional space. Finally, the nonparametric tests are proposed to be adopted to compare the proposed method with other classification approaches over multiple datasets. The experiments on the real-world datasets from the Knowledge Extraction based on Evolutionary Learning dataset repository demonstrate that RS-HBKNN works well on real datasets, and outperforms most of the state-of-the-art classification approaches.

show abstract

Adaptive Fuzzy Consensus Clustering Framework for Clustering Analysis of Cancer Data

Chen

You

et al. 2015

IEEE/ACM Trans. Comput. Biol. and Bioinf.

View full text Add to dashboard Cite

Performing clustering analysis is one of the important research topics in cancer discovery using gene expression profiles, which is crucial in facilitating the successful diagnosis and treatment of cancer. While there are quite a number of research works which perform tumor clustering, few of them considers how to incorporate fuzzy theory together with an optimization process into a consensus clustering framework to improve the performance of clustering analysis. In this paper, we first propose a random double clustering based cluster ensemble framework (RDCCE) to perform tumor clustering based on gene expression data. Specifically, RDCCE generates a set of representative features using a randomly selected clustering algorithm in the ensemble, and then assigns samples to their corresponding clusters based on the grouping results. In addition, we also introduce the random double clustering based fuzzy cluster ensemble framework (RDCFCE), which is designed to improve the performance of RDCCE by integrating the newly proposed fuzzy extension model into the ensemble framework. RDCFCE adopts the normalized cut algorithm as the consensus function to summarize the fuzzy matrices generated by the fuzzy extension models, partition the consensus matrix, and obtain the final result. Finally, adaptive RDCFCE (A-RDCFCE) is proposed to optimize RDCFCE and improve the performance of RDCFCE further by adopting a self-evolutionary process (SEPP) for the parameter set. Experiments on real cancer gene expression profiles indicate that RDCFCE and A-RDCFCE works well on these data sets, and outperform most of the state-of-the-art tumor clustering algorithms.

show abstract

12 3 4

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Hantao Chen

2D conductive MOFs with sufficient redox sites: reduced graphene oxide/Cu-benzenehexathiolate composites as high capacity anode materials for lithium-ion batteries

Peptide late-stage C(sp³)–H arylation by native asparagine assistance without exogenous directing groups

Hybrid Fuzzy Cluster Ensemble Framework for Tumor Clustering from Biomolecular Data

Electrooxidative Metal‐Free Cyclization of 4‐Arylaminocoumarins with DMF as C1‐Source

Tandem Electrochemical Oxidative Azidation/Heterocyclization of Tryptophan‐Containing Peptides under Buffer Conditions

Self-templated formation of hierarchically yolk–shell-structured ZnS/NC dodecahedra with superior lithium storage properties

Hybrid <inline-formula> <tex-math notation="LaTeX">$k$ </tex-math> </inline-formula>-Nearest Neighbor Classifier

Adaptive Fuzzy Consensus Clustering Framework for Clustering Analysis of Cancer Data

Contact Info

Product

Resources

About

Hantao Chen

2D conductive MOFs with sufficient redox sites: reduced graphene oxide/Cu-benzenehexathiolate composites as high capacity anode materials for lithium-ion batteries

Peptide late-stage C(sp3)–H arylation by native asparagine assistance without exogenous directing groups

Hybrid Fuzzy Cluster Ensemble Framework for Tumor Clustering from Biomolecular Data

Electrooxidative Metal‐Free Cyclization of 4‐Arylaminocoumarins with DMF as C1‐Source

Tandem Electrochemical Oxidative Azidation/Heterocyclization of Tryptophan‐Containing Peptides under Buffer Conditions

Self-templated formation of hierarchically yolk–shell-structured ZnS/NC dodecahedra with superior lithium storage properties

Hybrid <inline-formula> <tex-math notation="LaTeX">$k$ </tex-math> </inline-formula>-Nearest Neighbor Classifier

Adaptive Fuzzy Consensus Clustering Framework for Clustering Analysis of Cancer Data

Contact Info

Product

Resources

About

Peptide late-stage C(sp³)–H arylation by native asparagine assistance without exogenous directing groups