Implicit discourse relation recognition (IDRR) is a critical task in discourse analysis. Previous studies regard it only as a classification task and lack an in-depth understanding of the semantics of different relations. Therefore, we first view IDRR as a generation task and further propose a method that jointly models classification and generation. Specifically, we propose a joint model, CG-T5, that recognizes the relation label and simultaneously generates a target sentence containing the meaning of the relation. Furthermore, we design three target-sentence forms, including a question form, to incorporate prior knowledge into the generation model. To address the issue that large discourse units can hardly be embedded into the target sentence, we also propose a target-sentence construction mechanism that automatically extracts core sentences from such large discourse units. Experimental results on both the Chinese MCDTB and English PDTB datasets show that our model CG-T5 outperforms several state-of-the-art systems.
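As a rough illustration of the classification-as-generation idea, the sketch below turns an IDRR example into a (source, target) text pair in which the target sentence verbalizes the relation label, so a seq2seq model such as T5 can be trained on it. The task prefix, template wording, and label set here are hypothetical assumptions for illustration, not the paper's actual target-sentence forms.

```python
# Hypothetical sketch: casting IDRR as text-to-text generation. The templates
# and the "recognize relation:" prefix below are illustrative assumptions.

RELATION_TEMPLATES = {
    "Comparison": 'The relation between "{a1}" and "{a2}" is one of comparison.',
    "Causation": '"{a1}" happens, so "{a2}" follows as a result.',
    "Expansion": '"{a2}" further elaborates on "{a1}".',
}

def build_example(arg1: str, arg2: str, label: str) -> tuple[str, str]:
    """Return a (source, target) pair for seq2seq training."""
    source = f"recognize relation: {arg1} </s> {arg2}"
    target = RELATION_TEMPLATES[label].format(a1=arg1, a2=arg2)
    return source, target
```

During training the model decodes the target sentence; at inference time, the predicted relation label can be read back off the generated sentence.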
A large number of inorganic and organic compounds are able to bind DNA and form complexes, among which drug-related molecules are important. Chromatin accessibility changes not only directly affect drug–DNA interactions, but can also promote or inhibit the expression of critical genes associated with drug resistance by affecting the DNA binding capacity of TFs and transcriptional regulators. However, the biological experimental techniques for measuring chromatin accessibility are expensive and time-consuming. In recent years, several kinds of computational methods have been proposed to identify accessible regions of the genome, but existing models mostly ignore the contextual information provided by the bases in gene sequences. To address these issues, we propose a new solution called SemanticCAP. It introduces a gene language model that models the context of gene sequences and is thus able to provide an effective representation of a given site in a gene sequence. We merge the features provided by the gene language model into our chromatin accessibility model, and design two methods, SFA and SFC, to make the feature fusion smoother. On public benchmarks, our model outperforms DeepSEA, gkm-SVM, and k-mer-based baselines, with a maximum improvement of 1.25% in auROC and 2.41% in auPRC.
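The abstract does not spell out how SFA and SFC work, so the sketch below shows one plausible way to make fusion of two feature streams "smooth": layer-normalize each stream before mixing so that neither the sequence features nor the language-model features dominate. The mixing weight and the normalization choice are assumptions, not SemanticCAP's actual design.

```python
import numpy as np

# Hypothetical feature-fusion sketch (NOT the actual SFA/SFC): each stream is
# layer-normalized, then the two are combined with a fixed mixing weight.

def layer_norm(x: np.ndarray, eps: float = 1e-5) -> np.ndarray:
    """Normalize each feature vector to zero mean and unit variance."""
    mu = x.mean(axis=-1, keepdims=True)
    sigma = x.std(axis=-1, keepdims=True)
    return (x - mu) / (sigma + eps)

def smooth_fuse(seq_feats: np.ndarray, lm_feats: np.ndarray,
                alpha: float = 0.5) -> np.ndarray:
    """Mix two same-shaped feature streams after normalizing each."""
    return alpha * layer_norm(seq_feats) + (1 - alpha) * layer_norm(lm_feats)
```

In a real model the mixing weight would typically be learned rather than fixed, but the normalize-then-mix pattern is a common way to reconcile features from separately trained encoders.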
Hierarchically constructing micro-level (i.e., intra-sentence or inter-sentence) discourse structure trees using explicit boundaries (e.g., sentence and paragraph boundaries) has proved to be an effective strategy. However, this strategy is difficult to apply to document-level macro (i.e., inter-paragraph) discourse parsing, a more challenging task, due to the lack of explicit boundaries at the higher level. To alleviate this issue, we introduce a topic segmentation mechanism that detects implicit topic boundaries and thereby helps the document-level macro discourse parser construct better discourse trees hierarchically. In particular, our parser first splits a document into several sections using the detected topic boundaries. It then builds a smaller and more accurate discourse sub-tree within each section and sequentially combines the sub-trees into a whole tree for the document. Experimental results on both the Chinese MCDTB and English RST-DT datasets show that our proposed method significantly outperforms state-of-the-art baselines.
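The split-then-combine pipeline can be sketched as follows. The real system uses learned models for both topic segmentation and sub-tree construction; here the boundaries are given as input and a trivial right-branching builder stands in for the learned parser, purely to show the control flow.

```python
# Hypothetical sketch of hierarchical tree construction guided by topic
# boundaries. Boundary indices are assumed given; the right-branching
# build_subtree is a placeholder for a learned discourse parser.

def split_by_topics(paragraphs: list, boundaries: list) -> list:
    """Split a paragraph list into sections at the given boundary indices."""
    sections, start = [], 0
    for b in sorted(boundaries):
        sections.append(paragraphs[start:b])
        start = b
    sections.append(paragraphs[start:])
    return [s for s in sections if s]

def build_subtree(units: list):
    """Placeholder parser: combine units into a right-branching binary tree."""
    tree = units[0]
    for u in units[1:]:
        tree = (tree, u)
    return tree

def parse_document(paragraphs: list, boundaries: list):
    """Build a sub-tree per section, then combine sections into one tree."""
    subtrees = [build_subtree(s) for s in split_by_topics(paragraphs, boundaries)]
    return build_subtree(subtrees)
```

The point of the decomposition is that each sub-tree is built over a much smaller span than the whole document, which is where the accuracy gain comes from.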
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations: citations that display the context in which an article is cited and indicate whether the citing article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.