2022
DOI: 10.48550/arxiv.2201.07604
Preprint

Semi-Supervised Clustering with Contrastive Learning for Discovering New Intents

Abstract: Most real-world dialogue systems rely on predefined intents and answers for QA service, so discovering potential intents from a large corpus in advance is important for building such dialogue services. Considering that most scenarios have only a few known intents and many intents waiting to be discovered, we focus on semi-supervised text clustering and aim to make the proposed method benefit from labeled samples for better overall clustering performance. In this paper, we propose Deep Contrastive Semi-supervised Clustering (DCSC)…
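As a rough, hypothetical illustration of the semi-supervised idea sketched in the abstract — a joint objective that benefits from a few labeled samples — the sketch below combines a supervised term with an unsupervised one. This is not the paper's actual DCSC loss; the function name, the entropy-sharpening term, and the weighting are all assumptions:

```python
import torch
import torch.nn.functional as F

def semi_supervised_cluster_loss(cluster_logits, labeled_mask, labels,
                                 unsup_weight=1.0):
    """Illustrative semi-supervised clustering objective (NOT the exact
    DCSC formulation): cross-entropy on the few labeled samples plus an
    entropy-sharpening term on the unlabeled ones.

    cluster_logits: (N, K) soft cluster assignments from a clustering head
    labeled_mask:   (N,) bool, True where a ground-truth intent label exists
    labels:         (N,) long, valid only where labeled_mask is True
    """
    # Supervised term: push labeled utterances into their known intent cluster.
    sup = F.cross_entropy(cluster_logits[labeled_mask], labels[labeled_mask])

    # Unsupervised term: sharpen the soft assignments of unlabeled utterances
    # by minimizing the entropy of their cluster distribution.
    probs = F.softmax(cluster_logits[~labeled_mask], dim=-1)
    ent = -(probs * probs.clamp_min(1e-8).log()).sum(dim=-1).mean()

    return sup + unsup_weight * ent
```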

Cited by 4 publications (3 citation statements; citing papers published in 2022 and 2024).
References 12 publications (33 reference statements).
“…In this learning, which is done through backpropagation, pairwise constraints are used to better learn document representations. Wei et al. [22] present a semi-supervised text clustering method, Deep Contrastive Semi-supervised Clustering (DCSC), in which labeled samples are used to jointly optimize clustering and representation learning. Vilhagra et al. [23] likewise use deep clustering with a convolutional Siamese network to learn data representations under pairwise constraints, with the K-Means algorithm applied for the final unsupervised clustering.…”
Section: Related Work
confidence: 99%
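A minimal sketch of the pattern this excerpt describes — a pairwise-constraint loss trained by backpropagation, followed by K-Means on the learned embeddings. This is illustrative code, not taken from either cited paper; the function names, margin value, and exact loss form are assumptions:

```python
import torch
import torch.nn.functional as F
from sklearn.cluster import KMeans

def pairwise_constraint_loss(emb, must_link, cannot_link, margin=1.0):
    """Contrastive-style loss over pairwise constraints (illustrative).

    emb:         (N, D) document embeddings
    must_link:   list of (i, j) index pairs known to share an intent
    cannot_link: list of (i, j) index pairs known to differ
    """
    loss = emb.new_zeros(())
    for i, j in must_link:                     # pull same-intent pairs together
        loss = loss + (emb[i] - emb[j]).pow(2).sum()
    for i, j in cannot_link:                   # push different-intent pairs apart
        d = (emb[i] - emb[j]).pow(2).sum().sqrt()
        loss = loss + F.relu(margin - d).pow(2)
    return loss / max(1, len(must_link) + len(cannot_link))

# After training the encoder with this loss, run plain K-Means on the
# frozen embeddings to obtain the final clusters.
def cluster(embeddings, k):
    x = embeddings.detach().cpu().numpy()
    return KMeans(n_clusters=k, n_init=10).fit_predict(x)
```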
“…Most existing NID methods (Lin et al., 2020; Zhang et al., 2021; Wei et al., 2022; Zhang et al., 2022; An et al., 2023) adopt a two-stage training strategy: pre-training on labeled data, then learning clustering-friendly representations with pseudo supervisory signals. However, previous methods rely only on semantic similarities to generate supervisory signals, based on the assumption that samples within the feature hypersphere belong to the same category as the hypersphere anchor, e.g.…”
Section: Introduction
confidence: 99%
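The "feature hypersphere" assumption mentioned above can be read as: any unlabeled sample whose embedding is sufficiently similar to an anchor inherits the anchor's category as a pseudo-label. A hedged sketch of such pseudo-signal generation (illustrative only; the function name and threshold tau are assumptions, and the cited methods differ in detail):

```python
import torch
import torch.nn.functional as F

def hypersphere_pseudo_labels(unlabeled_emb, anchor_emb, anchor_labels, tau=0.8):
    """Assign pseudo-labels by nearest anchor, keeping only samples that fall
    inside the anchor's similarity 'hypersphere' (cosine similarity >= tau).

    Returns (indices, pseudo_labels) for the confidently assigned samples.
    """
    u = F.normalize(unlabeled_emb, dim=-1)
    a = F.normalize(anchor_emb, dim=-1)
    sims = u @ a.T                        # (N_unlabeled, N_anchors) cosine sims
    best_sim, best_anchor = sims.max(dim=-1)
    keep = best_sim >= tau                # inside the anchor's hypersphere
    return keep.nonzero(as_tuple=True)[0], anchor_labels[best_anchor[keep]]
```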
“…Contrastive learning has shown impressive results in unsupervised sentence representation learning (Tang et al., 2022; Wei et al., 2022). The fundamental idea is to generate positive and negative pairs via data augmentation (Wei and Zou, 2019) and feed these pairs into a pre-trained model, minimizing the distance between positive pairs while maximizing the distance between negative pairs.…”
confidence: 99%
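The positive/negative-pair recipe this excerpt describes is commonly implemented with an NT-Xent (InfoNCE) objective: two augmented views of each sentence form a positive pair, and all other samples in the batch act as negatives. A minimal PyTorch sketch of that standard loss (not specific to any cited paper):

```python
import torch
import torch.nn.functional as F

def nt_xent(z1, z2, temperature=0.07):
    """NT-Xent / InfoNCE loss for a batch of B sentences.

    z1, z2: (B, D) embeddings of two augmented views of the same sentences.
    View i in z1 and view i in z2 form a positive pair; every other sample
    in the concatenated 2B batch serves as a negative.
    """
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=-1)   # (2B, D)
    sim = z @ z.T / temperature                           # (2B, 2B) similarities
    sim.fill_diagonal_(float("-inf"))                     # exclude self-pairs
    B = z1.size(0)
    idx = torch.arange(B, device=z.device)
    targets = torch.cat([idx + B, idx])                   # positive of i is i+B
    return F.cross_entropy(sim, targets)
```

The temperature controls how sharply the loss concentrates on hard negatives; small values (e.g. 0.05-0.1) are typical in the sentence-embedding literature.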