Xifeng Yan scite author profile

Frequent pattern mining has been a focused theme in data mining research for over a decade. Abundant literature has been dedicated to this research and tremendous progress has been made, ranging from efficient and scalable algorithms for frequent itemset mining in transaction databases to numerous research frontiers, such as sequential pattern mining, structured pattern mining, correlation mining, associative classification, and frequent pattern-based clustering, as well as their broad applications. In this article, we provide a brief overview of the current status of frequent pattern mining and discuss a few promising research directions. We believe that frequent pattern mining research has substantially broadened the scope of data analysis and will have deep impact on data mining methodologies and applications in the long run. However, there are still some challenging research issues that need to be solved before frequent pattern mining can claim a cornerstone approach in data mining applications.

show abstract

CloSpan: Mining: Closed Sequential Patterns in Large Datasets

Yan¹,

Han²,

Afshar³

2003

525

487

View full text Add to dashboard Cite

Discriminative Frequent Pattern Analysis for Effective Classification

et al. 2007

View full text Add to dashboard Cite

show abstract

Mining significant graph patterns by leap search

Yan¹,

et al. 2008

View full text Add to dashboard Cite

Mining coherent dense subgraphs across massive biological networks for functional discovery

Hu¹,

Yan²,

Huang³

et al. 2005

Bioinformatics

282

247

View full text Add to dashboard Cite

show abstract

Integrating meta-path selection with user-guided object clustering in heterogeneous information networks

et al. 2012

View full text Add to dashboard Cite

Real-world, multiple-typed objects are often interconnected, forming heterogeneous information networks. A major challenge for link-based clustering in such networks is its potential to generate many different results, carrying rather diverse semantic meanings. In order to generate desired clustering, we propose to use meta-path, a path that connects object types via a sequence of relations, to control clustering with distinct semantics. Nevertheless, it is easier for a user to provide a few examples ("seeds") than a weighted combination of sophisticated meta-paths to specify her clustering preference. Thus, we propose to integrate meta-path selection with user-guided clustering to cluster objects in networks, where a user first provides a small set of object seeds for each cluster as guidance. Then the system learns the weights for each meta-path that are consistent with the clustering result implied by the guidance, and generates clusters under the learned weights of meta-paths. A probabilistic approach is proposed to solve the problem, and an effective and efficient iterative algorithm, PathSelClus, is proposed to learn the model, where the clustering quality and the meta-path weights are mutually enhancing each other. Our experiments with several clustering tasks in two real networks demonstrate the power of the algorithm in comparison with the baselines.

show abstract

Substructure similarity search in graph databases

Yu²,

2005

View full text Add to dashboard Cite

Synthesizing Near-Optimal Malware Specifications from Suspicious Behaviors

et al. 2010

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Xifeng Yan

Frequent pattern mining: current status and future directions

CloSpan: Mining: Closed Sequential Patterns in Large Datasets

Discriminative Frequent Pattern Analysis for Effective Classification

Mining significant graph patterns by leap search

Mining coherent dense subgraphs across massive biological networks for functional discovery

Integrating meta-path selection with user-guided object clustering in heterogeneous information networks

Substructure similarity search in graph databases

Synthesizing Near-Optimal Malware Specifications from Suspicious Behaviors

Contact Info

Product

Resources

About