Xiao Han scite author profile

Extreme multi-label classification (XMC) refers to supervised multi-label learning involving hundreds of thousands or even millions of labels. In this paper, we develop a suite of algorithms, called Bonsai, which generalizes the notion of label representation in XMC, and partitions the labels in the representation space to learn shallow trees. We show three concrete realizations of this label representation space including: (i) the input space which is spanned by the input features, (ii) the output space spanned by label vectors based on their co-occurrence with other labels, and (iii) the joint space by combining the input and output representations. Furthermore, the constraint-free multi-way partitions learnt iteratively in these spaces lead to shallow trees. By combining the effect of shallow trees and generalized label representation, Bonsai achieves the best of both worlds-fast training which is comparable to state-of-the-art tree-based methods in XMC, and much better prediction accuracy, particularly on tail-labels. On a benchmark Amazon-3M dataset with 3 million labels, Bonsai outperforms a state-of-the-art one-vs-rest method in terms of prediction accuracy, while being approximately 200 times faster to train. The code for Bonsai is available at https ://githu b.com/xmc-aalto /bonsa i.

show abstract

An RSSI based DV-hop algorithm for wireless sensor networks

Han

Zhang

Wang

et al. 2017

View full text Add to dashboard Cite

Mixed Norm Constrained Sparse APA Algorithm for Satellite and Network Echo Channel Estimation

Jiang

Osman

et al. 2018

IEEE Access

View full text Add to dashboard Cite

The Influence Factors on Young Consumers' Green Purchase Behavior: Perspective Based on Theory of Consumption Value

Wang

Han

Kuang

et al. 2018

View full text Add to dashboard Cite

Blocked Maximum Correntropy Criterion Algorithm for Cluster-Sparse System Identifications

Jiang

Shi

et al. 2019

IEEE Trans. Circuits Syst. II

View full text Add to dashboard Cite

Incremental scene understanding on dense SLAM

Han

Tateno

et al. 2016

View full text Add to dashboard Cite

Experimental demonstration of underwater acoustic communication using bionic signals

Han¹,

Yin²,

Du³

et al. 2014

Applied Acoustics

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Xiao Han

The influence of identity-driven customer engagement on purchase intention

Bonsai: diverse and shallow trees for extreme multi-label classification

An RSSI based DV-hop algorithm for wireless sensor networks

Mixed Norm Constrained Sparse APA Algorithm for Satellite and Network Echo Channel Estimation

The Influence Factors on Young Consumers' Green Purchase Behavior: Perspective Based on Theory of Consumption Value

Blocked Maximum Correntropy Criterion Algorithm for Cluster-Sparse System Identifications

Incremental scene understanding on dense SLAM

Experimental demonstration of underwater acoustic communication using bionic signals

Contact Info

Product

Resources

About