Feature Screening for Ultrahigh Dimensional Categorical Data With Applications

Huang, Danyang; Li, Runze; Wang, Hansheng

doi:10.1080/07350015.2013.863158

Cited by 64 publications

(58 citation statements)

References 20 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…() and for categorical variables in, for example, Huang et al. () and Ni and Fang (). For continuous data, the method in Li et al.…”

Section: Methodsmentioning

confidence: 99%

“…One is the sufficient dimension reduction method (Li, 1991;Cook, 1994). The other way is covariate selection or feature screening, which has been developed for continuous variables in, for example, Li et al (2012) and for categorical variables in, for example, Huang et al (2014) and Ni and Fang (2016). For continuous data, the method in Li et al (2012) is based on the following marginal correlation measurement between Y (j) and X (ν) , the νth component of X,…”

Section: Covariate Screeningmentioning

confidence: 99%

See 1 more Smart Citation

Optimal Treatment Assignment to Maximize Expected Outcome with Multiple Treatments

Lou

Shao

2017

Biometrics

View full text Add to dashboard Cite

When there is substantial heterogeneity of treatment effectiveness, it is crucial to identify individualized treatment assignment rules for comparative treatment selection. Traditional approaches directly model clinical outcome and define optimal treatment rule according to the interactions between treatment and covariates. This approach relies on the success of separating the main effects from the covariate-treatment interaction effects, which may not be easy. To overcome this shortcoming, a recent approach, called outcome weighted learning, focuses on building an optimal treatment rule by maximizing the expected clinical outcome related with differential treatments. However, there seems to be a lack of approaches to explicitly deal with three or more treatments. In this article, we propose an outcome weighted learning method that extends estimating individualized treatment rules to multi-treatment case by using a vector hinge loss as a target function. Consistency of the resulting estimator is shown in the article. We demonstrate the performance of our approach in simulation studies and in a real data analysis.

show abstract

“…() and for categorical variables in, for example, Huang et al. () and Ni and Fang (). For continuous data, the method in Li et al.…”

Section: Methodsmentioning

confidence: 99%

Section: Covariate Screeningmentioning

confidence: 99%

Optimal Treatment Assignment to Maximize Expected Outcome with Multiple Treatments

Lou

Shao

2017

Biometrics

View full text Add to dashboard Cite

show abstract

“…When both response and feature variables are categorical, it is not difficult to use a test of independence statistic as marginal utility for feature screening. Huang et al [33] employed the Pearson χ 2 -test statistic for independence as a marginal utility for feature screening. They further established the sure screening procedure of their screening procedure under mild conditions.…”

Section: Model-free Feature Screeningmentioning

confidence: 99%

A selective overview of feature screening for ultrahigh-dimensional data

Liu

Zhong

2015

Sci. China Math.

Self Cite

View full text Add to dashboard Cite

High-dimensional data have frequently been collected in many scientific areas including genomewide association study, biomedical imaging, tomography, tumor classifications, and finance. Analysis of high-dimensional data poses many challenges for statisticians. Feature selection and variable selection are fundamental for high-dimensional data analysis. The sparsity principle, which assumes that only a small number of predictors contribute to the response, is frequently adopted and deemed useful in the analysis of high-dimensional data. Following this general principle, a large number of variable selection approaches via penalized least squares or likelihood have been developed in the recent literature to estimate a sparse model and select significant variables simultaneously. While the penalized variable selection methods have been successfully applied in many high-dimensional analyses, modern applications in areas such as genomics and proteomics push the dimensionality of data to an even larger scale, where the dimension of data may grow exponentially with the sample size. This has been called ultrahigh-dimensional data in the literature. This work aims to present a selective overview of feature screening procedures for ultrahigh-dimensional data. We focus on insights into how to construct marginal utilities for feature screening on specific models and motivation for the need of model-free feature screening procedures.

show abstract

“…Both computational speed and classification accuracy are also expected to be taken into account. For categorical features, statistical test (e.g., Chi-square test) [ 8 , 9 ], information theory (e.g., information gain, mutual information, cross entropy) [ 10 , 11 , 12 , 13 ], and Bayesian methods [ 14 , 15 ] are usually used for feature screening, especially in the field of text classification. In this study, we propose a novel model-free feature screening method called weighted mean squared deviation (WMSD), which can be considered as a simplified version of Chi-square statistic and mutual information.…”

Section: Introductionmentioning

confidence: 99%

Weighted Mean Squared Deviation Feature Screening for Binary Features

Wang

Guan

2020

Entropy

View full text Add to dashboard Cite

In this study, we propose a novel model-free feature screening method for ultrahigh dimensional binary features of binary classification, called weighted mean squared deviation (WMSD). Compared to Chi-square statistic and mutual information, WMSD provides more opportunities to the binary features with probabilities near 0.5. In addition, the asymptotic properties of the proposed method are theoretically investigated under the assumption log p = o ( n ) . The number of features is practically selected by a Pearson correlation coefficient method according to the property of power-law distribution. Lastly, an empirical study of Chinese text classification illustrates that the proposed method performs well when the dimension of selected features is relatively small.

show abstract

Feature Screening for Ultrahigh Dimensional Categorical Data With Applications

Cited by 64 publications

References 20 publications

Optimal Treatment Assignment to Maximize Expected Outcome with Multiple Treatments

Optimal Treatment Assignment to Maximize Expected Outcome with Multiple Treatments

A selective overview of feature screening for ultrahigh-dimensional data

Weighted Mean Squared Deviation Feature Screening for Binary Features

Contact Info

Product

Resources

About