Detecting PE malware files is now commonly approached using statistical and machine learning models. While these models commonly use features extracted from the structure of PE files, we propose that icons from these files can also help better predict malware. We propose an innovative machine learning approach to extract information from icons. Our proposed approach consists of two steps: 1) extracting icon features using summary statics, histogram of gradients (HOG), and a convolutional autoencoder, 2) clustering icons based on the extracted icon features. Using publicly available data and by using machine learning experiments, we show our proposed icon clusters significantly boost the efficacy of malware prediction models. In particular, our experiments show an average accuracy increase of 10% when icon clusters are used in the prediction model.
We are interested in survival analysis of hemodialysis patients for whom several biomarkers are recorded over time. Motivated by this challenging problem, we propose a general framework for multivariate joint longitudinal-survival modeling that can be used to examine the association between several longitudinally recorded covariates and a time-to-event endpoint. Our method allows for simultaneous modeling of longitudinal covariates by taking their correlation into account. This leads to a more efficient method for modeling their trajectories over time, and hence, it can better capture their relationship to the survival outcomes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.