Empirical assessment of machine learning-based malware detectors for Android

Allix, Kevin; Bissyandé, Tegawendé F.; Jérôme, Quentin; Klein, Jacques; State, Radu; Traon, Yves Le

doi:10.1007/s10664-014-9352-6

Cited by 117 publications

(138 citation statements)

References 27 publications

Supporting

Mentioning

131

Contrasting

Order By: Relevance

“…We found that the performance decreases (but still with Fmeasure above 0.86) with the ratio of malware in the set. Such a finding was already shown in Allix et al's large scale empirical study with a different feature set [2]. RQ3:PCLs constitute good features for discriminating malicious apps from benign apps in a Machine learning-based malware detection scheme.…”

Section: Malware Identificationmentioning

confidence: 56%

“…The size of training sets and the parameters we use (e.g., malware/goodware ratio) take different values that appear to be unjustified since, as shown in [2], no survey has determined the appropriate values for malware detection. However, our results show the same trends of that shown in [2].…”

Section: Threats To Validitymentioning

confidence: 99%

“…Allix et al [2] empirically investigated the assessment of machine learning-based malware detectors for Android apps to measure the impact of datasets size and goodware/malware ratio, and the importance of validation scenarios. Our work is related in that we also measure the impact of several parameters and we raise one more factor to take into account when evaluating a malware detection approach: One specific approach may perform well only on a subset of Android applications.…”

Section: Related Workmentioning

confidence: 99%

“…Machine learning techniques, by allowing to sift through large sets of apps to detect malicious apps, appear to be promising for large-scale malware detection and eventually to keep malicious apps from entering app markets [2]. Stateof-the-art machine learning approaches for Android malware detection mainly differ in the feature sets that are considered for training the classifiers.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Potential Component Leaks in Android Apps: An Investigation into a New Feature Set for Malware Detection

Allix

et al. 2015

2015 IEEE International Conference on Software Quality, Reliability and Security

Self Cite

View full text Add to dashboard Cite

Abstract-We discuss the capability of a new feature set for malware detection based on potential component leaks (PCLs). PCLs are defined as sensitive data-flows that involve Android inter-component communications. We show that PCLs are common in Android apps and that malicious applications indeed manipulate significantly more PCLs than benign apps. Then, we evaluate a machine learning-based approach relying on PCLs. Experimental validations show high performance for identifying malware, demonstrating that PCLs can be used for discriminating malicious apps from benign apps.

show abstract

Section: Malware Identificationmentioning

confidence: 56%

Section: Threats To Validitymentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Potential Component Leaks in Android Apps: An Investigation into a New Feature Set for Malware Detection

Allix

et al. 2015

2015 IEEE International Conference on Software Quality, Reliability and Security

Self Cite

View full text Add to dashboard Cite

show abstract

“…It is thus obvious that the performance of the detector is tightly dependent on the quality of the training dataset. Previous works have even shown that the accuracy of such detectors can be degraded by orders of magnitude if the training data is faulty [26]. Following these ndings, one can easily infer that it is also possible to articially improve the performance of malware detectors by selecting a ground truth that splits around malware corner cases.…”

Section: Introductionmentioning

confidence: 88%

On the Lack of Consensus in Anti-Virus Decisions: Metrics and Insights on Building Ground Truths of Android Malware

Hurier

Allix

Bissyandé

et al. 2016

Detection of Intrusions and Malware, and Vulnerability Assessment

Self Cite

View full text Add to dashboard Cite

Abstract. There is generally a lack of consensus in Antivirus (AV) engines' decisions on a given sample. This challenges the building of authoritative ground-truth datasets. Instead, researchers and practitioners may rely on unvalidated approaches to build their ground truth, e.g., by considering decisions from a selected set of Antivirus vendors or by setting up a threshold number of positive detections before classifying a sample.Both approaches are biased as they implicitly either decide on ranking AV products, or they consider that all AV decisions have equal weights.In this paper, we extensively investigate the lack of agreement among AV engines. To that end, we propose a set of metrics that quantitatively describe the dierent dimensions of this lack of consensus. We show how our metrics can bring important insights by using the detection results of 66 AV products on 2 million Android apps as a case study. Our analysis focuses not only on AV binary decision but also on the notoriously hard problem of labels that AVs associate with suspicious les, and allows to highlight biases hidden in the collection of a malware ground trutha foundation stone of any malware detection approach.

show abstract

CDGDroid: Android Malware Detection Based on Deep Learning Using CFG and DFG

Ren

Qin

et al. 2018

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Android malware has become a serious threat in our daily digital life, and thus there is a pressing need to effectively detect or defend against them. Recent techniques have relied on the extraction of lightweight syntactic features that are suitable for machine learning classification, but despite of their promising results, the features they extract are often too simple to characterise Android applications, and thus may be insufficient when used to detect Android malware. In this paper, we propose CDGDroid, an effective approach for Android malware detection based on deep learning. We use the semantics graph representations, that is, control flow graph, data flow graph, and their possible combinations, as the features to characterise Android applications. We encode the graphs into matrices, and use them to train the classification model via Convolutional Neural Network (CNN). We have conducted some experiments on Marvin, Drebin, VirusShare and ContagioDump datasets to evaluate our approach and have identified that the classification model taking the horizontal combination of CFG and DFG as features offers the best performance in terms of accuracy among all combinations. We have also conducted experiments to compare our approach against Yeganeh Safaei et al.'s approach, Allix et al.'s approach, Drebin and many antivirus tools gathered in VirusTotal, and the experimental results have confirmed that our classification model gives a better performance than the others.

show abstract

Empirical assessment of machine learning-based malware detectors for Android

Cited by 117 publications

References 27 publications

Potential Component Leaks in Android Apps: An Investigation into a New Feature Set for Malware Detection

Potential Component Leaks in Android Apps: An Investigation into a New Feature Set for Malware Detection

On the Lack of Consensus in Anti-Virus Decisions: Metrics and Insights on Building Ground Truths of Android Malware

CDGDroid: Android Malware Detection Based on Deep Learning Using CFG and DFG

Contact Info

Product

Resources

About